Benchmarks#
In order to make future research results comparable, we propose several task-specific benchmarks that allow users to measure the performance of suggested motion planning algorithms for object manipulation and trajectory planning in the context of magnetic levitation systems. To simplify measurements for users, MagBotSim includes auxiliary functions (see Benchmark Utils) to measure specific properties, e.g. corrective movements. In the following, \(n_1, n_2, n_3, n_4\in\mathbb{N}\) denote the total number of goals or the number of successfully reached goals used in the measurements.
Note
Since MagBotSim currently contains examples for object pushing and trajectory planning, the following benchmarks focus on these two tasks. However, we will add new environments to MagBotSim in the future and update the benchmarks accordingly.
Benchmarks Object Pushing#
We propose the following benchmarks for object pushing tasks in which the object has to be pushed to a specific goal. The goal can either be a position or a pose (position and orientation). We distinguish two types of corrective movements, as suggested in our previous work Precision-Focused Reinforcement Learning Model for Robotic Object Pushing: overshoot and distance corrections. If the distance between the object and target is smaller than the position threshold in time step \(t\) and larger than the position threshold in the next time step \(t+1\), a corrective movement is necessary to push the object back to the target area. Therefore, these corrections are referred to as overshoot corrections. If the distance between the object and goal increases within one time step and decreases later within the episode, this correction is considered as the second type of corrective movement, i.e. a distance correction. Our latest results can be found here: Latest Object Pushing Results.
Metric |
Measurement |
|---|---|
Success Rate |
Number of successfully reached goals divided by the total number of goals \(n_1\). |
Throughput |
Number of successfully reached goals \(n_2\) divided by the time required to reach these goals. |
Overshoot Corrections |
Mean number of overshoot corrections for successfully reached goals \(n_2\). |
Distance Corrections |
Mean number of distance corrections for successfully reached goals \(n_2\). Can be measured using Corrective Movement Measurement (Overshoot and Distance Corrections). |
Collisions |
Mean number of collisions for the total number of goals \(n_1\) (including any obstacle - object collisions). Of these:
|
We have created 10 benchmark environments (B0-B9) for pushing tasks, each with a defined number of movers and tile configurations.
Benchmark Environment |
Environment Image |
Environment Shape (in Tiles) |
Number of Movers |
|---|---|---|---|
B0 |
|
(3, 3) |
1 |
B1 |
|
(4, 3) |
1 |
B2 |
|
(8, 5) |
1 |
B3 |
|
(7, 7) |
2 |
B4 |
|
(8, 6) |
2 |
B5 |
|
(11, 6) |
2 |
B6 |
|
(8, 6) |
2 |
B7 |
|
(7, 7) |
2 |
B8 |
|
(8, 6) |
2 |
B9 |
|
(8, 8) |
2 |
Benchmarks Trajectory Planning#
We propose the following benchmarks for trajectory planning tasks in which one or typically many movers have to reach varying goals without leaving tiles or colliding with static or dynamic obstacles. Similar to the object pushing task, a mover-specific goal can either be a position or a pose (position and orientation). \(n\in\mathbb{N}\) denotes the number of movers. Our latest results can be found here: Latest Trajectory Planning Results.
Metric |
Measurement |
|---|---|
Success Rate |
Number of successfully reached goals divided by the total number of goals \(n_1\). |
Makespan |
Number of milliseconds required by the slowest mover to reach the total number of goals \(n_2\). |
Throughput |
Number of milliseconds required by all movers to reach the total number of goals \(n_3\). |
Collisions |
Number of collisions for the total number of goals \(n_1\). Of these:
|
Smoothness |
Mean weighted sum of jerk, acceleration, and velocity of all movers within the total time period \(t\in\mathbb{N}\). Can be measured Energy Efficiency Measurement. |
Process Time |
Process time required by all movers to successfully reach the total number of goals \(n_4\). |
We have created 10 benchmark environments (B0-B9) for trajectory planning tasks, each with a defined number of movers and tile configurations.
Benchmark Environment |
Environment Image |
Environment Shape (in Tiles) |
Number of Movers |
|---|---|---|---|
B0 |
|
(4, 3) |
3 |
B1 |
|
(4, 3) |
3 |
B2 |
|
(8, 5) |
4 |
B3 |
|
(7, 7) |
4 |
B4 |
|
(8, 6) |
4 |
B5 |
|
(11, 6) |
3 |
B6 |
|
(8, 6) |
4 |
B7 |
|
(7, 7) |
4 |
B8 |
|
(8, 6) |
5 |
B9 |
|
(8, 8) |
5 |