33To accomplish this objective, the thesis must carry out the following tasks:
- model the skill as a DMP,
- select an RL algorithm for improving the modelled skill,
- evaluate the safety of the exploration method.
34To test the effectiveness of the new approach for generating safe trajectories works, a challenging enough real-world task is needed for experimentation.
35For this purpose, the thesis will use the ball-in-a-cup game, as this game can be seen as a benchmark problem in robotics [50].
36The game consists of a ball attached to a string, which is then attached to the bottom of a cup.
37The objective of the game is to get the ball into the cup.
38This is achieved by inducing a movement on the ball that quickly moves the cup back and forth and then pulls it up and moves the cup under the ball.
39Although the game is simple by nature, it is not that easy to learn, as it requires fast, precise movement, and even small changes in the movement can have a drastic impact on the trajectory of the ball.
40The ball-in-a-cup game is of interest, as a robot cannot get the ball into the cup by merely executing the movement produced by the initially learned model, thus requiring subsequent RL to successfully master the skill.