
I am working on a project to implement a collision avoidance algorithm on a real unmanned aerial vehicle (UAV).

I'm interested in understanding how to set up a negative reward for scenarios in which the UAV crashes. This is very easy in simulation: if the UAV touches any object, the episode terminates and a negative reward is given. In the real world, a UAV crash usually means hitting a wall or an obstacle, which is difficult to model.
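The simulation case described above can be sketched as a gym-style environment whose `step` terminates with a fixed penalty on contact. All names and values here (the 1-D dynamics, the obstacle location, the `-100` penalty) are illustrative assumptions, not from any specific simulator.

```python
# Minimal sketch of crash handling in simulation: the episode ends
# immediately with a negative reward when the UAV touches an obstacle.
class UAVSimEnv:
    CRASH_REWARD = -100.0  # assumed penalty magnitude; tune per task

    def __init__(self):
        self.position = 0.0
        self.obstacle = 5.0  # hypothetical obstacle location (1-D toy dynamics)

    def reset(self):
        self.position = 0.0
        return self.position

    def step(self, action):
        self.position += action
        if self.position >= self.obstacle:
            # Collision: terminate the episode and return the crash penalty.
            return self.position, self.CRASH_REWARD, True, {"crash": True}
        # Otherwise, a small positive reward for surviving the step.
        return self.position, 1.0, False, {"crash": False}
```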

My initial plan is to stop the RL episode and manually feed a negative reward to the algorithm each time a crash occurs. Any improvements to this plan would be highly appreciated!
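For concreteness, the plan above could look like the training loop below: an operator (or a crash sensor) flags a crash, the loop overrides the reward for the final transition and terminates the episode manually. Every name here (`crash_detected`, `agent.observe`, the penalty value) is a hypothetical placeholder, not the API of any particular RL library.

```python
# Hedged sketch of manually injecting a crash penalty during real-world training.
CRASH_PENALTY = -100.0  # assumed value; tune relative to the task's other rewards

def run_episode(env, agent, crash_detected):
    """Run one episode; crash_detected() returns True once a crash is flagged."""
    obs = env.reset()
    done = False
    while not done:
        action = agent.act(obs)
        next_obs, reward, done, info = env.step(action)
        if crash_detected():
            # Override the reward and cut the episode short on a real crash.
            reward, done = CRASH_PENALTY, True
        agent.observe(obs, action, reward, next_obs, done)
        obs = next_obs
```

One practical note on this design: the override happens before `agent.observe`, so the learner sees the crash transition with the penalty attached, which is what makes the terminal state informative.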

nbro
desert_ranger
  • This is a bad idea, primarily because RL takes many, many episodes, which is not practical on a real device. You're better off planning with classical MPC for trajectory generation. – FourierFlux May 21 '20 at 23:52

0 Answers