Reinforcement Learning based Algorithm with Safety Handling and Risk Perception

Conference Paper (2016)
Author(s)

S. Shyamsundar

T Mannucci (TU Delft - Control & Simulation)

Erik-Jan van Kampen (TU Delft - Control & Simulation)

Research Group
Control & Simulation
Copyright
© 2016 S. Shyamsundar, T. Mannucci, E. van Kampen
DOI related publication
https://doi.org/10.1109/SSCI.2016.7849367
More Info
expand_more
Publication Year
2016
Language
English
Copyright
© 2016 S. Shyamsundar, T. Mannucci, E. van Kampen
Research Group
Control & Simulation

Abstract

Navigation in an unknown or uncertain environment is a challenging task for an autonomous agent. The agent is expected to behave independently and to learn the suitable action to take for a given situation. Reinforcement Learning could be used to help the agent adapt to an unknown environment and learn the right actions to take. This paper presents the setup and the results of a reinforcement learning problem utilizing Q-learning and a Safety Handling Exploration with Risk Perception Algorithm (SHERPA) for safe exploration in an unknown environment. The agent has to explore its environment safely and must learn the optimal action for a given situation from the feedback received from the environment. The results show that the agent can learn a value function converged to within 10% of the optimal values after 5000 iterations. The simulation results show that the proposed approach ensures that the agent explores an unknown environment safely and learns the desirable actions for a given situation.

No files available

Metadata only record. There are no files for this record.