Reinforcement Learning based Algorithm with Safety Handling and Risk Perception

Conference Paper (2016)
Author(s)

S. Shyamsundar

Tommaso Mannucci (TU Delft - Aerospace Engineering)

Erik-Jan van Kampen (TU Delft - Aerospace Engineering)

Research Group
Control & Simulation
DOI related publication
https://doi.org/10.1109/SSCI.2016.7849367 Final published version
More Info
expand_more
Publication Year
2016
Language
English
Research Group
Control & Simulation
Event
2016 IEEE Symposium Series on Computational Intelligence (2016-10-06 - 2016-10-09), Athens, Greece
Downloads counter
302
Collections
Institutional Repository

Abstract

Navigation in an unknown or uncertain environment is a challenging task for an autonomous agent. The agent is expected to behave independently and to learn the suitable action to take for a given situation. Reinforcement Learning could be used to help the agent adapt to an unknown environment and learn the right actions to take. This paper presents the setup and the results of a reinforcement learning problem utilizing Q-learning and a Safety Handling Exploration with Risk Perception Algorithm (SHERPA) for safe exploration in an unknown environment. The agent has to explore its environment safely and must learn the optimal action for a given situation from the feedback received from the environment. The results show that the agent can learn a value function converged to within 10% of the optimal values after 5000 iterations. The simulation results show that the proposed approach ensures that the agent explores an unknown environment safely and learns the desirable actions for a given situation.