Human demonstrations for fast and safe exploration in reinforcement learning

Conference Paper (2017)
Author(s)

G.K. Schonebaum (TU Delft - Education AE)

Jaime Junell (TU Delft - Control & Simulation)

Erik Jan Van Kampen (TU Delft - Control & Simulation)

Research Group
Control & Simulation
Copyright
© 2017 G.K. Schonebaum, J. Junell, E. van Kampen
DOI related publication
https://doi.org/10.2514/6.2017-1069
More Info
expand_more
Publication Year
2017
Language
English
Copyright
© 2017 G.K. Schonebaum, J. Junell, E. van Kampen
Research Group
Control & Simulation
ISBN (electronic)
9781624104497
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Reinforcement learning is a promising framework for controlling complex vehicles with a high level of autonomy, since it does not need a dynamic model of the vehicle, and it is able to adapt to changing conditions. When learning from scratch, the performance of a reinforcement learning controller may initially be poor and -for real life applications- unsafe. In this paper the effects of using human demonstrations on the performance of reinforcement learning is investigated, using a combination of offline and online least squares policy iteration. It is found that using the human as an efficient explorer improves learning time and performance for a benchmark reinforcement learning problem. The benefit of the human demonstration is larger for problems where the human can make use of its understanding of the problem to efficiently explore the state space. Applied to a simplified quadrotor slung load drop off problem, the use of human demonstrations reduces the number of crashes during learning. As such, this paper contributes to safer and faster learning for model-free, adaptive control problems.

Files

PaperGerbenSchonebaum.pdf
(pdf | 3.3 Mb)
- Embargo expired in 31-01-2018
License info not available