Hierarchical Reinforcement Learning for Model-Free Flight Control
A sample-efficient tabular approach using Q(lambda)-learning and options in a traditional flight control structure
J.M. Hoogvliet (TU Delft - Aerospace Engineering)
EJ van Kampen – Mentor (TU Delft - Control & Simulation)
Abstract
Reinforcement learning (RL) is a model-free adaptive approach for learning a non-linear control law for flight control. However, for flat RL (FRL) the size of the search space grows exponentially with the number of state variables, resulting in low sample efficiency. This research aims to improve that efficiency with Hierarchical Reinforcement Learning (HRL). Performance, in terms of the number of samples and the mean tracking error, is evaluated on an altitude reference tracking task using a simulated F16 aircraft model. FRL serves as the performance baseline. HRL is used to define a three-level learning structure that re-uses an existing flight control structure. Finally, the options framework is combined with HRL to add temporal abstraction. It is shown that re-using the flight control structure makes the learning process more sample efficient. Adding options further increases this efficiency, but does not lead to better tracking performance.
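
To make the two core ingredients of the abstract concrete, below is a minimal sketch (not the thesis implementation) of a tabular Watkins's Q(lambda) update with eligibility traces, together with a bare-bones "option" wrapper illustrating temporal abstraction. All names, sizes, and hyperparameters are illustrative assumptions.

```python
import numpy as np

n_states, n_actions = 100, 5            # assumed discretised state/action grid
alpha, gamma, lam, epsilon = 0.1, 0.99, 0.9, 0.1

Q = np.zeros((n_states, n_actions))     # tabular action-value estimates
E = np.zeros_like(Q)                    # eligibility traces

def epsilon_greedy(s):
    """Behaviour policy: explore with probability epsilon, else act greedily."""
    if np.random.rand() < epsilon:
        return np.random.randint(n_actions)
    return int(np.argmax(Q[s]))

def q_lambda_step(s, a, r, s_next):
    """One Watkins's Q(lambda) backup for the transition (s, a, r, s_next)."""
    a_star = int(np.argmax(Q[s_next]))              # greedy successor action
    delta = r + gamma * Q[s_next, a_star] - Q[s, a] # one-step TD error
    E[s, a] += 1.0                                  # accumulating trace
    Q[:] += alpha * delta * E                       # propagate TD error along traces
    a_next = epsilon_greedy(s_next)
    if a_next == a_star:
        E[:] *= gamma * lam                         # decay traces while acting greedily
    else:
        E[:] = 0.0                                  # cut traces after an exploratory action
    return a_next

class Option:
    """Temporally extended action: an inner policy run until a termination test fires."""
    def __init__(self, policy, terminate):
        self.policy = policy        # maps state -> primitive action
        self.terminate = terminate  # maps state -> bool (end the option)
```

In an HRL setting of the kind described above, a higher-level learner would pick among such options (e.g. attitude- or rate-tracking sub-controllers mirroring an existing flight control loop structure), while each option's inner policy issues primitive actions until its termination condition is met.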