Uncertainty-Aware Reinforcement Learning for Flight Control

Mastering the Mystery of Flight

Master Thesis (2024)
Author(s)

M. Homola (TU Delft - Aerospace Engineering)

Contributor(s)

E. Van Kampen – Mentor (TU Delft - Control & Simulation)

Y. Li – Mentor (TU Delft - Control & Simulation)

Faculty
Aerospace Engineering
Copyright
© 2024 Marek Homola
More Info
expand_more
Publication Year
2024
Language
English
Copyright
© 2024 Marek Homola
Graduation Date
19-01-2024
Awarding Institution
Delft University of Technology
Programme
['Aerospace Engineering']
Faculty
Aerospace Engineering
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

In the rapidly evolving aviation sector, the quest for safer and more efficient flight operations has historically relied on traditional Automatic Flight Control Systems (AFCS) based on high-fidelity models. However, such models not only incur high development costs but also struggle to adapt to new, complex aircraft designs and unexpected operational conditions. As an alternative, deep Reinforcement Learning (RL) has emerged as a promising solution for model-free, adaptive flight control. Yet, RL-based approaches pose significant challenges in terms of sample efficiency and safety assurance. Addressing these gaps, this paper introduces Returns Uncertainty-Navigated Distributional Soft Actor-Critic (RUN-DSAC). Designed to enhance the learning efficiency, adaptability, and safety of flight control systems, RUN-DSAC leverages the rich uncertainty information inherent in the returns distribution to refine the decision-making process. When applied to the attitude tracking task on a high-fidelity non-linear fixed-wing aircraft model, RUN-DSAC demonstrates superior performance in learning efficiency, adaptability to varied and unforeseen flight scenarios, and robustness in fault tolerance that outperforms the current state-of-the-art SAC and DSAC algorithms.

Files

Marek_Homola_MSc_Thesis.pdf
(pdf | 32.9 Mb)
License info not available