Uncertainty-Aware Reinforcement Learning for Flight Control

None, None

Uncertainty-Aware Reinforcement Learning for Flight Control

Mastering the Mystery of Flight

Master Thesis (2024)

Author(s)

M. Homola (TU Delft - Aerospace Engineering)

Contributor(s)

E. Van Kampen – Mentor (TU Delft - Control & Simulation)

Y. Li – Mentor (TU Delft - Control & Simulation)

Faculty

Aerospace Engineering

Copyright

Reinforcement Learning Autonomous System Autonomous Flight Control Deep Distributional Reinforcement Learning Uncertainty-Driven Control Safety-Critical Systems Sample Efficient Machine Learning

To reference this document use:

https://resolver.tudelft.nl/uuid:1553d8c9-79df-44dd-8c9f-d2eb9f7017b8

More Info

expand_more

Publication Year

2024

Language

English

Copyright

Graduation Date

19-01-2024

Awarding Institution

Delft University of Technology

Programme

['Aerospace Engineering']

Faculty

Aerospace Engineering

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

In the rapidly evolving aviation sector, the quest for safer and more efficient flight operations has historically relied on traditional Automatic Flight Control Systems (AFCS) based on high-fidelity models. However, such models not only incur high development costs but also struggle to adapt to new, complex aircraft designs and unexpected operational conditions. As an alternative, deep Reinforcement Learning (RL) has emerged as a promising solution for model-free, adaptive flight control. Yet, RL-based approaches pose significant challenges in terms of sample efficiency and safety assurance. Addressing these gaps, this paper introduces Returns Uncertainty-Navigated Distributional Soft Actor-Critic (RUN-DSAC). Designed to enhance the learning efficiency, adaptability, and safety of flight control systems, RUN-DSAC leverages the rich uncertainty information inherent in the returns distribution to refine the decision-making process. When applied to the attitude tracking task on a high-fidelity non-linear fixed-wing aircraft model, RUN-DSAC demonstrates superior performance in learning efficiency, adaptability to varied and unforeseen flight scenarios, and robustness in fault tolerance that outperforms the current state-of-the-art SAC and DSAC algorithms.

Files

Marek_Homola_MSc_Thesis.pdf

(pdf | 32.9 Mb)

License info not available