Using Explainable Artificial Intelligence to Improve Transparency of Reinforcement Learning for Online Adaptive Flight Control

None, None

Using Explainable Artificial Intelligence to Improve Transparency of Reinforcement Learning for Online Adaptive Flight Control

Breaking Open the Black Box

Master Thesis (2022)

Author(s)

J.A.J. van Zijl (TU Delft - Aerospace Engineering)

Contributor(s)

Erik-Jan Van Kampen – Mentor (TU Delft - Control & Simulation)

Tiago Miguel Monteiro Monteiro Nunes – Mentor (TU Delft - Control & Simulation)

Faculty

Aerospace Engineering

Copyright

Deep learning Machine learning Online learning Black box Explainable artificial intelligence Adaptive control SHAP Reinforcement learning (RL)

To reference this document use:

https://resolver.tudelft.nl/uuid:66a5fdba-6508-4b6f-b20f-c1766ec4fc7f

More Info

expand_more

Publication Year

2022

Language

English

Copyright

Graduation Date

10-02-2022

Awarding Institution

Delft University of Technology

Programme

Aerospace Engineering

Faculty

Aerospace Engineering

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Deep Reinforcement Learning (DRL) shows great potential for flight control, due to its adaptability, fault-tolerance, and as it does not require an accurate system model. However, these techniques, like many machine learning applications, are considered black-box as their inner workings are hidden. This paper aims to break open the black box of RL for adaptive flight control by applying Shapley Additive Explanations (SHAP). The generated explanations are aimed at control experts, but can be useful for anyone interested in RL for adaptive flight control. This research proposes a novel Constant Weight Segment Detection (CWSD) algorithm, facilitating the use of eXplainable Artificial Intelligence techniques to adaptive RL. The algorithm and its usefulness are tested on an Adaptive Critic Design controlling a high-fidelity model of a Cessna Citation aircraft. It is demonstrated that SHAP in combination with CWSD provides detailed and useful insights into the relation between input and output of the RL algorithm. Using SHAP, linear relations between input and output are discovered, simplifying the understanding of the learned strategy.

Files

MScThesis_JAJvanZijl.pdf

(pdf | 6.43 Mb)

License info not available