Safe Reinforcement Learning in Flight Control

Introduction to Safe Incremental Dual Heuristic Programming

Master Thesis (2020)
Author(s)

R.R. Feith (TU Delft - Aerospace Engineering)

Contributor(s)

Erik Jan Kampen – Mentor (TU Delft - Control & Simulation)

Faculty
Aerospace Engineering
Copyright
© 2020 Rick Feith
More Info
expand_more
Publication Year
2020
Language
English
Copyright
© 2020 Rick Feith
Graduation Date
30-01-2020
Awarding Institution
Delft University of Technology
Programme
['Aerospace Engineering']
Faculty
Aerospace Engineering
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Online continuous reinforcement learning has shown promising result in flight control achieving near optimal control within seconds and the capability to adapt to sudden changes in the environment. However no guarantees about safety can be given, needed for use in general aviation. Furthermore performance is often dependent on the precise tuning of hyperparameters inside the system. As a new initiative in providing safety guarantees Safe Incremental Dual Heuristic Programming (SIDHP) is presented. SIDHP combines the fast learning speed of Incremental Dual Heuristic Programming (IDHP) with a safety layer, able to keep the aircraft within a predetermined safe flight envelope. SIDHP is demonstrated and compared to IDHP using a high fidelity flight simulation of a Cessna Citation-II in three separate experiments. SIDHP shows to be more robust with respect to changing hyperparameters compared to IDHP and results in less failures overall.

Files

License info not available