Safe Reinforcement Learning in Flight Control

None, None

Safe Reinforcement Learning in Flight Control

Introduction to Safe Incremental Dual Heuristic Programming

Master Thesis (2020)

Author(s)

R.R. Feith (TU Delft - Aerospace Engineering)

Contributor(s)

Erik Jan Kampen – Mentor (TU Delft - Control & Simulation)

Faculty

Aerospace Engineering

Copyright

To reference this document use:

https://resolver.tudelft.nl/uuid:07f53daa-b236-4bb9-b010-bc6654383744

More Info

expand_more

Publication Year

2020

Language

English

Copyright

Graduation Date

30-01-2020

Awarding Institution

Delft University of Technology

Programme

['Aerospace Engineering']

Faculty

Aerospace Engineering

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Online continuous reinforcement learning has shown promising result in flight control achieving near optimal control within seconds and the capability to adapt to sudden changes in the environment. However no guarantees about safety can be given, needed for use in general aviation. Furthermore performance is often dependent on the precise tuning of hyperparameters inside the system. As a new initiative in providing safety guarantees Safe Incremental Dual Heuristic Programming (SIDHP) is presented. SIDHP combines the fast learning speed of Incremental Dual Heuristic Programming (IDHP) with a safety layer, able to keep the aircraft within a predetermined safe flight envelope. SIDHP is demonstrated and compared to IDHP using a high fidelity flight simulation of a Cessna Citation-II in three separate experiments. SIDHP shows to be more robust with respect to changing hyperparameters compared to IDHP and results in less failures overall.

Files

Thesis_Rick_Feith_4218272.pdf

(pdf | 10.5 Mb)

License info not available