Safe Reinforcement Learning in Flight Control

Introduction to Safe Incremental Dual Heuristic Programming

Master thesis (2020)

Authors

R.R. Feith Aerospace Engineering

Contributors

E. van Kampen (supervisor 1)

Faculty

Aerospace Engineering

More Info

expand_more

To reference this document use:

http://resolver.tudelft.nl/uuid:07f53daa-b236-4bb9-b010-bc6654383744

Published Date

30-01-2020

Language

English

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Faculty

Aerospace Engineering

Abstract

Online continuous reinforcement learning has shown promising result in flight control achieving near optimal control within seconds and the capability to adapt to sudden changes in the environment. However no guarantees about safety can be given, needed for use in general aviation. Furthermore performance is often dependent on the precise tuning of hyperparameters inside the system. As a new initiative in providing safety guarantees Safe Incremental Dual Heuristic Programming (SIDHP) is presented. SIDHP combines the fast learning speed of Incremental Dual Heuristic Programming (IDHP) with a safety layer, able to keep the aircraft within a predetermined safe flight envelope. SIDHP is demonstrated and compared to IDHP using a high fidelity flight simulation of a Cessna Citation-II in three separate experiments. SIDHP shows to be more robust with respect to changing hyperparameters compared to IDHP and results in less failures overall.

Files

Thesis_Rick_Feith_4218272.pdf

(.pdf | 10.5 Mb)