Continuous state and action Q-learning framework applied to quadrotor UAV control


Abstract

This paper describes the implementation of a reinforcement learning-based framework for the control of a multi-copter rotorcraft. The controller is based on continuous state and action Q-learning, with the policy stored in a radial basis function (RBF) neural network. Distance-based neuron activation is used to improve the computational performance of the generalization algorithm. Training proceeds offline, using a reduced-order model of the controlled system; the model is identified and stored in the form of a neural network. The framework also incorporates a dynamics-inversion controller based on the identified model. Simulated flight tests confirm the controller's ability to track the reference state signal and to outperform a conventional proportional-derivative (PD) controller. The contributions of the developed framework are a computationally efficient method for storing a Q-function generalization, continuous action selection based on a local Q-function approximation, and a combination of model identification and offline learning for inner-loop control of a UAV system.
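
To make the storage and action-selection ideas concrete, the sketch below shows one way an RBF Q-function approximator with distance-based neuron activation could be realized. It is a minimal illustration under assumptions of our own: the Gaussian basis, the hard activation cutoff, the gradient-style temporal-difference update, and the candidate-sampling action search are hypothetical choices, not the paper's implementation.

import numpy as np

class RBFQFunction:
    # Radial basis function approximator for Q(state, action).
    # Neurons farther than `cutoff` from the query point are skipped,
    # one possible reading of the distance-based activation described
    # in the abstract.
    def __init__(self, centers, sigma, cutoff, lr=0.05):
        self.centers = np.asarray(centers, dtype=float)  # (N, dim) centers over the joint (state, action) space
        self.weights = np.zeros(len(self.centers))       # one output weight per neuron
        self.sigma = sigma                               # RBF width
        self.cutoff = cutoff                             # activation radius
        self.lr = lr                                     # learning rate

    def _phi(self, x):
        # Gaussian activations, zeroed outside the cutoff radius for speed
        d = np.linalg.norm(self.centers - x, axis=1)
        phi = np.exp(-(d / self.sigma) ** 2)
        phi[d > self.cutoff] = 0.0
        return phi

    def value(self, state, action):
        x = np.concatenate([state, action])
        return float(self._phi(x) @ self.weights)

    def update(self, state, action, td_target):
        # One gradient step of the weights toward the TD target
        x = np.concatenate([state, action])
        phi = self._phi(x)
        td_error = td_target - phi @ self.weights
        self.weights += self.lr * td_error * phi

def select_action(q, state, candidates):
    # Greedy continuous action choice over sampled candidate actions,
    # evaluated with the local Q-function approximation (assumed scheme)
    values = [q.value(state, a) for a in candidates]
    return candidates[int(np.argmax(values))]

Because only neurons inside the cutoff radius contribute, each query touches a small neighborhood of the network rather than every neuron, which is the kind of computational saving the abstract attributes to distance-based activation.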