High-Dimensional Optimal State-Feedback Mapping using Deep Neural Networks for Agile Quadrotor Flight

For most robotics applications, optimal control remains a promising approach to complex control tasks. One example is the time-optimal flight of Micro Air Vehicles (MAVs), where strict onboard computational constraints prevent solving such optimization problems in real time. Recent work on deep neural networks for guidance and control (G&CNets) has shown that these biologically inspired models approximate the optimal control solution well while requiring a fraction of the computational cost. Although previous attempts resulted in successful flight tests, training occurred on large-scale datasets generated from a 3-DoF model. Since a more refined model increases the dataset generation time, in this work we show that G&CNets trained on small datasets can mimic the optimal control solution of a full 6-DoF quadrotor model. The cost function used in the generation process penalizes the altitude error and mixes time-optimal and power-optimal objectives weighted by a varying homotopy parameter. Trained networks output the vertical thrust command and body rates based on the vehicle's position, velocity, and attitude. The proposed controller transfers well onboard across different flight scenarios: (i) longitudinal, lateral, and diagonal flight; (ii) hovering with and without disturbances; and (iii) a waypoint-tracking experiment. A Monte-Carlo test campaign demonstrates that G&CNets trained on small datasets achieve results similar to networks trained on 100 times more samples. To the best of our knowledge, this work is the first implementation of a high-dimensional G&CNet in the control loop of a real MAV.
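The state-to-command mapping described above can be sketched as a small feedforward network: the input collects position, velocity, and attitude (here a quaternion, giving 10 dimensions), and the output is the vertical thrust command plus three body rates. The layer sizes, activation choice, and random weights below are illustrative assumptions, not the architecture or trained parameters from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical dimensions: 3 (position) + 3 (velocity) + 4 (quaternion) -> 10,
# and 1 (vertical thrust) + 3 (body rates) -> 4.
STATE_DIM, HIDDEN, CONTROL_DIM = 10, 64, 4

# Randomly initialized weights stand in for the trained G&CNet parameters.
W1 = rng.standard_normal((STATE_DIM, HIDDEN)) * 0.1
b1 = np.zeros(HIDDEN)
W2 = rng.standard_normal((HIDDEN, CONTROL_DIM)) * 0.1
b2 = np.zeros(CONTROL_DIM)

def gcnet(state: np.ndarray) -> np.ndarray:
    """Map the quadrotor state to control commands.

    A single hidden layer is used here purely for illustration; the
    actual network depth and width are not specified in this sketch.
    """
    h = np.tanh(state @ W1 + b1)  # nonlinear hidden layer
    return h @ W2 + b2            # thrust command + body rates

state = np.zeros(STATE_DIM)  # placeholder state (e.g., hover at the origin)
command = gcnet(state)
print(command.shape)  # (4,)
```

In deployment, such a network would be evaluated once per control step, replacing an online optimal control solver with a single forward pass.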