Exploring Reinforcement Learning for Constrained Wing Shape Optimization

Master thesis (2024)

Authors

N.E. van Putten Aerospace Engineering

Contributors

Carmine Varriale Flight Performance and Propulsion - Aerospace Engineering (mentor)

K. Swannet Flight Performance and Propulsion - Aerospace Engineering (graduation committee member)

Faculty

Aerospace Engineering, Aerospace Engineering

More Info

expand_more

To reference this document use:

http://resolver.tudelft.nl/uuid:eaeb85d2-b44a-45ad-9646-bf9d2dca8cd4

Published Date

06-06-2024

Language

English

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Faculty

Aerospace Engineering

Abstract

In this paper, the Proximal Policy Optimization (PPO) algorithm is used to perform a constrained wing shape optimization. The PPO algorithm is a Machine Learning (ML) algorithm that improves itself by repeatedly performing the same optimization and learning from its results. The complete adaptation of the PPO framework to the design problem is detailed and evaluated. Not only was the PPO framework able to consistently optimize the wing 4% further than the Particle Swarm Optimization (PSO) algorithm, it was able to do so 35 times faster once the model is fully trained. The PPO framework was able to find more efficient wing shapes than the PSO framework. The trained PPO model was able to optimize the wing of other similar aircraft, even without direct retraining. These results illustrate that PPO could be a promising technique for automated aerospace design problems. Due to the significant training time of the ML approach, the PPO algorithm is not an effective replacement of traditional optimization algorithms for design problems where only a single optimization is required.

Files

TU_Delft_Thesis_NEvanPutten.pd... (.pdf)

(.pdf | 17.2 Mb)