Exploring Reinforcement Learning for Constrained Wing Shape Optimization

None, None

Exploring Reinforcement Learning for Constrained Wing Shape Optimization

Master Thesis (2024)

Author(s)

N.E. van Putten (TU Delft - Aerospace Engineering)

Contributor(s)

Carmine Varriale – Mentor (TU Delft - Aerospace Engineering)

K. Swannet – Graduation committee member (TU Delft - Aerospace Engineering)

Faculty

Aerospace Engineering

Reinforcement Learning (RL) Machine Learning (ML) Transfer Learning Particle Swarm Optimization MDAO Aerodynamic Shape Optimisation

To reference this document use

https://resolver.tudelft.nl/uuid:eaeb85d2-b44a-45ad-9646-bf9d2dca8cd4

More Info

expand_more

Publication Year

2024

Language

English

Graduation Date

06-06-2024

Awarding Institution

Delft University of Technology

Programme

Aerospace Engineering

Faculty

Aerospace Engineering

Downloads counter

468

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

In this paper, the Proximal Policy Optimization (PPO) algorithm is used to perform a constrained wing shape optimization. The PPO algorithm is a Machine Learning (ML) algorithm that improves itself by repeatedly performing the same optimization and learning from its results. The complete adaptation of the PPO framework to the design problem is detailed and evaluated. Not only was the PPO framework able to consistently optimize the wing 4% further than the Particle Swarm Optimization (PSO) algorithm, it was able to do so 35 times faster once the model is fully trained. The PPO framework was able to find more efficient wing shapes than the PSO framework. The trained PPO model was able to optimize the wing of other similar aircraft, even without direct retraining. These results illustrate that PPO could be a promising technique for automated aerospace design problems. Due to the significant training time of the ML approach, the PPO algorithm is not an effective replacement of traditional optimization algorithms for design problems where only a single optimization is required.

Files

TU_Delft_Thesis_NEvanPutten.pd... (pdf)

(pdf | 17.2 Mb)

License info not available