Fleet Planning Under Demand Uncertainty
A Reinforcement Learning Approach
M.C.T.C. de Koning (TU Delft - Aerospace Engineering)
B.F. Santos – Mentor (TU Delft - Air Transport & Operations)
Abstract
This work proposes a model-free reinforcement learning approach to solve a long-term fleet planning problem subject to air-travel demand uncertainty. The aim is to develop a dynamic fleet policy that adapts over time through intermediate assessments of the airline and network states. A Deep Q-network is trained to estimate the optimal fleet decisions given the airline and network conditions. An end-to-end learning set-up is developed, in which an optimisation algorithm evaluates each fleet decision by comparing the profit of the optimal fleet solution to the profit of the estimated fleet solution. The stochastic evolution of air-travel demand is sampled with an adaptation of the mean-reverting Ornstein-Uhlenbeck process, in which the demand growth on each route is adjusted for the general network-demand growth to capture network-wide trends. A case study is presented for three demand scenarios for a small airline operating on a domestic US airport network. The results show, first, that the Deep Q-network improves its predictions of the fleet-decision values by taking the air-travel demand as an input state, and second, that the trained fleet policy generates near-optimal fleet solutions, performing comparably to a deterministic optimisation benchmark.
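To make the learning set-up concrete, the sketch below shows what a Deep Q-network and its temporal-difference update could look like for this problem. The abstract does not specify the architecture or training details, so the state encoding (per-route demand plus fleet composition), the layer sizes, the discrete action space, and all names (`FleetDQN`, `td_update`, `hidden`, `gamma`) are illustrative assumptions rather than the thesis implementation; in the end-to-end set-up, the reward fed to the update would be the profit signal returned by the fleet-optimisation algorithm.

```python
import torch
import torch.nn as nn

class FleetDQN(nn.Module):
    """Minimal Q-network: maps the airline/network state (e.g.
    per-route demand plus current fleet counts) to one Q-value per
    discrete fleet action (e.g. acquire/retire an aircraft type).
    Layer sizes are illustrative assumptions."""
    def __init__(self, state_dim: int, n_actions: int, hidden: int = 128):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(state_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
            nn.Linear(hidden, n_actions),
        )

    def forward(self, state: torch.Tensor) -> torch.Tensor:
        return self.net(state)

def td_update(q_net, target_net, optimiser, batch, gamma=0.99):
    """One temporal-difference step on a replay batch.

    `rewards` stands in for the profit evaluation produced by the
    optimisation algorithm described in the abstract (an assumption
    about how the end-to-end signal is wired in)."""
    states, actions, rewards, next_states, done = batch
    # Q-values of the actions actually taken
    q = q_net(states).gather(1, actions.unsqueeze(1)).squeeze(1)
    with torch.no_grad():
        # Bootstrapped target from a frozen target network
        q_next = target_net(next_states).max(1).values
        target = rewards + gamma * (1.0 - done) * q_next
    loss = nn.functional.mse_loss(q, target)
    optimiser.zero_grad()
    loss.backward()
    optimiser.step()
    return loss.item()
```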
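The demand model can be sketched in the same hedged spirit. The snippet below is one plausible reading of the adapted Ornstein-Uhlenbeck process: each route's annual growth rate mean-reverts under Gaussian shocks (an Euler-Maruyama discretisation with a one-year step) and is shifted by a shared network-growth term to capture network trends. All names and parameters (`sample_demand_paths`, `theta`, `sigma`, `network_growth`) are hypothetical, not taken from the thesis.

```python
import numpy as np

def sample_demand_paths(d0, mu, theta, sigma, network_growth,
                        n_years, n_routes, rng=None):
    """Sample one demand path per route over n_years.

    Per-route growth rates follow a mean-reverting OU process
    (Euler-Maruyama, dt = 1 year) and are adjusted by a common
    network-wide growth trend. Parameter choices are assumptions:
      d0             initial demand per route, shape (n_routes,)
      mu             long-run mean growth rate the OU reverts to
      theta          mean-reversion speed
      sigma          volatility of the growth shocks
      network_growth shared network trend per year, shape (n_years,)
    """
    rng = np.random.default_rng() if rng is None else rng
    growth = np.full(n_routes, mu, dtype=float)
    demand = np.empty((n_years, n_routes))
    level = np.asarray(d0, dtype=float).copy()
    for t in range(n_years):
        # OU update: pull growth back toward mu, add a Gaussian shock
        growth += theta * (mu - growth) + sigma * rng.normal(size=n_routes)
        # Route growth adjusted for the general network-demand growth
        level *= 1.0 + growth + network_growth[t]
        demand[t] = level
    return demand
```

Repeated draws from such a sampler would supply the stochastic demand scenarios, with each sampled path serving as one training episode for the agent.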