Prediction of Aircraft Take-off Weight using Machine Learning

Abstract

Predicting aircraft Take-Off Weight (TOW) has long been a goal of aviation stakeholders, especially the operational and regulatory bodies involved in flight planning. However, because TOW is closely tied to operational trends and cost indices, aircraft operators tend to keep it confidential. In recent years, Machine Learning (ML) algorithms have achieved higher prediction accuracy in this field wherever TOW data are available. This paper studies the implementation of gradient boosting algorithms and Random Forests to determine which algorithm is best suited to predicting aircraft TOW prior to take-off, based solely on Flight Plan (FPL) and Terminal Aerodrome Forecast (TAF) parameters. The algorithms were trained on flights at Amsterdam Airport Schiphol (AMS), using an 80-20% train-test split. Among Gradient Boosting Decision Trees (GBDTs), LightGBM, XGBoost, and Random Forests, GBDTs achieved the smallest Mean Absolute Percentage Error (MAPE), with 1.71% and 2.17% on the training and testing datasets, respectively. The most influential feature proved to be the requested cruise speed, followed by the great circle distance between airports and the aircraft type. The model was validated on Paris - Charles de Gaulle Airport (CDG) and Brussels South Charleroi Airport (CRL), showing its independence from airport type. However, the distribution of flights in the training dataset, especially that of aircraft and airline types, proved to influence the model's applicability to other airports. Future work includes expanding the training dataset to all flights in the European network and introducing trajectory-based features such as aircraft speed intent. With a larger training dataset, neural network algorithms could also be explored.
Finally, regarding the improvement of trajectory predictions, it was found that more accurate TOW predictions alone do not suffice and that the effect of other operational parameters, especially speed profiles, should also be investigated.
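
For readers unfamiliar with the evaluation setup named in the abstract, the following is a minimal sketch of an 80-20 train-test split and of the MAPE metric in plain Python. It is not the authors' implementation: the function names, the random seed, and the TOW values (in kg) are hypothetical and chosen only for illustration.

```python
import random


def mape(actual, predicted):
    """Mean Absolute Percentage Error, returned in percent."""
    if len(actual) != len(predicted) or len(actual) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    total = sum(abs((a - p) / a) for a, p in zip(actual, predicted))
    return 100.0 * total / len(actual)


def train_test_split_80_20(rows, seed=0):
    """Shuffle a copy of the rows and split it into 80% train, 20% test."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)  # seeded for reproducibility
    cut = (len(shuffled) * 4) // 5
    return shuffled[:cut], shuffled[cut:]


# Hypothetical actual and predicted TOW values in kg (not from the study).
actual_tow = [70000.0, 65000.0, 230000.0, 60000.0]
predicted_tow = [71400.0, 63700.0, 225400.0, 60000.0]
error_pct = mape(actual_tow, predicted_tow)

# Hypothetical flight identifiers standing in for rows of FPL/TAF features.
train_rows, test_rows = train_test_split_80_20(range(10))
```

In a setup like the one described, each candidate model would be fitted on the training rows only, and MAPE would then be reported separately on the training and testing rows, which is how a pair of figures such as 1.71% and 2.17% arises.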