Circular Image

J. Sun

info

Please Note

80 records found

The “feasibility” of climate-optimal routing

Conference paper (2025) - Junzi Sun, Xavier Olive
The environmental impact of aviation has been the subject of significant research efforts for several decades. While reducing carbon emissions has reached a consensus across different stakeholders, leading to efforts to reduce route inefficiencies in air traffic management systems, other climate effects like contrails have spurred a different kind of discussion in the research community. Some research has rapidly moved into the pre-operational phase, aiming to reduce the climate impact of aviation by optimizing flight trajectories to avoid contrail formation. Notably, recent commercial projects from Google and Breakthrough Energy have been fast-tracking the operational perspective of contrail avoidance. In our past research, we have established a robust and fast methodology for trajectory optimization to minimize contrail formation based on TOP, a trajectory optimizer using the OpenAP aircraft performance model. In this paper, we address the practical challenges of implementing contrail-aware routing strategies in the aviation industry. We analyze the trade-offs between fuel consumption and contrail avoidance, the impact of weather forecast uncertainty on contrail mitigation strategies, the effects of contrail mitigation on airspace capacity and network operations, and the implications of contrail reduction strategies on the aviation industry and regulatory frameworks. We use a dataset of flight trajectories over Europe on a day with significant contrail potential to conduct a data-driven analysis of these challenges. Then, we demonstrate the potential difficulties in implementing contrail-optimal routing in practice, especially concerning the uncertainties in weather forecasts, airspace capacity, and the responsibility for optimal routing. Overall, we argue that contrail optimal routing should be approached with caution, and it may not be as straightforward as promoted by some stakeholders. ...
Journal article (2025) - Gabriel Jarry, Ramon Dalmau, Philippe Very, Junzi Sun
Accurately estimating aircraft fuel flow is critical for evaluating new procedures, designing next-generation aircraft, and monitoring the environmental impact of current aviation practices. This paper investigates the generalization capabilities of deep learning models for fuel flow prediction, focusing on their performance with aircraft types not included in the training data. We propose a novel methodology that combines neural network architectures with domain generalization techniques to improve robustness and reliability across different aircraft types. Using a comprehensive dataset of 101 aircraft types, split into training (64 types) and generalization (37 types) sets with each type represented by 1,000 flights, we introduce a pseudo-distance metric to quantify aircraft type similarity and explore sampling strategies to improve model performance in data-limited regions. Our findings show that for unseen aircraft types, especially with noise regularization, the model outperforms baselines such as corrected proxy estimates. This study demonstrates the potential of blending domain-specific insights with advanced machine learning techniques to develop scalable, accurate, and generalizable fuel flow estimation models. ...

Improving Crowdsourced Flight Trajectories with ADS-C Data

Conference paper (2025) - Junzi Sun, Xavier Olive, Martin Strohmeier, Vincent Lenders
The OpenSky Network has been collecting and providing crowdsourced air traffic surveillance data since 2013. The network has primarily focused on Automatic Dependent Surveillance-Broadcast (ADS-B) data, which provides highfrequency position updates over terrestrial areas. However, the ADS-B signals are limited over oceans and remote regions, where ground-based receivers are scarce. To address these coverage gaps, the OpenSky Network has begun incorporating data from the Automatic Dependent Surveillance-Contract (ADS-C) system, which uses satellite communication to track aircraft positions over oceanic regions and remote areas. In this paper, we analyze a dataset of over 720,000 ADS-C messages collected in 2024 from around 2,600 unique aircraft via the Alphasat satellite, covering Europe, Africa, and parts of the Atlantic Ocean. We present our approach to combining ADS-B and ADS-C data to construct detailed long-haul flight paths, particularly for transatlantic and African routes. Our findings demonstrate that this integration significantly improves trajectory reconstruction accuracy, allowing for better fuel consumption and emissions estimates. We illustrate how combined data captures flight patterns across previously underrepresented regions across Africa. Despite coverage limitations, this work marks an important advancement in providing open access to global flight trajectory data, enabling new research opportunities in air traffic management, environmental impact assessment, and aviation safety. ...

Insights from Japan’s High-Density Airspace and Meteorological Conditions

Conference paper (2025) - Katsuhiro Sekine, Junzi Sun, Tomoki Hasegawa, Eri Itoh
Persistent contrails significantly contribute to aviation’s climate impact through radiative forcing effects. Japanese airspace, characterized by high traffic density, prevalent short-haul flights, and diverse meteorological conditions, exhibits unique contrail formation patterns requiring tailored mitigation strategies. However, approaches such as altitude adjustments for contrail avoidance may lead to air traffic concentration at specific altitudes, raising aviation safety concerns. Therefore, this study identifies high-impact regions in Japanese airspace where contrail mitigation strategies can be effectively applied. Using the CoCiP model, CARATS Open Data, and ERA5 reanalysis, the analysis highlights critical seasonal and geographical patterns of contrail formation. Based on CARATS Open Data from 2019, which includes 399,541 flights across en route and oceanic airspace, April to June emerge as peak periods for contrail energy forcing (EF), driven by stable, humid atmospheric conditions. High-EF hotspots in southwestern, central, and northern Japan align with dense air traffic routes, with 1.71% of flights accounting for 80% of total contrail EF. A strong correlation between contrail altitude and persistence underscores the effectiveness of altitude adjustments for mitigation. Targeted strategies, such as nighttime altitude changes and interventions in high-EF sectors, could significantly reduce aviation’s climate impact. These findings establish a foundation for integrating contrail reduction measures into air traffic management systems in Japan, providing actionable insights for balancing climate benefits and operational safety. ...
Journal article (2025) - Junzi Sun, Esther Roosenbrand
Condensation trails, or contrails, are line-shaped clouds that are produced by an aircraft engine exhaust. These contrails often impact climate significantly due to their potential warming effect. Identification of contrail formation through satellite images has been an ongoing research challenge. Traditional computer vision techniques struggle with varying imagery conditions, and supervised machine learning approaches often require a large amount of hand-labeled images. This study researches few-shot transfer learning and provides an innovative approach for contrail segmentation with a few labeled images. The methodology leverages backbone segmentation models, which are pretrained on existing image datasets and fine-tuned using an augmented contrail-specific dataset. We also introduce a new loss function, SR loss, which enhances contrail line detection by incorporating Hough transformation in model training. This transformation improves performance over generic image segmentation loss functions. The openly shared few-shot learning library, contrail-seg, has demonstrated that few-shot learning can be effectively applied to contrail segmentation with the new loss function. ...
Conference paper (2025) - Phillipe Lothaller, Marta Ribeiro, Junzi Sun, Jasper de Wilde, Alexander Piva
Aircraft carry additional fuel reserves, referred to as contingency fuel, used to account for unforeseen events during a flight. Previous research has attempted to quantify the magnitude of such events, most notably the probability of adverse weather or ATFM regulation, yet their inherent unpredictability introduces uncertainty and frequently results in the overestimation of contingency fuel requirements. Recent studies use data-driven fuel-burn predictions to better estimate contingency fuel sizing; however, most are confined to specific routes or regions, limiting generalizability. To address this, we utilise real operational airline data covering both regional and intercontinental flights, and develop a quantile regression framework for predicting contingency fuel requirements, capable of adapting to more diverse set of flight characteristics. Our framework integrates flight-plan data, TAF weather forecasts, and proxy congestion features to predict required contingency fuel at varying quantile levels, enabling trade-offs between efficiency and safety. Unlike the current Statistical Contingency Fuel process, which applies different coverage levels by risk category, this evaluation uses a single fixed quantile for all flights when generating predictions. In a four-month out-of-sample evaluation, a single fixed quantile matched the safety performance of the Statistical Contingency Fuel process while reducing excess fuel carriage by up to 235,364 kg (≈11%). A more conservative quantile configuration yielded smaller savings but reduced abnormal flight-phase events by 22.2%. The key drivers of the final predictions are evaluated, offering pilots and dispatchers transparent explanations that can build trust and reduce reliance on discretionary fuel loading. ...

An Optimal Trajectory Approach

Conference paper (2025) - Aidana Tassanbi, Junzi Sun, Jacco Hoekstra
The mass of an aircraft is crucial for performance-related studies, such as predicting flight trajectories and analyzing flight emissions. In these studies, the flight trajectories are often reconstructed using a point-mass aircraft performance model combined with flight profiles from surveillance data and take-off mass information. However, airlines do not usually disclose take-off mass information, considering its sensitive nature. Thus, aircraft masses often need to be assumed or estimated. This paper presents a simple and computationally effective approach for estimating take-off mass using only open data and models. We explore the strong correlation between take-off mass, flight distance, cruise altitude, and partially, the airspeed during the cruise. The main idea is to generate fuel-optimal trajectories with known masses and distances, and then compare them with actual flight data. The optimal trajectories are generated using the open aircraft performance and optimization library. By assuming that actual flights follow quasi-fuel-optimal trajectories, the take-off mass of a flight can be estimated based on simple regression models trained on the optimal trajectory dataset. This open-loop take-off mass estimation approach requires no proprietary information from aircraft manufacturers or airlines. We verified the model with an anonymized dataset containing actual A320 flights with known take-off mass. Our two- and three-feature multi-linear models yield mean absolute percentage errors of 5.95 % and 4.89 %, respectively. This study is another step forward in open science and a contribution to the aircraft trajectory studies. ...
Conference paper (2025) - C. Dolman, M.J. Ribeiro, Junzi Sun, P.R.J.R. Lothaller, Jasper de Wilde, Alexander Piva, F.A.K. Vossen
Air traffic delays have a major impact on the aviation industry, affecting airlines, passengers, and the broader ecosystem. With increasing regulatory and sustainability pressures, accurate delay predictions are critical as they allow for precise determination of the contingency and discretionary fuel required for flights. This research aims to develop an explainable supervised learning model to improve existing en route delay predictions, focusing on intercontinental flights from North America to Amsterdam Schiphol Airport. While prior studies have explored flight delay prediction, they have not addressed two critical research gaps identified in this research: the inclusion of day-of-operations features, such as passenger information, aircraft weights, and cost index, and the use of transatlantic flight data for predictions 90 minutes before departure. To address these gaps, two Gradient-Boosted models, CatBoost and LightGBM, were trained using internal airline, airport, and METAR data. Both models outperformed the airline’s current in-use statistical model, with CatBoost achieving an MAE of 3.44 minutes and RMSE of 4.61 minutes and LightGBM achieving an MAE of 3.43 minutes and RMSE of 4.56 minutes. The most significant performance increase over the current model was observed under adverse weather conditions. This research advances en route delay prediction by providing more accurate delay forecasts, particularly in critical weather conditions, and proposes practical improvements to support future studies focused on enhancing model adaptability across diverse operational contexts. ...
Conference paper (2025) - Ana Maria Mekerishvili, Junzi Sun, Patrick Jonk, Vincent de Vries
Radiotelephony remains the primary medium for pilot-controller communication, yet extracting structured information from spoken exchanges is challenging. Deep learning approaches often depend on large annotated datasets, limiting use in data-scarce environments. This study evaluates open-source Large Language Models for Structured Information Extraction from ATC communications, with applications in assisting or automating pseudo-pilot tasks. We evaluate Llama 3.3 (70B) with baseline prompting and Gemma 3 (4B) with baseline and fine-tuned variants on 496 utterances from NLR’s ATM simulator: NARSIM (NLR ATC real-time simulator). Performance is assessed on human transcripts and ASR outputs from Whisper models, with varying prompt contexts. Cross-sector generalization is tested across two ATC sectors. Using manual scoring, Llama 3.3 achieves micro-F1 0.95 on human transcripts and 0.86 on fine-tuned Whisper outputs. While Gemma 3 performed weaker in its baseline form, fine-tuning on a small sample led to notable improvements. Results demonstrate the potential of LLMs for ATC applications without the need for large annotated datasets. ...
Journal article (2025) - Enrico Spinielli, Junzi Sun, Martin Strohmeier, Xavier Olive, Quinten Goens, Rainer Koele, Allan Tart, John Fitzgerald
EUROCONTROL's Performance Review Commission launched the 2024 PRC Data Challenge in July 2024 with the aim of engaging with data scientists and aviation enthusiasts for the development of an open model to estimate an aircraft's take-off weight. The dataset for the challenge represents a unique instance of otherwise difficult-to-obtain flight information and could be reused for educational purposes or to further improve the outcome of the challenge. ...
Journal article (2025) - Maarten Beltman, Marta Ribeiro, Jasper de Wilde, Junzi Sun
Punctuality is a key performance indicator for any airline, especially hub-and-spoke airlines, given their focus on short passenger connections. Flights that are delayed at departure need to compensate for lost time whilst airborne. Because fuelling takes place well before scheduled departure, predicted departure delays determine the planned fuel amounts for en-route speed optimization. To prevent unnecessary fuel burn, airlines benefit from highly accurate departure delay predictions. This study aims to extend previous work on airline departure delay forecasting to a dynamic and probabilistic domain, whilst incorporating novel day-of-operations airline information to further minimize prediction errors. Random Forest, CatBoost, and Deep Neural Network models are proposed for a case study on departure flights of a major hub-and-spoke airline from its hub airport between 1 January 2020 and 1 August 2023. The Random Forest model is selected for its probabilistic performance and high accuracy in predicting delays between 5 and 25 min, for which en-route speed optimization has the largest effect. At the 90 min prediction horizon, the model reaches a Mean Absolute Error of 8.46 min and a Root Mean Square Error of 11.91 min. For 76% of flights, the actual delay is within the predicted probability distribution range. Finally, this study puts a strong emphasis on explainability. Flight dispatchers are therefore provided with the main factors impacting the prediction, explaining the context of the flight. The versatility of the model is demonstrated in two shadow runs within the procedures of an international airline, where delays caused by familiar and unfamiliar factors were successfully predicted. ...
Conference paper (2025) - Xavier Olive, Yan Lok Cheung, Junzi Sun
tangram is an open research framework for real-time processing of high-throughput geospatial surveillance streams, with a primary application in ADS-B and Mode S surveillance data. While large-scale historical datasets have motivated extensive aviation research, the transfer of methods and algorithms to live data streams remains less documented. This transfer is significantly more challenging: real-time analyses are difficult to implement, debug, and reproduce. tangram addresses this gap by providing a modular and extensible platform that lowers the technical barriers for researchers, enabling them to integrate their own algorithms without developing a full streaming infrastructure, so that they can focus on addressing their own research questions. This paper introduces the architecture of the platform and demonstrates its flexibility through four representative use cases in air traffic management: integrating weather forecasts, estimating fuel flow, analysing contrail formation, and monitoring airport performance. Together, these examples illustrate how tangram enables reproducible, real-time experimentation and opens new perspectives for operationally relevant research in aviation. ...
Conference paper (2024) - Junzi Sun, E.J. Roosenbrand, Yuki Nonaka, Eri Itoh
Estimating contrail formation and proposing effective mitigation strategies have posed significant challenges for the aviation industry in recent years. This study utilizes the Japanese airspace as a case study to address the challenge of assessing and minimizing the environmental impact of contrails. Initially, we introduce a novel combination of meteorological and flight trajectory data sources, followed by a comparative data quality analysis. Drawing on four months of data during different seasons from 2023, we conduct an in-depth analysis of contrail formation within the Japanese airspace, uniquely quantifying contrails with highresolution data to provide insights into their geographical and seasonal variations. Subsequently, we examine the effectiveness of altitude diversions as a mitigation strategy. Our findings identify clear geographical and seasonal influences on contrail formation in the region. We illustrate that altitude diversions can significantly reduce contrail formation with minimal altitude adjustments of up to 2000 ft. We found that minor altitude diversions can mitigate between 70% and 90% of the persistent contrail formed near the Japan region. Notably, the results also highlight a concerning phenomenon: during warmer months, such as July, a higher quantity and percentage of persistent contrails is observed, and a larger proportion of these contrails cannot be mitigated through altitude diversions. This result could intensify positive radiative forcing during warmer periods, underscoring the need for further research into contrail mitigation strategies. ...
Journal article (2024) - Junzi Sun, E.J. Roosenbrand, Benoit Figuet, Thomas Dubot, Manuel Waltert, Xavier Olive
Contrails, formed under specific atmospheric conditions, have a noteworthy role in heat-trapping within the atmosphere. This study bridges the gap between theoretical contrail formation models and real-world data by employing flight information from OpenSky and meteorological data from the European Centre for Medium-Range Weather Forecasts. We introduce a computationally efficient contrail estimation module, leveraging a client-server architecture that allows on-demand weather data interpolation via an API, significantly reducing computational load and enhancing performance locally. The study also benchmarks the entire pipeline, from data acquisition to contrail prediction, offering a robust tool for future air traffic studies requiring interpolated weather data. ...
Journal article (2024) - Junzi Sun, E.J. Roosenbrand
Contrails, formed under specific atmospheric conditions, have a noteworthy role in heat-trapping within the atmosphere. This study bridges the gap between theoretical contrail formation models and real-world data by employing flight information from OpenSky and meteorological data from the European Centre for Medium-Range Weather Forecasts. We introduce a computationally efficient contrail estimation module, leveraging a client-server architecture that allows on-demand weather data interpolation via an API, significantly reducing computational load and enhancing performance locally. The study also benchmarks the entire pipeline, from data acquisition to contrail prediction, offering a robust tool for future air traffic studies requiring interpolated weather data. ...

Exploring Language Model Embodied Air Traffic Agents

Conference paper (2024) - Justas Andriuškevičius, Junzi Sun
Recent developments in language models have created new opportunities in air traffic control studies. The current focus is primarily on text and language-based use cases. However, these language models may offer a higher potential impact in the air traffic control domain, thanks to their ability to interact with air traffic environments in an embodied agent form. They also provide a language-like reasoning capability to explain their decisions, which has been a significant roadblock for the implementation of automatic air traffic control. This paper investigates the application of a language model-based agent with function-calling and learning capabilities to resolve air traffic conflicts without human intervention. The main components of this research are foundational large language models, tools that allow the agent to interact with the simulator, and a new concept, the experience library. An innovative part of this research, the experience library, is a vector database that stores synthesized knowledge that agents have learned from interactions with the simulations and language models. To evaluate the performance of our language model-based agent, both open-source and closed-source models were tested. The results of our study reveal significant differences in performance across various configurations of the language model-based agents. The best-performing configuration was able to solve almost all 120 but one imminent conflict scenarios, including up to four aircraft at the same time. Most importantly, the agents are able to provide human-level text explanations on traffic situations and conflict resolution strategies. ...
Journal article (2024) - Antoine Salgas, Junzi Sun, Scott Delbecq, Thomas Planès, Gilles Lafforgue
The study of the environmental transition of the aviation sector calls for prospective traffic scenarios. Detailed traffic and emissions inventories are often needed to refine the available analyses and to enable the simulation of regionalised scenarios. In the past studies, these are generally based on commercial, proprietary traffic data, making their dissemination problematic and reducing the reproducibility of the science produced. Open-source alternatives do exist but with limited geographical coverage. This study bridges this gap by presenting an innovative open-source dataset detailing 2019's global air traffic flows and associated CO2 emissions. A comprehensive approach that compiles diverse flight data sources is presented. The remaining data gaps are addressed by constructing a route network through systematic Wikipedia parsing and by estimating the related traffic using socio-economic data. Then, an aircraft performance model to estimate CO2 emissions is implemented. This methodology promises reinforced reproducibility and broader data accessibility in aviation environmental research. Several reference datasets are used to evaluate the accuracy of the open-source dataset. Despite various levels of accuracy for individual routes, major traffic flows are well estimated at the country and continental levels, albeit with room for refinement to ensure consistent data reliability. To facilitate the exploration of the dataset, the AeroSCOPE tool has been developed. To initiate research prospects, use cases of this dataset are proposed, concerning the network potential of electric and hydrogen-powered aircraft and inequalities in air transport. ...
Conference paper (2024) - Esther Roosenbrand, Junzi Sun, Jacco Hoekstra
Contrail optimization offers an efficient and cost-effective way for aviation to immediately reduce its climate impact. Open-source optimization, wherein the contrail and emission effects are balanced based on meteorological open data, has been presented in previous work. However, prior research overlooks the importance of using forecasting data, as opposed to post-processed reanalysis data. For contrail optimization to be implementable, forecasting data needs to be available at a sufficient quality in the flight planning stage in order to perform the optimization. In this paper, a fully open non-linear optimal control flight optimization is implemented and applied using both forecasting and reanalysis data. A total of 120 days (175.440 flights) of flight data from OpenSky are used in the analysis. We show that forecasts with larger lookahead times (up to 12 hours) are equally effective when compared to more recent forecasts (1 hour lookahead time) for contrail optimization, with equally high accuracy. However, when compared to more accurate post-processed reanalysis data, there are considerable differences in predicted contrails formed. This research shows there is still a long way to go before we can actually implement contrail optimal flight planning. ...
Conference paper (2024) - A.I. Gheorghe, Junzi Sun, M.J. Ribeiro, Pascal Hop, Benjamin Cramet
Predicting aircraft Take-Off Weight (TOW) has been a long-standing goal for aviation stakeholders, especially for operational and regulatory bodies involved in flight planning. Accurate TOW values would enable better emissions computation, leading to more effective regulation of aviation’s climate impact. However, aircraft operators prefer to keep TOWs confidential because they are sensitive to operational trends and cost indices. Consequently, many works have attempted to circumvent this gap by predicting TOW values. Unfortunately, limited success has been achieved primarily due to the lack of accurate real-world operational data. This study is unique in utilizing operational TOW data provided by airlines. We predict TOW before takeoff based solely on Flight Plan and Terminal Aerodrome Forecast parameters, primarily focusing on flights at Amsterdam Airport Schiphol. The accuracy of several Machine Learning algorithms is directly compared. The best Mean Absolute Percentage Error of 2.17% on the Schiphol testing dataset is achieved. The model is further validated on flights at Paris- Charles de Gaulle Airport and Brussels South Charleroi Airport with errors of 4.07% and 3.41%. We found that the distribution of flights in the training dataset, particularly aircraft and airline types, significantly influenced the model’s applicability. Recommendations are also made on how to improve the model further. ...
Conference paper (2024) - R.W. Vos, Junzi Sun, J.M. Hoekstra
Air traffic sector demand and capacity balancing are necessary for safe and efficient flight execution. Demand and capacity are determined in current operations based on schedules and flight plans. This research aims to improve air traffic demand forecasting by exploring machine learning-based trajectory prediction, specifically the newly emerged transformer-based neural network models. The predicted trajectories are considered to improve demand forecasts for Air Traffic Control in the Netherlands. We successfully built a transformer neural network using available traffic messages from the EuroControl B2B connection and actual trajectories obtained from the OpenSky ADS-B repository. A new lost function is specifically designed to improve this prediction model’s performance. This trajectory predictor could accurately generate trajectories, outperforming the flight plan and other neural network approaches by a good margin. For demand prediction, introducing improved trajectories provided small gains that could lead to more stable predictions. ...