Safety Cages for Reliable Machine Learning
A framework for exoplanet spectroscopy in the Ariel mission, extended with error-aware model specialisation
N.I. Grens (TU Delft - Aerospace Engineering)
G.C.H.E. de Croon – Mentor (TU Delft - Control & Simulation)
Luís F. Simões – Mentor (ML Analytics)
S.M. Cazaux – Graduation committee member (TU Delft - Planetary Exploration)
Vincent Meijer – Graduation committee member (TU Delft - Operations & Environment)
More Info
expand_more
Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.
Abstract
Ensuring the reliability of machine learning models in safety-critical space missions remains a significant challenge, especially when ground-truth data is unavailable for real-time validation. While machine learning can augment pipelines by extracting transmission spectra from complex light curves, vulnerabilities to instrument anomalies and domain shifts introduce risks. This study evaluates a modular safety cage architecture operating as a parallel monitoring layer to assess prediction validity without modifying the underlying estimator. By monitoring runtime indicators, including uncertainty quantification, out-of-domain detection, and influence functions, the framework constrains the operational domain to a verified region. The results demonstrate that model failure is multifaceted, requiring indicator fusion strategies, and that applying safety-driven rejection allows for a small reduction in data coverage leading to a significant reduction in prediction error. In addition, the framework is extended with an error-aware model specialisation pipeline that partitions the parameter space to deploy specialised local experts.
Files
File under embargo until 16-08-2026