Predictive Fault Localization in Mobile Networks using Multi-Domain KPIs

None, None

Predictive Fault Localization in Mobile Networks using Multi-Domain KPIs

Master Thesis (2026)

Author(s)

J.Y.R. Cui (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

R.E. Kooij – Mentor (TU Delft - Electrical Engineering, Mathematics and Computer Science)

M. Kloen – Mentor (Koninklijke KPN)

M. Ouwens – Mentor (Koninklijke KPN)

H. Wang – Graduation committee member (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Faculty

Electrical Engineering, Mathematics and Computer Science

Machine learning Deep learning Autoencoder LSTM Anomaly detection CQL

To reference this document use

https://resolver.tudelft.nl/uuid:9fcef3e1-db60-41d2-9d7a-33fcb9c7264d

More Info

expand_more

Publication Year

2026

Language

English

Graduation Date

26-02-2026

Awarding Institution

Delft University of Technology

Programme

Electrical Engineering, Embedded Systems

Faculty

Electrical Engineering, Mathematics and Computer Science

Downloads counter

151

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

The rapid growth of mobile data traffic and the evolution towards 5G-Advanced and 6G networks have significantly increased the operational complexity of mobile networks, making fault localization a critical challenge for mobile network operators. Traditional fault management approaches rely on reactive, threshold-based alarms operating within individual network domains. This is increasingly ineffective for detecting and localizing faults in complex, multi-domain environments.

This thesis proposes a cross-domain fault localization framework that combines unsupervised anomaly detection with offline reinforcement learning, where the RL agent learns a policy for identifying the most likely fault origin based on observed anomaly patterns across domains. The proposed framework analyzes time-series Key Performance Indicators (KPIs) collected from the Radio Access Network (RAN), Core Network and end-to-end (E2E) domains to detect anomalous behaviour without requiring labeled data. Subsequently, the detected anomalies and cross-domain results are used by an offline reinforcement learning agent to track the most probable origin of faults across network domains.

The framework is evaluated using real-world KPI data collected over a 1 month period, consisting of 13 KPIs across the 3 domains. Unsupervised anomaly detection is applied to identify deviations from normal network behavior, while fault localization is performed using reinforcement learning based on the observed anomaly patterns. The results demonstrate that the proposed approach can identify anomalies and provide fault localization despite the lack of explicit fault labels.

This thesis highlights key challenges such as traffic dependent KPI behaviour, noise during low traffic periods and threshold selection. While the results indicate that combining unsupervised anomaly detection with reinforcement learning is a promising direction for predictive fault localization, further refinement is required to improve robustness, precision and operational consistency. Future work should focus on online deployment, calibration and improved learning strategies to support deployment in real world mobile network environments.

Files

Master_Thesis.pdf

(pdf | 10.1 Mb)

License info not available