Assessing ML-based anomaly detection across individual telemetry modalities in Cloud-native B5G Networks
S. Kutsarov (TU Delft - Electrical Engineering, Mathematics and Computer Science)
S.M.B.S. Samarakoon Mudiyanselage – Mentor (TU Delft - Electrical Engineering, Mathematics and Computer Science)
Nitinder Mohan – Mentor (TU Delft - Electrical Engineering, Mathematics and Computer Science)
Jérémie Decouchant – Graduation committee member (TU Delft - Electrical Engineering, Mathematics and Computer Science)
More Info
expand_more
Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.
Abstract
Modern cloud-native systems generate large amounts of telemetry data, including logs, metrics, and traces, which are useful for monitoring and diagnosing system behavior. However, the effectiveness of machine learning-based anomaly detection varies significantly depending on the telemetry modality and the nature of the faults.
With the ever-increasing demands of 5G applications and upcoming 6G systems, operators must ensure that their networks can rapidly respond to and mitigate faults, with detection being the first step. This project investigates the performance of machine learning models for anomaly detection when logs, metrics, and traces are analyzed independently, with the goal of understanding their relative strengths and limitations.
The study analyzes the performance of fifteen models across five different fault classes, comprising 22 faults in total. After creating an appropriate dataset, each model was evaluated using data collected from several runs. The results show that a single modality cannot detect all faults, two modalities can detect all but one fault, and all three modalities together can detect every fault.