Prediction-based Anomaly Detection in Multivariate Time-Series Data
Improving Wahoo Fitness Cycling Data Quality by Addressing Sensor Errors
M. Segers (TU Delft - Electrical Engineering, Mathematics and Computer Science)
H.S. Hung – Mentor (TU Delft - Pattern Recognition and Bioinformatics)
C.A. Raman – Mentor (TU Delft - Pattern Recognition and Bioinformatics)
David M.J. Tax – Graduation committee member (TU Delft - Pattern Recognition and Bioinformatics)
S. Tan – Graduation committee member (TU Delft - Interactive Intelligence)
Kilian Hendrickx – Mentor
More Info
expand_more
Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.
Abstract
Consumer-grade fitness trackers can produce unreliable physiological data due to sensor errors. The same holds for cycling data from Wahoo Fitness, where heart rate (HR) and power readings are essential for training and performance analysis. This thesis presents a prediction-based anomaly detection framework tailored to multivariate time-series cycling data. The approach reframes anomaly detection as a personalized physiological HR prediction problem. We define anomalies as deviations between measured sensor values and their predicted values, based on contextual activity metrics (e.g., power, cadence, speed, altitude, and gradient) and user-specific embeddings. The system combines ordinary differential equations (ODEs) modeling heart rate dynamics with machine learning techniques to capture non-linear, non-stationary, and individualized relationships. The model not only detects implausible values but reconstructs them with physiologically consistent alternatives. Compared to reconstruction-based methods, which are mostly used for anomaly detection in time series data, this physiologically grounded approach better differentiates between normal variation and true anomalies. Experimental results demonstrate effective identification and correction of HR and power anomalies, contributing to improved data quality and reliability in wearable fitness applications.