Predicting Multiple Sclerosis from Mobile Phone Typing Data through Representation Learning
D.R.A. Heijbroek (TU Delft - Electrical Engineering, Mathematics and Computer Science)
Marcel J.T. Reinders – Mentor (TU Delft - Pattern Recognition and Bioinformatics)
R. Ghorbani – Mentor (TU Delft - Pattern Recognition and Bioinformatics)
David M.J. Tax – Graduation committee member (TU Delft - Pattern Recognition and Bioinformatics)
Richard Hendriks – Graduation committee member (TU Delft - Signal Processing Systems)
More Info
expand_more
Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.
Abstract
A deeper understanding of Multiple Sclerosis (MS) symptom progression is required for diagnostic accuracy and patient care. Remote monitoring through smartphones can provide continuous insights in the well-being of MS patients. This research aims to explore differences between MS and healthy typing behavior through collected typing behavior data (keystroke dynamics) on smartphones. The main contribution of this work was finding patterns within typing data that distinguish healthy typing behavior from MS with unlabeled time series data, except the diagnosis. A dataset of three combined studies was provided, which included 182 patients suffering from MS and 335 healthy subjects. An autoencoder was fitted on healthy samples to learn the temporal patterns in healthy typing behavior. Then, an anomaly detection method leveraged both the reconstruction loss and embeddings of these samples to determine whether MS samples deviated from healthy behavior. MS subjects were evaluated against healthy subjects through the ROC-AUC on both sample level and subject level. An initial hold out dataset of 20\% of the subjects was left out for validation. The overall performance was poor (AUC 0.6-0.8), because we aimed to find one general method for all MS subjects. Key limitations were the large variance in healthy typing behavior, relatively short window lengths, and difficulty in quantification of the training step of the autoencoder. Despite the difficulty of evaluating the performance of our suggested method, the described methods provide interesting monitoring possibilities for neurologists to enhance MS diagnosis. These monitoring possibilities could extend towards monitoring the effect of treatment, which is the first step towards personalised healthcare in MS.
Files
File under embargo until 01-01-2027