Performance of outlier detection on smartwatch data in single and multiple person environments

None, None

Performance of outlier detection on smartwatch data in single and multiple person environments

An analysis of the performance of different outlier detection methods on consumer-grade wearable data in environments with single and multiple subjects

Bachelor Thesis (2023)

Author(s)

L.T. Wubben (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

David M.J. Tax – Mentor (TU Delft - Pattern Recognition and Bioinformatics)

Arman Naseri Naseri – Mentor (TU Delft - Pattern Recognition and Bioinformatics)

R. Ghorbani – Mentor (TU Delft - Pattern Recognition and Bioinformatics)

Guohao Guohao – Graduation committee member (TU Delft - Embedded Systems)

Faculty

Electrical Engineering, Mathematics and Computer Science

Copyright

Machine Learning Smartwatch Outlier detection Biometrics

To reference this document use:

https://resolver.tudelft.nl/uuid:aa57c561-8419-4032-b6fd-b7d462c513c4

More Info

expand_more

Publication Year

2023

Language

English

Copyright

Graduation Date

30-06-2023

Awarding Institution

Delft University of Technology

Project

['CSE3000 Research Project']

Programme

['Computer Science and Engineering']

Faculty

Electrical Engineering, Mathematics and Computer Science

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Outlier detection is an essential part of modern systems. It is used to detect anomalies in behaviour or performance of systems or subjects, such as fall detection in smartwatches or voltage irregularity detection in batteries. This provides early indications of something of potential problems.

A part of outlier detection that is not often analysed is the performance of algorithms in environments with data from only one subject, versus environments with data from multiple subjects. This paper aims to answer the questions regarding the performance of Gaussian Mixture Models (GMM) and DBSCAN in these different environments. This paper focuses on time series data collected from consumer-grade wearables like smartwatches. In this paper, the outliers are defined manually, as the used data set did not contain predefined outliers. This research considers both outliers defined within the subject data, and the use of other subjects as outliers.

Results from this paper indicate that the amount of subjects in the environment is not the sole factor in the performance of these algorithms. Rather, it is a combination of the amount of subjects in the environment and the type of outlier to be detected. Results show that a GMM has difficulty distinguishing subjects that are similar when using another subject as outlier data. On average, DBSCAN outperforms a GMM in almost all cases, and DBSCAN is a lot more consistent in its performance than a GMM.

Files

Research_Project_Paper_Luuk_Wu... (pdf)

(pdf | 1.39 Mb)

License info not available