Analysis of Mixed Concept Drift Detectors in Deployed Machine Learning Models

Bachelor Thesis (2023)
Author(s)

T. Zamfirescu (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

L. Poenaru-Olaru – Mentor (TU Delft - Software Engineering)

Jan S. Rellermeyer – Graduation committee member (TU Delft - Data-Intensive Systems)

Faculty
Electrical Engineering, Mathematics and Computer Science
Copyright
© 2023 Toma Zamfirescu
Publication Year
2023
Language
English
Graduation Date
03-02-2023
Awarding Institution
Delft University of Technology
Project
CSE3000 Research Project
Programme
Computer Science and Engineering
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Label-independent concept drift detectors are an emerging topic in machine learning research, especially for models deployed in production environments, where obtaining labels can become increasingly difficult and costly. Concept drift refers to unforeseeable changes in the distribution of data streams, which directly affect the performance of a model trained on historical data. This paper first focuses on two mixed label-independent drift detectors, SQSI and UDetect, which are implemented and evaluated on a specific setup using synthetic and real-world data sets. Next, multiple label-dependent drift detectors are evaluated on real-world data sets, and their results are compared with those of the label-independent detectors. The paper presents a framework for comparing multiple concept drift detectors across different data sets and configurations, in order to check whether they can be used reliably in a production environment.

Files

Toma_Zamfirescu_RP_2023.pdf
(pdf | 0.995 Mb)
License info not available