Deciphering Learning Curve Characteristics via K-Means Clustering of Curve Model Parameters

None, None

Deciphering Learning Curve Characteristics via K-Means Clustering of Curve Model Parameters

Bachelor Thesis (2024)

Author(s)

E.A. Ozgur (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

O.T. Turan – Mentor (TU Delft - Pattern Recognition and Bioinformatics)

Tom Julian Viering – Mentor (TU Delft - Pattern Recognition and Bioinformatics)

H.S. Hung – Graduation committee member (TU Delft - Pattern Recognition and Bioinformatics)

Faculty

Electrical Engineering, Mathematics and Computer Science

Copyright

Clustering Learning curve Curve model

To reference this document use:

https://resolver.tudelft.nl/uuid:26e3395c-ff44-4ccb-9bf5-c6b0529d41f7

More Info

expand_more

Publication Year

2024

Language

English

Copyright

Graduation Date

05-02-2024

Awarding Institution

Delft University of Technology

Project

['CSE3000 Research Project']

Programme

['Computer Science and Engineering']

Faculty

Electrical Engineering, Mathematics and Computer Science

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Learning curves illustrate the relationship between the performance of learning algorithms and the increasing volume of training data [1, 2, 3]. While the concept of learning curves is well-established, clustering these curves based on fitting parameters remains an underexplored area. Our study delves into this domain and leverages the Learning Curve Database (LCDB) to discover potential patterns. We investigate whether different curve models uncover distinct patterns, examine the impact of different datasets on these learners, and explore if various learners display unique characteristics and behaviors or adhere to a common pattern. Curve model analyses conclude that most of the data points are in a single cluster (dominant cluster), indicating a potential commonality. Certain learners, such as QuadraticDiscriminantAnalysis and PassiveAggressiveClassifier, exhibit unique traits and do not conform to this common pattern, regardless of dataset attributes. Moreover, while various learners demonstrate similar characteristics within a single curve model, distinct patterns emerged when comparing across different curve models, indicating internal similarity but external divergence in behavior.

Files

Enes_RP_Final_V1.pdf

(pdf | 0.642 Mb)

License info not available

Enes_RP_Final_V1_1.pdf

(pdf | 0.661 Mb)

License info not available