Non-Monotonicity in Empirical Learning Curves

Identifying non-monotonicity through slope approximations on discrete points

Bachelor Thesis (2023)
Author(s)

C. Socol (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

T.J. Viering – Mentor (TU Delft - Pattern Recognition and Bioinformatics)

J.H. Krijthe – Mentor (TU Delft - Pattern Recognition and Bioinformatics)

Z. Yue – Graduation committee member (TU Delft - Multimedia Computing)

Faculty
Electrical Engineering, Mathematics and Computer Science
Publication Year
2023
Language
English
Graduation Date
28-06-2023
Awarding Institution
Delft University of Technology
Project
CSE3000 Research Project
Programme
Computer Science and Engineering
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Learning curves show the performance of a Machine Learning (ML) model as a function of the size of its training set. It was commonly assumed that adding more training samples always increases a model's accuracy (i.e., that learning curves are monotone), but recent works show this is not always the case: some learners on some problems exhibit non-monotonic behaviour. To this end, we introduce a new method to identify non-monotonicity in empirical learning curves by approximating the curve's slope through regression around the discrete points on which it is defined. This paper formalises this metric and then evaluates its accuracy through different experiments. Finally, we run the proposed metric on a subset of the extensive Learning Curve Database (LCDB) by Mohr et al. to gain better insight into the problem of non-monotonicity in learning. We found that the metric identifies non-monotonicity in learning curves well (98% experimental accuracy) and does not treat small increases due to measurement error as non-monotonicity. We also identified that non-monotonicity may be a property of some classifiers, such as Linear Discriminant Analysis, and that non-monotonicity is frequently observed in datasets with faster training times.
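The core idea of the abstract, estimating the local slope of a learning curve by regression over its discrete anchor points and flagging the curve as non-monotonic when the error rises, can be sketched as follows. This is an illustrative simplification, not the thesis's exact metric: the sliding-window width (`window`) and the noise tolerance (`tol`) are hypothetical parameters.

```python
# Sketch: detect non-monotonicity in an error learning curve by fitting an
# ordinary-least-squares slope over each window of consecutive anchor points
# and flagging the curve when any local slope is positive beyond a tolerance
# meant to absorb small measurement error. Parameters are illustrative only.

def ols_slope(xs, ys):
    """Least-squares slope of ys regressed on xs."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    num = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    den = sum((x - mx) ** 2 for x in xs)
    return num / den

def local_slopes(sizes, errors, window=3):
    """Slope of the curve over each run of `window` consecutive anchors."""
    return [ols_slope(sizes[i:i + window], errors[i:i + window])
            for i in range(len(sizes) - window + 1)]

def is_nonmonotonic(sizes, errors, window=3, tol=1e-3):
    """True if the error rises (local slope > tol) anywhere on the curve."""
    return any(s > tol for s in local_slopes(sizes, errors, window))
```

For example, a steadily decreasing error curve such as `[0.5, 0.4, 0.3, 0.25, 0.2]` yields only negative local slopes and is reported as monotone, while a curve with a pronounced bump, e.g. `[0.5, 0.2, 0.5, 0.45, 0.4]`, produces a positive local slope and is flagged as non-monotonic.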
