Comparative Analysis of Curriculum Strategies in training Meta-Learning

None, None

Comparative Analysis of Curriculum Strategies in training Meta-Learning

Curriculum Strategies for Faster Meta-Learning

Bachelor Thesis (2024)

Author(s)

M.T. Mihai (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

J.A. de Vries – Mentor (TU Delft - Sequential Decision Making)

MTJ Spaan – Mentor (TU Delft - Sequential Decision Making)

Pradeep Murukannaiah – Graduation committee member (TU Delft - Interactive Intelligence)

Faculty

Electrical Engineering, Mathematics and Computer Science

Active Learning Curriculum Learning Meta-learning Self-Paced Learning Teacher-student networks

To reference this document use:

https://resolver.tudelft.nl/uuid:da90fcef-b242-41f4-b000-3c946ea1ba5d

More Info

expand_more

Publication Year

2024

Language

English

Graduation Date

25-06-2024

Awarding Institution

Delft University of Technology

Project

['CSE3000 Research Project']

Programme

['Computer Science and Engineering']

Faculty

Electrical Engineering, Mathematics and Computer Science

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Meta-Learning is an emerging field where the main challenge is to develop models capable of distilling previous experiences to efficiently learn new tasks. Curriculum Learning, a group of optimization strategies, structures data in a meaningful order which aids learning. However, the extent to which curriculum strategies can optimize the performance of meta-learners remains unclear. Here we study the separate and joint effects of a model-based (ScreenerNet) and a statistics-based (Active Bias) curriculum strategy on the training of a meta-learning model (Neural Processes) which solves 1-D function regression tasks. The findings show that ScreenerNet increases in-task accuracy and accelerates convergence, but decreases the generalization performance. Active Bias achieves mixed generalization results and significantly decreases training efficiency when trained on noisy data-sets. Combining them partially mitigates ScreenerNet's overfitting and stabilizes Active Bias' susceptibility to noise, but further research is necessary in order to achieve consistent improvements to the baseline.

Files

Final_Thesis.pdf

(pdf | 1.85 Mb)

License info not available