Mind the Gap: Layerwise Proximal Replay for Stable Continual Learning

Bachelor Thesis (2025)
Author(s)

O.S.E. Hage (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

Tom Julian Viering – Mentor (TU Delft - Pattern Recognition and Bioinformatics)

G.M. van de Ven – Mentor (TU Delft - Pattern Recognition and Bioinformatics)

A. Hanjalic – Graduation committee member (TU Delft - Intelligent Systems)

Faculty
Electrical Engineering, Mathematics and Computer Science
Publication Year
2025
Language
English
Graduation Date
25-06-2025
Awarding Institution
Delft University of Technology
Project
CSE3000 Research Project
Programme
Computer Science and Engineering
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Continual learning aims to train models that incrementally acquire new knowledge over a sequence of tasks while retaining previously learned information, even without access to past data. A key challenge in this setting is maintaining stability at task transitions, where even methods such as experience replay can suffer a temporary drop in performance known as the stability gap. In this work, we evaluate Layerwise Proximal Replay (LPR), a recently proposed optimisation strategy that constrains updates at the layer level to preserve internal representations of past data. We implement LPR on a simple multi-layer perceptron and benchmark it against an incremental joint training baseline on a domain-incremental variant of Rotated MNIST. To quantify the stability gap, we track the accuracy drop immediately following each task switch and compute the local accuracy minimum reached after each transition. Our results show that LPR consistently reduces the stability gap across a range of learning rates, with statistically significant improvements at higher learning rates. However, this improvement comes at the cost of reduced performance on later tasks. These findings show that LPR substantially mitigates short-term performance degradation at task boundaries while allowing high learning rates, offering a practical route to greater stability in continual learning.
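
The layerwise constraint mentioned in the abstract can be illustrated with a generic proximal preconditioning step: each layer's gradient is multiplied by a matrix that penalises changes to that layer's outputs on a small buffer of replayed inputs. The Python/NumPy sketch below shows this idea for a single fully connected layer; the function names, the penalty strength omega, and the toy shapes are illustrative assumptions, not LPR's exact algorithm or the implementation used in the thesis.

```python
import numpy as np

def proximal_preconditioner(buffer_inputs, omega=1.0):
    """Build P = (I + omega * X X^T)^(-1) for one fully connected layer.

    buffer_inputs : array of shape (n_buffer, in_features), the layer's inputs
                    for a small sample of replayed past data
    omega         : strength of the penalty on changing outputs for buffered inputs
    """
    X = buffer_inputs.T                          # (in_features, n_buffer)
    eye = np.eye(X.shape[0])
    return np.linalg.inv(eye + omega * X @ X.T)  # (in_features, in_features)

def preconditioned_step(W, grad_W, P, lr=0.1):
    """Gradient step whose weight change minimally perturbs buffered activations.

    The update solves  min_dW  <grad_W, dW> + (||dW||^2 + omega * ||dW X||^2) / (2 * lr),
    which gives dW = -lr * grad_W @ P.
    """
    return W - lr * grad_W @ P

# Toy illustration with random numbers (shapes only, not a real training run):
rng = np.random.default_rng(0)
W = rng.normal(size=(64, 32))              # weights of one hidden layer
grad_W = rng.normal(size=(64, 32))         # gradient from the current task's loss
buffer_inputs = rng.normal(size=(16, 32))  # layer inputs for 16 replayed examples
P = proximal_preconditioner(buffer_inputs, omega=1.0)
W_new = preconditioned_step(W, grad_W, P)
print("max change in buffered activations:",
      np.abs((W_new - W) @ buffer_inputs.T).max())
```

Because the preconditioner shrinks the component of the update that would alter responses on buffered data, the layer's outputs for replayed inputs change little even at high learning rates, which is the mechanism behind the reduced stability gap reported above.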
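The stability-gap measurement described in the abstract can likewise be sketched directly: record a previous task's test accuracy at every iteration, take its value just before a task switch, and compare it with the local minimum reached in a window after the switch. The helper below is a minimal illustration; the function name, the fixed search window, and the synthetic accuracy curve are assumptions for the example, not the thesis's evaluation code.

```python
import numpy as np

def stability_gap(task_accuracy, boundary_iter, window=200):
    """Quantify the stability gap for one task transition.

    task_accuracy : per-iteration test accuracy on a previously learned task
    boundary_iter : iteration index at which training switches to the new task
    window        : number of iterations after the switch to search for the dip
    """
    acc = np.asarray(task_accuracy, dtype=float)
    acc_before = acc[boundary_iter - 1]               # accuracy just before the switch
    post = acc[boundary_iter:boundary_iter + window]  # accuracy trace after the switch
    dip = post.min()                                  # local minimum after the transition
    return acc_before - dip                           # depth of the stability gap

# Example with a synthetic accuracy curve (illustrative values only):
trace = np.concatenate([
    np.linspace(0.5, 0.95, 500),                       # learning the first task
    0.95 - 0.3 * np.exp(-np.arange(500) / 50),         # dip and recovery after the switch
])
print(f"stability gap: {stability_gap(trace, boundary_iter=500):.3f}")
```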
