Evaluating Catastrophic Forgetting in Neural Networks Trained with Continual Backpropagation

None, None

Evaluating Catastrophic Forgetting in Neural Networks Trained with Continual Backpropagation

Bachelor Thesis (2025)

Author(s)

J. Jučas (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

L.R. Engwegen – Mentor (TU Delft - Sequential Decision Making)

J.W. Böhmer – Mentor (TU Delft - Sequential Decision Making)

Faculty

Electrical Engineering, Mathematics and Computer Science

Neural network Continual Learning Continual Backpropagation

To reference this document use:

https://resolver.tudelft.nl/uuid:f164c62d-ef06-4630-a7dc-7b8495c2cb25

More Info

expand_more

Publication Year

2025

Language

English

Graduation Date

27-06-2025

Awarding Institution

Delft University of Technology

Faculty

Electrical Engineering, Mathematics and Computer Science

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Continual Backpropagation (CBP) has recently been proposed as an effective method for mitigating loss of plasticity in neural networks trained in continual learning (CL) settings. While extensive experiments have been conducted to demonstrate the algorithm's ability to mitigate loss of plasticity, its susceptibility to catastrophic forgetting remains unexamined. This work addresses this gap by systematically evaluating the magnitude of catastrophic forgetting in models trained with CBP and comparing it to four baseline algorithms. We demonstrate that CBP suffers from significantly higher forgetting compared to all tested baselines, particularly in long-term and periodically revisited task scenarios. Moreover, we find that specific hyperparameters of the algorithm have significant influence on the stability-plasticity trade-off. We further analyze the internal dynamics of CBP, identifying strong correlations between forgetting and metrics such as activation drift. Finally, we evaluate three modifications to CBP: noise injection, layer-specific replacement, and partial neuron replacement, and show that the modifications reduce forgetting while maintaining high plasticity.

Files

Research_Paper_Justinas_Ju_as.... (pdf)

(pdf | 14 Mb)

License info not available