Exploring Alternatives to Full Neuron Reset for Maintaining Plasticity in Continual Backpropagation

Bachelor Thesis (2025)
Author(s)

U. Urbonavičiūtė (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

L.R. Engwegen – Mentor (TU Delft - Sequential Decision Making)

J.W. Böhmer – Mentor (TU Delft - Sequential Decision Making)

M. Khosla – Graduation committee member (TU Delft - Multimedia Computing)

Faculty
Electrical Engineering, Mathematics and Computer Science
Publication Year
2025
Language
English
Graduation Date
25-06-2025
Awarding Institution
Delft University of Technology
Project
CSE3000 Research Project
Programme
Computer Science and Engineering
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Deep learning systems are typically trained in static environments and fail to adapt when faced with a continuous stream of new tasks. Continual learning addresses this by allowing neural networks to learn sequentially without forgetting prior knowledge. However, such models often suffer from a gradual decline in learning ability, a phenomenon known as loss of plasticity. Recent work introduced Continual Backpropagation (CBP), which restores plasticity by fully reinitializing low-utility neurons. While effective, full reinitialization can also disrupt the learning process. This research proposes and tests three less disruptive alternatives: injecting Gaussian noise into weights, reinitializing weights from the original initialization distribution, and rescaling weights to match their initial variance. We evaluate these strategies on the Permuted MNIST benchmark. The findings show that noise injection performs similarly to the original CBP, reinitializing weights from the original distribution performs better, and weight rescaling performs much worse than CBP. This implies that less destructive methods can maintain plasticity effectively, with some alternatives offering a better stability-plasticity trade-off than CBP.
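For concreteness, the three alternatives described in the abstract could be sketched as below. This is a minimal PyTorch sketch, not the thesis' actual implementation: the function name `reset_low_utility_weights`, the noise scale `noise_std`, and the per-neuron standard-deviation rescaling are illustrative assumptions, and the abstract does not specify how low-utility neurons are selected.

```python
import torch

def reset_low_utility_weights(weight, low_idx, init_std,
                              strategy="noise", noise_std=0.01):
    """Apply one of three alternative reset strategies to the incoming
    weights (rows of `weight`) of previously selected low-utility neurons.

    All names and constants here are illustrative assumptions; the
    abstract does not give the thesis' implementation details.
    """
    with torch.no_grad():
        if strategy == "noise":
            # Alternative 1: perturb the existing weights with Gaussian
            # noise instead of discarding them entirely.
            weight[low_idx] += noise_std * torch.randn_like(weight[low_idx])
        elif strategy == "reinit":
            # Alternative 2: redraw the weights from the original
            # initialization distribution (assumed here to be a zero-mean
            # Gaussian with standard deviation `init_std`).
            weight[low_idx] = init_std * torch.randn_like(weight[low_idx])
        elif strategy == "rescale":
            # Alternative 3: keep the learned weight directions but rescale
            # each neuron's weights so their standard deviation matches the
            # initial value `init_std`.
            rows = weight[low_idx]
            current_std = rows.std(dim=1, keepdim=True).clamp_min(1e-8)
            weight[low_idx] = rows * (init_std / current_std)
        else:
            raise ValueError(f"unknown strategy: {strategy}")
```

In a CBP-style training loop, a function like this would presumably be called after the utility-based selection step, replacing the full neuron reinitialization that standard CBP performs.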

Files

Research_Paper_Urte.pdf
(PDF | 10.7 MB)
License info not available