Embedded Continual Learning for High-Energy Physics

Journal Article (2024)
Author(s)

Marco Barbone (Imperial College London)

Christopher Brown (Imperial College London)

Georgi Gaydadjiev (TU Delft - Computer Engineering, University Medical Center Groningen)

Thomas Maguire (University Medical Center Groningen)

Mikael Mieskolainen (Imperial College London)

Benjamin Radburn-Smith (Imperial College London)

Wayne Luk (Imperial College London)

Alexander Tapper (Imperial College London)

Affiliation
External organisation
DOI related publication
https://doi.org/10.1051/epjconf/202429509014 Final published version
More Info
expand_more
Publication Year
2024
Language
English
Affiliation
External organisation
Volume number
295
Article number
09014
Event
26th International Conference on Computing in High Energy and Nuclear Physics, CHEP 2023 (2023-05-08 - 2023-05-12), Norfolk, United States
Downloads counter
206

Abstract

Neural Networks (NN) are often trained offline on large datasets and deployed on specialised hardware for inference, with a strict separation between training and inference. However, in many realistic applications the training environment differs from the real world, or data arrives in a streaming fashion and is continuously changing. In these scenarios, the ability to continuously train and update NN models is desirable. Continual learning (CL) algorithms allow training of models on a stream of data. CL algorithms are often designed to work in constrained settings, such as limited memory and computational power, or limitations on the ability to store past data (e.g, due to privacy concerns or memory requirements). High-energy physics experiments are developing intelligent detectors, with algorithms running on computer systems located close to the detector to meet the challenges of increased data rates and occupancies. The use of NN algorithms in this context is limited by changing detector conditions, such as degradation over time or failure of an input signal which might cause the NNs to lose accuracy leading, in the worst case to the loss of interesting events. CL has the potential to solve this issue, using large amounts of continuously streaming data to allow the network to recognise changes, and to learn and adapt to detector conditions. It has the potential to outperform traditional NN training techniques as not all possible scenarios can be predicted and modelled in static training data samples. However, NN training is computationally expensive and when combined with the strict timing requirements of embedded processors deployed close to the detector, current state-of-the-art offline approaches cannot be directly applied to the real-time systems. Alternatives to typical backpropagation-based training that can be deployed on FPGAs for real-time data processing are presented, and their computational and accuracy characteristics are discussed in the context of High-Luminosity LHC.