Embedded Continual Learning for High-Energy Physics

None, None; None, None; None, None; None, None; None, None; None, None; None, None; None, None

Embedded Continual Learning for High-Energy Physics

Journal Article (2024)

Author(s)

Marco Barbone (Imperial College London)

Christopher Brown (Imperial College London)

Georgi Gaydadjiev (TU Delft - Electrical Engineering, Mathematics and Computer Science, University Medical Center Groningen)

Thomas Maguire (University Medical Center Groningen)

Mikael Mieskolainen (Imperial College London)

Benjamin Radburn-Smith (Imperial College London)

Wayne Luk (Imperial College London)

Alexander Tapper (Imperial College London)

Affiliation

External organisation

DOI related publication

https://doi.org/10.1051/epjconf/202429509014 Final published version

To reference this document use

https://resolver.tudelft.nl/uuid:56b566cf-a1e2-4fb5-97f6-6f90e521e675

More Info

expand_more

Publication Year

2024

Language

English

Affiliation

External organisation

Volume number

295

Article number

09014

Event

26th International Conference on Computing in High Energy and Nuclear Physics, CHEP 2023 (2023-05-08 - 2023-05-12), Norfolk, United States

Downloads counter

241

Abstract

Neural Networks (NN) are often trained offline on large datasets and deployed on specialised hardware for inference, with a strict separation between training and inference. However, in many realistic applications the training environment differs from the real world, or data arrives in a streaming fashion and is continuously changing. In these scenarios, the ability to continuously train and update NN models is desirable. Continual learning (CL) algorithms allow training of models on a stream of data. CL algorithms are often designed to work in constrained settings, such as limited memory and computational power, or limitations on the ability to store past data (e.g, due to privacy concerns or memory requirements). High-energy physics experiments are developing intelligent detectors, with algorithms running on computer systems located close to the detector to meet the challenges of increased data rates and occupancies. The use of NN algorithms in this context is limited by changing detector conditions, such as degradation over time or failure of an input signal which might cause the NNs to lose accuracy leading, in the worst case to the loss of interesting events. CL has the potential to solve this issue, using large amounts of continuously streaming data to allow the network to recognise changes, and to learn and adapt to detector conditions. It has the potential to outperform traditional NN training techniques as not all possible scenarios can be predicted and modelled in static training data samples. However, NN training is computationally expensive and when combined with the strict timing requirements of embedded processors deployed close to the detector, current state-of-the-art offline approaches cannot be directly applied to the real-time systems. Alternatives to typical backpropagation-based training that can be deployed on FPGAs for real-time data processing are presented, and their computational and accuracy characteristics are discussed in the context of High-Luminosity LHC.