Sustainable Machine Learning Retraining

Optimizing Energy Efficiency Without Compromising Accuracy

Conference Paper (2025)
Author(s)

Lorena Poenaru-Olaru

June Sallou (TU Delft - Software Engineering, Wageningen University & Research)

Luis Cruz (TU Delft - Software Engineering)

Jan S. Rellermeyer (TU Delft - Data-Intensive Systems, Leibniz Universität)

Arie Van Deursen (TU Delft - Software Engineering)

Research Group
Software Engineering
DOI related publication
https://doi.org/10.1109/ICT4S68164.2025.00019
More Info
expand_more
Publication Year
2025
Language
English
Research Group
Software Engineering
Bibliographical Note
Green Open Access added to TU Delft Institutional Repository as part of the Taverne amendment. More information about this copyright law amendment can be found at https://www.openaccess.nl. Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.
Pages (from-to)
100-111
Publisher
IEEE
ISBN (electronic)
9798331587178
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

The reliability of machine learning (ML) software systems is heavily influenced by changes in data over time. For that reason, ML systems require regular maintenance, typically based on model retraining. However, retraining requires significant computational demand, which makes it energy-intensive and raises concerns about its environmental impact. To understand which retraining techniques should be considered when designing sustainable ML applications, in this work, we study the energy consumption of common retraining techniques. Since the accuracy of ML systems is also essential, we compare retraining techniques in terms of both energy efficiency and accuracy. We showcase that retraining with only the most recent data compared to all available data reduces energy consumption by up to 25%, being a sustainable alternative to the status quo. Furthermore, our findings show that retraining a model only when there is evidence that updates are necessary, rather than on a fixed schedule, can reduce energy consumption by up to 40%, provided a reliable data change detector is in place. Our findings pave the way for better recommendations for ML practitioners, guiding them toward more energy-efficient retraining techniques when designing sustainable ML software systems.

Files

Sustainable_Machine_Learning_R... (pdf)
(pdf | 0.551 Mb)
- Embargo expired in 31-03-2025
License info not available