Data-Centric Green AI An Exploratory Empirical Study

Conference Paper (2022)
Author(s)

Roberto Verdecchia (Vrije Universiteit Amsterdam)

Luis Cruz (TU Delft - Software Engineering)

June Sallou (University of Rennes)

Michelle Lin (McGill University)

James Wickenden (University of Bristol)

Estelle Hotellier (Inria)

Research Group
Software Engineering
Copyright
© 2022 Roberto Verdecchia, Luis Cruz, June Sallou, Michelle Lin, James Wickenden, Estelle Hotellier
DOI related publication
https://doi.org/10.1109/ICT4S55073.2022.00015
More Info
expand_more
Publication Year
2022
Language
English
Copyright
© 2022 Roberto Verdecchia, Luis Cruz, June Sallou, Michelle Lin, James Wickenden, Estelle Hotellier
Research Group
Software Engineering
Pages (from-to)
35-45
ISBN (print)
978-1-6654-8287-5
ISBN (electronic)
978-1-6654-8286-8
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

With the growing availability of large-scale datasets, and the popularization of affordable storage and computational capabilities, the energy consumed by AI is becoming a growing concern. To address this issue, in recent years, studies have focused on demonstrating how AI energy efficiency can be improved by tuning the model training strategy. Nevertheless, how modifications applied to datasets can impact the energy consumption of AI is still an open question.To fill this gap, in this exploratory study, we evaluate if data-centric approaches can be utilized to improve AI energy efficiency. To achieve our goal, we conduct an empirical experiment, executed by considering 6 different AI algorithms, a dataset comprising 5,574 data points, and two dataset modifications (number of data points and number of features).Our results show evidence that, by exclusively conducting modifications on datasets, energy consumption can be drastically reduced (up to 92.16%), often at the cost of a negligible or even absent accuracy decline. As additional introductory results, we demonstrate how, by exclusively changing the algorithm used, energy savings up to two orders of magnitude can be achieved.In conclusion, this exploratory investigation empirically demonstrates the importance of applying data-centric techniques to improve AI energy efficiency. Our results call for a research agenda that focuses on data-centric techniques, to further enable and democratize Green AI.

Files

Data_Centric_Green_AI_An_Explo... (pdf)
(pdf | 0.988 Mb)
- Embargo expired in 19-01-2023
License info not available