Adaptive ensemble optimization for memory-related hyperparameters in retraining DNN at edge

Journal Article (2025)

Author(s)

Yidong Xu (Beijing Institute of Technology)

Rui Han (Beijing Institute of Technology)

Xiaojiang Zuo (Beijing Institute of Technology)

Junyan Ouyang (Beijing Institute of Technology)

Chi Harold Liu (Beijing Institute of Technology)

Y. Chen (TU Delft - Data-Intensive Systems)

Research Group

Data-Intensive Systems

DOI related publication

https://doi.org/10.1016/j.future.2024.107600

Edge computing; Deep neural networks (DNN); Memory-related hyperparameters; Model retraining

To reference this document use:

https://resolver.tudelft.nl/uuid:c917df5d-81e7-4138-b635-df2a777d5fe1

Publication Year

2025

Language

English

Bibliographical Note

Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care. Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

Volume number

164

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Edge applications are increasingly powered by deep neural networks (DNNs) and must adapt or retrain their models when input data domains and learning tasks change. Existing techniques enable DNN retraining on edge devices by configuring memory-related hyperparameters, termed m-hyperparameters, through batch size reduction, parameter freezing, and gradient checkpointing. While these methods show promising results for static DNNs, little is known about how to optimize all of their m-hyperparameters online and opportunistically, especially for the retraining tasks of edge applications. In this paper, we propose MPOptimizer, which jointly optimizes an ensemble of m-hyperparameters according to the input distribution and the edge resources available at runtime. The key feature of MPOptimizer is its ability to easily emulate the execution of retraining tasks under different m-hyperparameters and thus effectively estimate their influence on task performance. We implement MPOptimizer on prevalent DNNs and demonstrate its effectiveness against state-of-the-art techniques: it successfully finds the best configuration, improving model accuracy by an average of 13% (up to 25.3%) while reducing memory and training time by 4.1x and 5.3x, respectively, at the same model accuracy.
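
To make the notion of m-hyperparameters concrete, the following is a minimal sketch of how batch size, parameter freezing, and gradient checkpointing might jointly determine whether a retraining configuration fits a memory budget. It is not MPOptimizer and is not taken from the paper: the names MConfig, estimate_memory_mb, and feasible_configs, the per-layer cost constants, and the cost model itself are all hypothetical and chosen only for illustration.

```python
import math
from dataclasses import dataclass
from itertools import product


@dataclass(frozen=True)
class MConfig:
    """One hypothetical combination of memory-related hyperparameters."""
    batch_size: int
    frozen_layers: int      # number of leading layers whose weights are frozen
    checkpointing: bool     # whether gradient checkpointing is enabled


def estimate_memory_mb(cfg, num_layers=12, weight_mb_per_layer=4.0,
                       act_mb_per_sample_layer=0.5):
    """Toy memory estimate (assumed cost model, not measured on a device)."""
    trainable = num_layers - cfg.frozen_layers
    # Weights are held for every layer, frozen or not.
    weights = num_layers * weight_mb_per_layer
    # Assumption: gradient plus one optimizer state per trainable weight.
    grads_and_state = trainable * weight_mb_per_layer * 2
    # Assumption: activations are kept only for trainable layers, and
    # checkpointing keeps roughly sqrt(n) of them, recomputing the rest.
    stored_layers = trainable
    if cfg.checkpointing:
        stored_layers = math.ceil(math.sqrt(trainable))
    activations = cfg.batch_size * stored_layers * act_mb_per_sample_layer
    return weights + grads_and_state + activations


def feasible_configs(budget_mb, batch_sizes=(8, 16, 32),
                     frozen_options=(0, 6, 9), num_layers=12):
    """Enumerate the small grid and keep configurations within the budget."""
    grid = (MConfig(b, f, c)
            for b, f, c in product(batch_sizes, frozen_options, (False, True)))
    return [cfg for cfg in grid
            if estimate_memory_mb(cfg, num_layers) <= budget_mb]


if __name__ == "__main__":
    for cfg in feasible_configs(budget_mb=150):
        print(cfg, estimate_memory_mb(cfg))
```

In this toy model the three knobs trade off differently: a smaller batch shrinks only activation memory, freezing removes gradient and optimizer state for the frozen layers, and checkpointing reduces stored activations at the price of recomputation time, which the sketch does not model. According to the abstract, MPOptimizer goes further by emulating retraining under different configurations to estimate their effect on task performance.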

Files

1-s2.0-S0167739X24005648-main.... (pdf)

(pdf | 3.64 Mb)

- Embargo expired on 10-05-2025

License info not available