Pipetune

None, None; None, None; None, None; None, None; None, None; None, None

Pipetune

Pipeline parallelism of hyper and system parameters tuning for deep learning clusters

Conference Paper (2020)

Author(s)

Isabelly Rocha (University of Neuchâtel)

Nathaniel Morris (The Ohio State University)

Lydia Y. Chen (TU Delft - Data-Intensive Systems)

Pascal Felber (University of Neuchâtel)

Robert Birke (ABB (Switzerland))

Valerio Schiavoni (University of Neuchâtel)

Research Group

Data-Intensive Systems

DOI related publication

https://doi.org/10.1145/3423211.3425692

Parameter tuning Accuracy time trade-off Deep Neural Networks training

To reference this document use:

https://resolver.tudelft.nl/uuid:a7918a95-6059-4d1c-bedc-9e6b38cfa994

More Info

expand_more

Publication Year

2020

Language

English

Research Group

Data-Intensive Systems

Pages (from-to)

89-104

ISBN (print)

978-1-4503-8153-6

Abstract

DNN learning jobs are common in today's clusters due to the advances in AI driven services such as machine translation and image recognition. The most critical phase of these jobs for model performance and learning cost is the tuning of hyperparameters. Existing approaches make use of techniques such as early stopping criteria to reduce the tuning impact on learning cost. However, these strategies do not consider the impact that certain hyperparameters and systems parameters have on training time. This paper presents PIPETUNE, a framework for DNN learning jobs that addresses the trade-offs between these two types of parameters. PIPETUNE takes advantage of the high parallelism and recurring characteristics of such jobs to minimize the learning cost via a pipelined simultaneous tuning of both hyper and system parameters. Our experimental evaluation using three different types of workloads indicates that PIPETUNE achieves up to 22.6% reduction and 1.7× speed up on tuning and training time, respectively. PipeTune not only improves performance but also lowers energy consumption up to 29%.

No files available

Metadata only record. There are no files for this record.