A Survey on Distributed Machine Learning

Review (2020)
Author(s)

Joost Verbraeken (Student TU Delft)

Matthijs Wolting (Student TU Delft)

Jonathan Katzy (Student TU Delft)

Jeroen Kloppenburg (Student TU Delft)

Tim Verbelen (Universiteit Gent)

Jan S. Rellermeyer (TU Delft - Data-Intensive Systems)

Research Group
Data-Intensive Systems
Copyright
© 2020 Joost Verbraeken, Matthijs Wolting, Jonathan Katzy, Jeroen Kloppenburg, Tim Verbelen, Jan S. Rellermeyer
DOI related publication
https://doi.org/10.1145/3377454
More Info
expand_more
Publication Year
2020
Language
English
Copyright
© 2020 Joost Verbraeken, Matthijs Wolting, Jonathan Katzy, Jeroen Kloppenburg, Tim Verbelen, Jan S. Rellermeyer
Research Group
Data-Intensive Systems
Issue number
2
Volume number
53
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

The demand for artificial intelligence has grown significantly over the past decade, and this growth has been fueled by advances in machine learning techniques and the ability to leverage hardware acceleration. However, to increase the quality of predictions and render machine learning solutions feasible for more complex applications, a substantial amount of training data is required. Although small machine learning models can be trained with modest amounts of data, the input for training larger models such as neural networks grows exponentially with the number of parameters. Since the demand for processing training data has outpaced the increase in computation power of computing machinery, there is a need for distributing the machine learning workload across multiple machines, and turning the centralized into a distributed system. These distributed systems present new challenges: first and foremost, the efficient parallelization of the training process and the creation of a coherent model. This article provides an extensive overview of the current state-of-the-art in the field by outlining the challenges and opportunities of distributed machine learning over conventional (centralized) machine learning, discussing the techniques used for distributed machine learning, and providing an overview of the systems that are available.

Files

3377454.pdf
(pdf | 0.979 Mb)
- Embargo expired in 01-10-2020
License info not available