Federated Learning With Heterogeneity-Aware Probabilistic Synchronous Parallel on Edge

None, None; None, None; None, None; None, None; None, None; None, None; None, None; None, None; None, None

Federated Learning With Heterogeneity-Aware Probabilistic Synchronous Parallel on Edge

Journal Article (2022)

Author(s)

Jianxin Zhao (Beijing Institute of Technology)

Rui Han (Beijing Institute of Technology)

Yongkai Yang (Beijing Engineering Research Center of Civil Aviation Big Data, Key Laboratory of Intelligent Passenger Service of Civil Aviation-CAAC)

Benjamin Catterall (University of Cambridge)

Chi Harold Liu (Beijing Institute of Technology)

Lydia Y. Chen (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Richard Mortier (University of Cambridge)

Jon Crowcroft (University of Cambridge)

Liang Wang (University of Cambridge)

Research Group

Data-Intensive Systems

Federated learning Distributed computing Edge computing Barrier control

DOI related publication

https://doi.org/10.1109/TSC.2021.3109910 Final published version

To reference this document use

https://resolver.tudelft.nl/uuid:828dd13b-2d57-4e96-b770-41b4b5c715ea

More Info

expand_more

Publication Year

2022

Language

English

Research Group

Data-Intensive Systems

Issue number

2

Volume number

15

Article number

9529051

Pages (from-to)

614-626

Downloads counter

296

Collections

Institutional Repository

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

With the massive amount of data generated from mobile devices and the increase of computing power of edge devices, the paradigm of Federated Learning has attracted great momentum. In federated learning, distributed and heterogeneous nodes collaborate to learn model parameters. However, while providing benefits such as privacy by design and reduced latency, the heterogeneous network present challenges to the synchronisation methods, or barrier control methods, used in training, regarding system progress and model convergence etc. The design of these barrier mechanisms is critical for the performance and scalability of federated learning systems. We propose a new barrier control technique called Probabilistic Synchronous Parallel (PSP). In contrast to existing mechanisms, it introduces a sampling primitive that composes with existing barrier control mechanisms to produce a family of mechanisms with improved convergence speed and scalability. Our proposal is supported with a convergence analysis of PSP-based SGD algorithm. In practice, we also propose heuristic techniques that further improve the efficiency of PSP. We evaluate the performance of proposed methods using the federated learning specific FEMNSIT dataset. The evaluation results show that PSP can effectively achieve good balance between system efficiency and model accuracy, mitigating the challenge of heterogeneity in federated learning.

Files

Federated_Learning_With_Hetero... (pdf)

(pdf | 2.1 Mb)

- Embargo expired in 01-07-2023

License info not available