Performance comparison of different federated learning aggregation algorithms

How does the performance of different federated learning aggregation algorithms compare to each other?

Bachelor Thesis (2023)
Author(s)

R. Katz (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

Marcel J T Reinders – Mentor (TU Delft - Pattern Recognition and Bioinformatics)

S.J.F. Garst – Mentor (TU Delft - Pattern Recognition and Bioinformatics)

Lydia Y. Chen – Graduation committee member (TU Delft - Data-Intensive Systems)

Faculty
Electrical Engineering, Mathematics and Computer Science
Copyright
© 2023 Roy Katz
More Info
expand_more
Publication Year
2023
Language
English
Copyright
© 2023 Roy Katz
Graduation Date
28-06-2023
Awarding Institution
Delft University of Technology
Project
['CSE3000 Research Project']
Programme
['Computer Science and Engineering']
Faculty
Electrical Engineering, Mathematics and Computer Science
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Federated learning enables the construction of machine learning models, while adhering to privacy constraints and without sharing data between different devices. It is achieved by creating a machine learning model on each device that contains data, and then combining these models through an aggregation algorithm without sharing the data. Federated learning is currently a hot topic, and a lot of research has gone into implementing accurate aggregation algorithms. The original algorithm is FedAvg, and since then many different algorithms have been introduced. In this paper, I will compare the performance of five different aggregation algorithms: FedAvg, FedProx, FedYogi, FedMedian and q-FedAvg. The algorithms are compared on different data sets, namely MNIST and a kinase inhibition data set, as well as on different data distributions and number of clients. The experiments indicate that among these five algorithms, FedYogi achieves the best performance, both in terms of highest final accuracy as well as in terms of convergence rate.

Files

CSE3000_Paper_1_.pdf
(pdf | 0.768 Mb)
License info not available