Addressing Statistical Heterogeneity through Generative Similarity-Based Comparison in Federated Learning
Aggregation Weight Modifications Using Latent Space Insights
H. Page (TU Delft - Electrical Engineering, Mathematics and Computer Science)
Swier Garst – Mentor (TU Delft - Pattern Recognition and Bioinformatics)
David M. J. Tax – Mentor (TU Delft - Pattern Recognition and Bioinformatics)
A. Voulimeneas – Graduation committee member (TU Delft - Cyber Security)
Abstract
Federated Learning (FL) is a distributed learning approach in which multiple clients collaboratively train a model whilst maintaining data security and privacy. One significant challenge in FL is statistical heterogeneity: data across different clients may not come from the same distribution, potentially leading to sub-optimal performance. To address this, we examine how insights gained from a generative model's latent space can mitigate these problems by adjusting the aggregation weight (influence) assigned to each client during training. We leverage information derived from a Variational Autoencoder (VAE) trained in a federated manner and propose a method to modify the aggregation weight of each client in FL. This method considers local discrepancies, arising from differences between the local and global latent space distributions, together with each client's dataset size. Experiments were conducted on the MNIST and Fashion-MNIST datasets. Our results indicate that the method improves model performance by up to 6.76%, in terms of reducing the average test VAE loss and accelerating the convergence of the β-VAE, in scenarios characterised by severe data imbalances among clients. However, it worsens performance when all clients have an equal level of imbalance. The source code for our research is available at https://github.com/FederatedRP2024Delft/Federated-Learning-PyTorch-Weight-Modification
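To make the aggregation idea concrete, below is a minimal sketch, not the authors' exact method, of how such weights might be computed. It assumes each client's latent space is summarised by a diagonal Gaussian (mean and log-variance), that the discrepancy is measured as a KL divergence from the global latent Gaussian, and that a hypothetical hyperparameter alpha controls how strongly divergent clients are down-weighted; the function name and all parameters are illustrative, not taken from the paper.

import torch

def aggregation_weights(dataset_sizes, local_mus, local_logvars,
                        global_mu, global_logvar, alpha=1.0):
    """Illustrative sketch: scale FedAvg weights down for clients whose
    latent distribution diverges from the global one.

    dataset_sizes: list[int], number of samples per client
    local_mus, local_logvars: per-client mean / log-variance of the
        aggregated latent posterior (diagonal Gaussians)
    global_mu, global_logvar: the same statistics pooled over all clients
    alpha: assumed hyperparameter controlling the discrepancy penalty
    """
    sizes = torch.tensor(dataset_sizes, dtype=torch.float)
    discrepancies = []
    for mu, logvar in zip(local_mus, local_logvars):
        # KL( N(mu, var) || N(global_mu, global_var) ) for diagonal Gaussians
        var, gvar = logvar.exp(), global_logvar.exp()
        kl = 0.5 * (gvar.log() - logvar
                    + (var + (mu - global_mu) ** 2) / gvar - 1).sum()
        discrepancies.append(kl)
    d = torch.stack(discrepancies)
    # Base FedAvg weights are proportional to dataset size; here they are
    # additionally attenuated exponentially with the latent discrepancy.
    w = sizes * torch.exp(-alpha * d)
    return w / w.sum()

With alpha = 0 this reduces to standard FedAvg weighting by dataset size, which matches the abstract's framing of the method as a modification of the usual size-based aggregation rather than a replacement for it.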