Federated Learning for Tabular Data

Exploring Potential Risk to Privacy

Conference Paper (2022)

Author(s)

Han Wu (Newcastle University)

Zilong Zhao (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Lydia Y. Chen (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Aad van Moorsel (University of Birmingham)

Research Group

Data-Intensive Systems

Keywords

Federated learning, GAN, Privacy, Tabular Data

DOI related publication

https://doi.org/10.1109/ISSRE55969.2022.00028 (final published version)

To reference this document use

https://resolver.tudelft.nl/uuid:bd5b3627-0f99-4fba-9cdb-5dc846899107

Publication Year

2022

Language

English

Pages (from-to)

193-204

ISBN (print)

978-1-6654-5133-8

ISBN (electronic)

978-1-6654-5132-1

Event

2022 IEEE 33rd International Symposium on Software Reliability Engineering (ISSRE) (2022-10-31 - 2022-11-03), Charlotte, United States

Collections

Institutional Repository

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Federated Learning (FL) has emerged as a potentially powerful privacy-preserving machine learning methodology, since participants exchange model parameters instead of data. FL has traditionally been applied to image, voice and similar data, but it has recently started to draw attention from domains such as financial services, where the data is predominantly tabular. However, work on tabular data has not yet considered potential attacks, in particular attacks using Generative Adversarial Networks (GANs), which have been successfully applied to FL for non-tabular data. This paper is the first to explore leakage of private data in FL systems that process tabular data. We design a GAN-based attack model which can be deployed on a malicious client to reconstruct data and its properties from other participants. As a side effect of considering tabular data, we are able to assess the efficacy of the attack statistically, without relying on human observation as is done in FL for images. We implement our attack model in a recently developed generic FL software framework for tabular data processing. The experimental results demonstrate the effectiveness of the proposed attack model, suggesting that further research is required to counter GAN-based privacy attacks.
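To illustrate what a statistical assessment of reconstructed tabular data could look like, the following is a minimal sketch, not the paper's evaluation procedure. The abstract does not name the metrics used, so the two measures here (a two-sample Kolmogorov-Smirnov statistic for numeric columns and total variation distance for categorical columns) are assumptions, as are the function names `ks_statistic` and `total_variation`. In both cases 0 means the reconstructed column matches the real one in distribution and 1 means the two are completely different.

```python
from bisect import bisect_right
from collections import Counter


def ks_statistic(real, reconstructed):
    """Two-sample Kolmogorov-Smirnov statistic for a numeric column:
    the largest gap between the two empirical CDFs (assumed metric)."""
    a, b = sorted(real), sorted(reconstructed)
    gap = 0.0
    # The empirical CDFs only change at observed values, so it is
    # enough to compare them at every value seen in either sample.
    for x in set(a) | set(b):
        cdf_a = bisect_right(a, x) / len(a)
        cdf_b = bisect_right(b, x) / len(b)
        gap = max(gap, abs(cdf_a - cdf_b))
    return gap


def total_variation(real, reconstructed):
    """Total variation distance between the category frequencies
    of a categorical column (assumed metric)."""
    p, q = Counter(real), Counter(reconstructed)
    categories = set(p) | set(q)
    # Counter returns 0 for categories missing from one sample.
    return 0.5 * sum(
        abs(p[c] / len(real) - q[c] / len(reconstructed)) for c in categories
    )


if __name__ == "__main__":
    # Hypothetical columns, for illustration only.
    real_age = [23, 35, 41, 52, 60]
    reconstructed_age = [25, 33, 44, 50, 61]
    real_job = ["clerk", "clerk", "nurse", "driver"]
    reconstructed_job = ["clerk", "nurse", "nurse", "driver"]
    print("KS (age):", ks_statistic(real_age, reconstructed_age))
    print("TVD (job):", total_variation(real_job, reconstructed_job))
```

Low values on every column would indicate that the attacker's reconstructed data closely follows the victim's data distribution, which is the kind of judgement that needs human inspection in the image setting.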

Files

Federated_Learning_for_Tabular... (pdf)

(pdf | 1.51 Mb)

- Embargo expired on 01-07-2023

License info not available