Byzantine-Resilient Federated Computation of Differentially Private Summary Statistics

Conference Paper (2025)
Author(s)

Giulio Segalini (University of Neuchâtel, TU Delft - Electrical Engineering, Mathematics and Computer Science)

Maria Fernandes (Universidade de Lisboa, University of Copenhagen)

Jérémie Decouchant (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Research Group
Data-Intensive Systems
DOI related publication
https://doi.org/10.1145/3721462.3770766 Final published version
More Info
expand_more
Publication Year
2025
Language
English
Research Group
Data-Intensive Systems
Pages (from-to)
72-85
Publisher
ACM
ISBN (electronic)
9798400715549
Event
26th ACM International Middleware Conference, Middleware 2025 (2025-12-15 - 2025-12-19), Nashville, United States
Downloads counter
42
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Summary statistics are essential to analyse large datasets in various fields, including financial and medical research. Federated computations enhance statistical power by combining geo-distributed datasets while ensuring compliance with data protection regulations, privacy guarantees, and resilience against intrusions. We present Tides, a federated framework leveraging Trusted Execution Environments (TEEs) to defend against adversaries controlling up to f of the N datacenters. We present an instantiation of Tides using genomic (GWAS) statistics. We address TEE-specific attack vectors, including communication blocking and side-channel attacks. Tides follows the following three key steps: (1) TEEs share statistical results through reliable broadcast and run a randomized crash-tolerant binary consensus algorithm to identify the datasets that are available; (2) TEEs enforce differential privacy with ad hoc noise; and (3) TEEs run memory-oblivious algorithms to compute the final summary statistics. We implemented Tides with Intel SGX enclaves and demonstrated its practicality with three datasets.