Accurate Differentially Private Deep Learning on the Edge


Abstract

Deep learning (DL) models are increasingly built on federated edge participants holding local data. To enable insight extraction without the risk of information leakage, DL training is usually combined with differential privacy (DP). The core idea is to trade off learning accuracy by adding statistically calibrated noise, particularly to the local gradients of edge learners, during model training. However, this privacy guarantee degrades model accuracy, due both to the edge learners' local noise and to the global noise aggregated at the central server. Existing DP frameworks for the edge focus on local noise calibration via gradient-clipping techniques, overlooking the heterogeneity and dynamic changes of local gradients and their aggregated impact on accuracy. In this article, we present a systematic analysis that unveils the factors capable of mitigating local and aggregated noise, and we design PrivateDL to leverage these factors in noise calibration so as to improve model accuracy while fulfilling the privacy guarantee. PrivateDL features: (i) sampling-based sensitivity estimation for local noise calibration and (ii) combining large batch sizes with critical data identification in global training. We implement PrivateDL on the popular Laplace/Gaussian DP mechanisms and demonstrate its effectiveness on Intel BigDL workloads, improving model accuracy by up to 5X compared against existing DP frameworks.
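To make the mechanism concrete, below is a minimal Python/NumPy sketch of the two ingredients the abstract names: the standard clip-and-add-Gaussian-noise step that baseline edge DP frameworks apply to local gradients, and a sampling-based sensitivity estimate in the spirit of PrivateDL's first feature. The function names, the clipping bound, and the noise multiplier are illustrative assumptions, not the paper's actual implementation; a data-dependent sensitivity estimate also requires additional privacy accounting, which this sketch omits.

import numpy as np

def clip_and_noise(per_sample_grads, clip_norm, noise_multiplier, rng):
    # Baseline DP-SGD-style step (illustrative): clip each per-sample
    # gradient to L2 norm clip_norm, average, then add Gaussian noise
    # calibrated to the clipping bound.
    n = len(per_sample_grads)
    clipped = [g * min(1.0, clip_norm / (np.linalg.norm(g) + 1e-12))
               for g in per_sample_grads]
    avg = np.mean(clipped, axis=0)
    # The sensitivity of the clipped average is clip_norm / n.
    sigma = noise_multiplier * clip_norm / n
    return avg + rng.normal(0.0, sigma, size=avg.shape)

def sampled_sensitivity(per_sample_grads, sample_frac, rng):
    # Hypothetical sampling-based estimate: gauge the gradient scale from
    # a random subset instead of a fixed, worst-case clipping bound.
    n = len(per_sample_grads)
    idx = rng.choice(n, size=max(1, int(sample_frac * n)), replace=False)
    return max(np.linalg.norm(per_sample_grads[i]) for i in idx)

# Toy usage: estimate a data-driven bound, then calibrate noise to it.
rng = np.random.default_rng(0)
grads = [rng.normal(size=128) for _ in range(32)]
bound = sampled_sensitivity(grads, sample_frac=0.25, rng=rng)
noisy_avg = clip_and_noise(grads, clip_norm=bound, noise_multiplier=1.1, rng=rng)

The intuition this sketch captures is the tradeoff the abstract describes: a tighter, data-driven sensitivity bound injects less noise per training step, which is how better noise calibration can recover model accuracy while preserving the DP guarantee.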