NOT ALL NOISE IS ACCOUNTED EQUALLY: HOW DIFFERENTIALLY PRIVATE LEARNING BENEFITS FROM LARGE SAMPLING RATES

被引：3

作者：

Dorman, Friedrich ^{[1
]}

Frisk, Osvald ^{[1
]}

Andersen, Lars Norvang ^{[2
]}

Pedersen, Christian Fischer ^{[1
]}

机构：

[1] Aarhus Univ, Dept Elect & Comp Engn, Aarhus, Denmark

[2] Aarhus Univ, Dept Math, Aarhus, Denmark

来源：

2021 IEEE 31ST INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP) | 2021年

关键词：

Deep Learning; Privacy; Differential Privacy; Stochastic Gradient Descent; Gradient Noise;

D O I：

10.1109/MLSP52302.2021.9596307

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Learning often involves sensitive data and as such, privacy preserving extensions to Stochastic Gradient Descent (SGD) and other machine learning algorithms have been developed using the definitions of Differential Privacy (DP). In differentially private SGD, the gradients computed at each training iteration are subject to two different types of noise. Firstly, inherent sampling noise arising from the use of minibatches. Secondly, additive Gaussian noise from the underlying mechanisms that introduce privacy. In this study, we show that these two types of noise are equivalent in their effect on the utility of private neural networks, however they are not accounted for equally in the privacy budget. Given this observation, we propose a training paradigm that shifts the proportions of noise towards less inherent and more additive noise, such that more of the overall noise can be accounted for in the privacy budget. With this paradigm, we are able to improve on the state-of-the-art in the privacy/utility tradeoff of private end-to-end CNNs.

引用

页数：6