Machine Learning with Membership Privacy using Adversarial Regularization

被引:226
作者
Nasr, Milad [1 ]
Shokri, Reza [2 ]
Houmansadr, Amir [1 ]
机构
[1] Univ Massachusetts, Amherst, MA 01003 USA
[2] Natl Univ Singapore, Singapore, Singapore
来源
PROCEEDINGS OF THE 2018 ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY (CCS'18) | 2018年
关键词
Data privacy; Machine learning; Inference attacks; Membership privacy; Indistinguishability; Min-max game; Adversarial process;
D O I
10.1145/3243734.3243855
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Machine learning models leak significant amount of information about their training sets, through their predictions. This is a serious privacy concern for the users of machine learning as a service. To address this concern, in this paper, we focus on mitigating the risks of black-box inference attacks against machine learning models. We introduce a mechanism to train models with membership privacy, which ensures indistinguishability between the predictions of a model on its training data and other data points (from the same distribution). This requires minimizing the accuracy of the best black-box membership inference attack against the model. We formalize this as a min-max game, and design an adversarial training algorithm that minimizes the prediction loss of the model as well as the maximum gain of the inference attacks. This strategy, which can guarantee membership privacy (as prediction indistinguishability), acts also as a strong regularizer and helps generalizing the model. We evaluate the practical feasibility of our privacy mechanism on training deep neural networks using benchmark datasets. We show that the min-max strategy can mitigate the risks of membership inference attacks (near random guess), and can achieve this with a negligible drop in the model's prediction accuracy (less than 4%).
引用
收藏
页码:634 / 646
页数:13
相关论文
共 52 条
  • [1] [Anonymous], 2012, P 2012 ACM C COMP CO
  • [2] [Anonymous], INT C MACH LEARN
  • [3] [Anonymous], 2003, NONLINEAR PROGRAMMIN
  • [4] [Anonymous], ARXIV170202382
  • [5] [Anonymous], 2011, J MACHINE LEARNING R
  • [6] [Anonymous], P 21 INT C MACH LEAR
  • [7] [Anonymous], ARXIV180305847
  • [8] [Anonymous], SEC PRIV SP 2017 IEE
  • [9] [Anonymous], 2017, ARXIV170107875
  • [10] [Anonymous], 2016, USENIX SEC S