Membership Inference Attacks on Machine Learning: A Survey

被引:223
作者
Hu, Hongsheng [1 ]
Salcic, Zoran [1 ]
Sun, Lichao [2 ]
Dobbie, Gillian [1 ]
Yu, Philip S. [3 ]
Zhang, Xuyun [4 ]
机构
[1] Univ Auckland, Auckland, New Zealand
[2] Lehigh Univ, Bethlehem, PA 18015 USA
[3] Univ Illinois, Chicago, IL 60680 USA
[4] Macquarie Univ, N Ryde, NSW, Australia
关键词
Membership inference attacks; deep leaning; privacy risk; differential privacy; NEURAL-NETWORKS; PRIVACY;
D O I
10.1145/3523273
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Machine learning (ML) models have been widely applied to various applications, including image classification, text generation, audio recognition, and graph data analysis. However, recent studies have shown that ML models are vulnerable to membership inference attacks (MIAs), which aim to infer whether a data record was used to train a target model or not. MIAs on ML models can directly lead to a privacy breach. For example, via identifying the fact that a clinical record that has been used to train a model associated with a certain disease, an attacker can infer that the owner of the clinical record has the disease with a high chance. In recent years, MIAs have been shown to be effective on various ML models, e.g., classification models and generative models. Meanwhile, many defense methods have been proposed to mitigate MIAs. Although MIAs on ML models form a newly emerging and rapidly growing research area, there has been no systematic survey on this topic yet. In this article, we conduct the first comprehensive survey on membership inference attacks and defenses. We provide the taxonomies for both attacks and defenses, based on their characterizations, and discuss their pros and cons. Based on the limitations and gaps identified in this survey, we point out several promising future research directions to inspire the researchers who wish to follow this area. This survey not only serves as a reference for the research community but also provides a clear description for researchers outside this research domain. To further help the researchers, we have created an online resource repository, which we will keep updated with future relevant work. Interested readers can find the repository at https://github.com/HongshengHu/membership-inference-machine-learning-literature.
引用
收藏
页数:37
相关论文
共 246 条
[1]   Deep Learning with Differential Privacy [J].
Abadi, Martin ;
Chu, Andy ;
Goodfellow, Ian ;
McMahan, H. Brendan ;
Mironov, Ilya ;
Talwar, Kunal ;
Zhang, Li .
CCS'16: PROCEEDINGS OF THE 2016 ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2016, :308-318
[2]   A Survey of Unsupervised Generative Models for Exploratory Data Analysis and Representation Learning [J].
Abukmeil, Mohanad ;
Ferrari, Stefano ;
Genovese, Angelo ;
Piuri, Vincenzo ;
Scotti, Fabio .
ACM COMPUTING SURVEYS, 2021, 54 (05)
[3]  
[Anonymous], 2013, P 7 ACM C RECOMMENDE
[4]  
[Anonymous], 2011, Proceedings of the 2nd Workshop on Cognitive Modeling and Computational Linguistics
[5]  
[Anonymous], 2008, WORKSHOP FACES INREA
[6]  
Arjovsky M, 2020, Arxiv, DOI arXiv:1907.02893
[7]  
Ateniese Giuseppe, 2015, International Journal of Security and Networks, V10, P137
[8]  
Ba LJ, 2014, ADV NEUR IN, V27
[9]   walk2friends: Inferring Social Links from Mobility Profiles [J].
Backes, Michael ;
Humbert, Mathias ;
Pang, Jun ;
Zhang, Yang .
CCS'17: PROCEEDINGS OF THE 2017 ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2017, :1943-1957
[10]  
Bagmar Aadesh Mahavir, 2021, ICML WORKSHOP