A Survey of Differentially Private Regression for Clinical and Epidemiological Research

被引:4
作者
Ficek, Joseph [1 ]
Wang, Wei [2 ]
Chen, Henian [1 ]
Dagne, Getachew [1 ]
Daley, Ellen [1 ]
机构
[1] Univ South Florida USF, Coll Publ Hlth, Tampa, FL 33612 USA
[2] Ctr Addict & Mental Hlth CAMH, Toronto, ON M6J IH4, Canada
关键词
confidence intervals; data confidentiality; data privacy; differential privacy; hypothesis testing; regression; statistical disclosure limitation; CONFIDENTIALITY; TECHNOLOGY; NOISE;
D O I
10.1111/insr.12391
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Differential privacy is a framework for data analysis that provides rigorous privacy protections for database participants. It has increasingly been accepted as the gold standard for privacy in the analytics industry, yet there are few techniques suitable for statistical inference in the health sciences. This is notably the case for regression, one of the most widely used modelling tools in clinical and epidemiological studies. This paper provides an overview of differential privacy and surveys the literature on differentially private regression, highlighting the techniques that hold the most relevance for statistical inference as practiced in clinical and epidemiological research. Research gaps and opportunities for further inquiry are identified.
引用
收藏
页码:132 / 147
页数:16
相关论文
共 98 条
[1]  
Agresti A., 2012, CATEGORICAL DATA ANA, DOI DOI 10.1002/0471249688
[2]  
[Anonymous], 2003, Final NIH Statement on Sharing Research Data
[3]  
[Anonymous], 2013, Summary of the HIPAA Privacy Rule
[4]  
[Anonymous], 2014, Differentially private hypothesis testing for normal random variables
[5]  
[Anonymous], 2017, PROC 20 INT C ARTIF
[6]  
Awan J., 2018, ARXIVPREPRINTARXIV18
[7]  
Awan J., 2019, ARXIV180109236V3
[8]  
Barak B., 2007, P 26 ACM SIGMOD SIGA, P273
[9]   Differentially Private Significance Tests for Regression Coefficients [J].
Barrientos, Andres F. ;
Reiter, Jerome P. ;
Machanavajjhala, Ashwin ;
Chen, Yan .
JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2019, 28 (02) :440-453
[10]  
Bassily R., 2014, ARXIVPREPRINTARXIV14