Differentially Private Significance Tests for Regression Coefficients

被引:24
作者
Barrientos, Andres F. [1 ]
Reiter, Jerome P. [1 ]
Machanavajjhala, Ashwin [2 ]
Chen, Yan [2 ]
机构
[1] Duke Univ, Dept Stat Sci, Durham, NC 27708 USA
[2] Duke Univ, Dept Comp Sci, Durham, NC 27708 USA
基金
美国国家科学基金会;
关键词
Confidentiality; Disclosure; Laplace; Query; Synthetic; Verification; MICRODATA; DIAGNOSTICS; FRAMEWORK;
D O I
10.1080/10618600.2018.1538881
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Many data producers seek to provide users access to confidential data without unduly compromising data subjects' privacy and confidentiality. One general strategy is to require users to do analyses without seeing the confidential data; for example, analysts only get access to synthetic data or query systems that provide disclosure-protected outputs of statistical models. With synthetic data or redacted outputs, the analyst never really knows how much to trust the resulting findings. In particular, if the user did the same analysis on the confidential data, would regression coefficients of interest be statistically significant or not? We present algorithms for assessing this question that satisfy differential privacy. We describe conditions under which the algorithms should give accurate answers about statistical significance. We illustrate the properties of the proposed methods using artificial and genuine data. for this article are available online.
引用
收藏
页码:440 / 453
页数:14
相关论文
共 47 条
[1]  
Abowd J., 2006, TECH REP
[2]  
[Anonymous], ARXIV160602109
[3]  
[Anonymous], ARXIV160203090
[4]  
[Anonymous], ARXIV151206388
[5]  
[Anonymous], 2000, CAMBRIDGE SERIES STA
[6]  
[Anonymous], 2018, 9 INNOVATIONS THEORE, DOI [DOI 10.4230/LIPICS.ITCS.2018.44, 10.4230/LIPICS.ITCS.2018.44]
[7]   Private Empirical Risk Minimization: Efficient Algorithms and Tight Error Bounds [J].
Bassily, Raef ;
Smith, Adam ;
Thakurta, Abhradeep .
2014 55TH ANNUAL IEEE SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE (FOCS 2014), 2014, :464-473
[8]  
Bhattacharya R, 2016, A course in mathematical statistics and large sample theory
[9]   Differentially Private ANOVA Testing [J].
Campbell, Zachary ;
Bray, Andrew ;
Ritz, Anna ;
Groce, Adam .
2018 1ST INTERNATIONAL CONFERENCE ON DATA INTELLIGENCE AND SECURITY (ICDIS 2018), 2018, :281-285
[10]  
Chaudhuri A, 2009, PRODUCT RESEARCH: THE ART AND SCIENCE BEHIND SUCCESSFUL PRODUCT LAUNCHES, P289, DOI 10.1007/978-90-481-2860-0_16