Optimal data-independent noise for differential privacy

被引:59
作者
Soria-Comas, Jordi [1 ]
Domingo-Ferrer, Josep [1 ]
机构
[1] Univ Rovira & Virgili, Dept Comp Engn & Math, UNESCO Chair Data Privacy, E-43007 Tarragona, Catalonia, Spain
关键词
Data privacy; Differential privacy; Noise addition; Privacy-preserving data mining; Statistical disclosure control;
D O I
10.1016/j.ins.2013.07.004
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
epsilon-Differential privacy is a property that seeks to characterize privacy in data sets. It is formulated as a query-response method, and computationally achieved by output perturbation. Several noise-addition methods to implement such output perturbation have been proposed in the literature. We focus on data-independent noise, that is, noise whose distribution is constant across data sets. Our goal is to find the optimal data-independent noise distribution to achieve epsilon-differential privacy. We propose a general optimality criterion based on the concentration of the probability miss of the noise distribution around zero, and we show that any noise optimal under this criterion must be optimal under any other sensible criterion. We also show that the Laplace distribution, commonly used for noise in epsilon-differential privacy, is not optimal, and we build the optimal data-independent noise distribution. We compare the Laplace and the optimal data-independent noise distributions. For univariate query functions, both introduce a similar level of distortion; for multivariate query functions, optimal data-independent noise offers responses with substantially better data quality. (C) 2013 Elsevier Inc. All rights reserved.
引用
收藏
页码:200 / 214
页数:15
相关论文
共 17 条
[1]  
[Anonymous], 2003, P 22 ACM SIGMOD SIGA
[2]  
Blum A, 2008, ACM S THEORY COMPUT, P609
[3]  
Blum Avrim, 2005, P 24 ACM SIGMOD SIGA, P128, DOI [DOI 10.1145/1065167.1065184, 10.1145/1065167.1065184]
[4]  
Dwork C, 2004, LECT NOTES COMPUT SC, V3152, P528
[5]  
Dwork C., 2009, Journal of Privacy and Confidentiality, V1, P135
[6]   Differential privacy: A survey of results [J].
Dwork, Cynthia .
THEORY AND APPLICATIONS OF MODELS OF COMPUTATION, PROCEEDINGS, 2008, 4978 :1-19
[7]  
Dwork C, 2006, LECT NOTES COMPUT SC, V4052, P1
[8]   Calibrating noise to sensitivity in private data analysis [J].
Dwork, Cynthia ;
McSherry, Frank ;
Nissim, Kobbi ;
Smith, Adam .
THEORY OF CRYPTOGRAPHY, PROCEEDINGS, 2006, 3876 :265-284
[9]   A Firm Foundation for Private Data Analysis [J].
Dwork, Cynthia .
COMMUNICATIONS OF THE ACM, 2011, 54 (01) :86-95
[10]  
Dwork C, 2009, ACM S THEORY COMPUT, P381