The Geometry of Differential Privacy: The Sparse and Approximate Cases

Cited by: 0
Authors
Nikolov, Aleksandar [1]
Talwar, Kunal [2]
Zhang, Li [2]
Affiliations
[1] Rutgers State Univ, Piscataway, NJ 08854 USA
[2] Microsoft Res SVC, Mountain View, CA 94043 USA
Source
STOC'13: PROCEEDINGS OF THE 2013 ACM SYMPOSIUM ON THEORY OF COMPUTING | 2013
Keywords
Differential Privacy; Convex Geometry; Statistical Estimation; Combinatorial Discrepancy; DATA RELEASE; MECHANISM
DOI
Not available
Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
We study trade-offs between accuracy and privacy in the context of linear queries over histograms. This is a rich class of queries that includes contingency tables and range queries and has been the focus of a long line of work. For a given set of $d$ linear queries over a database $x \in \mathbb{R}^N$, we seek the differentially private mechanism that has the minimum mean squared error. For pure differential privacy, [5, 32] give an $O(\log^2 d)$ approximation to the optimal mechanism. Our first contribution is to give an efficient $O(\log^2 d)$ approximation guarantee for the case of $(\epsilon, \delta)$-differential privacy. Our mechanism adds carefully chosen correlated Gaussian noise to the answers. We prove its approximation guarantee relative to the hereditary discrepancy lower bound of [44], using tools from convex geometry. We next consider the sparse case, when the number of queries exceeds the number of individuals in the database, i.e. when $d > n \triangleq \|x\|_1$. The lower bounds used in the previous approximation algorithm no longer apply; in fact, better mechanisms are known in this setting [7, 27, 28, 31, 49]. Our second main contribution is to give an efficient $(\epsilon, \delta)$-differentially private mechanism that, for any given query set $A$ and an upper bound $n$ on $\|x\|_1$, has mean squared error within $\mathrm{polylog}(d, N)$ of the optimal for $A$ and $n$. This approximation is achieved by coupling the Gaussian noise addition approach with linear regression over the $\ell_1$ ball. Additionally, we show a similar polylogarithmic approximation guarantee for the optimal $\epsilon$-differentially private mechanism in this sparse setting. Our work also shows that for arbitrary counting queries, i.e. $A$ with entries in $\{0, 1\}$, there is an $\epsilon$-differentially private mechanism with expected error $\tilde{O}(\sqrt{n})$ per query, improving on the $\tilde{O}(n^{2/3})$ bound of [7] and matching the lower bound implied by [15] up to logarithmic factors. The connection between the hereditary discrepancy and the privacy mechanism enables us to derive the first polylogarithmic approximation to the hereditary discrepancy of a matrix $A$.
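For orientation, the setting in the abstract (answering $d$ linear queries $Ax$ over a histogram $x$ with Gaussian noise) can be illustrated with the textbook Gaussian mechanism. This is a minimal sketch and not the authors' mechanism: it adds independent noise of one common scale rather than the carefully chosen correlated noise the abstract describes. It assumes neighboring histograms differ by at most 1 in $\ell_1$ norm and that $0 < \epsilon < 1$; the function names are hypothetical.

```python
import math
import random


def l2_sensitivity(A):
    """Largest Euclidean norm of a column of A.

    Assumption: neighboring histograms differ by 1 in a single cell,
    so A x moves by at most one column of A.
    """
    n_cols = len(A[0])
    return max(math.sqrt(sum(row[j] ** 2 for row in A)) for j in range(n_cols))


def gaussian_sigma(A, eps, delta):
    """Noise scale of the classical Gaussian mechanism (valid for 0 < eps < 1)."""
    if not (0 < eps < 1 and 0 < delta < 1):
        raise ValueError("this calibration assumes 0 < eps < 1 and 0 < delta < 1")
    return l2_sensitivity(A) * math.sqrt(2 * math.log(1.25 / delta)) / eps


def answer_queries(A, x, eps, delta, rng=None):
    """Return A x plus independent N(0, sigma^2) noise on every answer."""
    rng = rng or random.Random()
    sigma = gaussian_sigma(A, eps, delta)
    exact = [sum(a * v for a, v in zip(row, x)) for row in A]
    return [y + rng.gauss(0.0, sigma) for y in exact]


if __name__ == "__main__":
    A = [[1, 0, 1], [0, 1, 1]]   # two counting queries over three cells
    x = [5, 3, 2]                # histogram with n = 10 individuals
    print(answer_queries(A, x, eps=0.5, delta=1e-5, rng=random.Random(0)))
```

In this baseline every answer has mean squared error equal to `gaussian_sigma(A, eps, delta) ** 2`, regardless of the structure of $A$; the abstract's contribution is a noise distribution and a regression step that adapt to $A$ and $n$.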
Pages: 351-360
Page count: 10
Related papers
50 in total
  • [1] THE GEOMETRY OF DIFFERENTIAL PRIVACY: THE SMALL DATABASE AND APPROXIMATE CASES
    Nikolov, Aleksandar
    Talwar, Kunal
    Zhang, Li
    SIAM JOURNAL ON COMPUTING, 2016, 45 (02) : 575 - 616
  • [2] Bisimilarity Distances for Approximate Differential Privacy
    Chistikov, Dmitry
    Murawski, Andrzej S.
    Purser, David
    AUTOMATED TECHNOLOGY FOR VERIFICATION AND ANALYSIS (ATVA 2018), 2018, 11138 : 194 - 210
  • [3] Differential privacy for sparse classification learning
    Wang, Puyu
    Zhang, Hai
    NEUROCOMPUTING, 2020, 375 : 91 - 101
  • [4] Comparing approximate and probabilistic differential privacy parameters
    Guingona, Vincent
    Kolesnikov, Alexei
    Nierwinski, Julianne
    Schweitzer, Avery
    INFORMATION PROCESSING LETTERS, 2023, 182
  • [5] Fingerprinting Codes and the Price of Approximate Differential Privacy
    Bun, Mark
    Ullman, Jonathan
    Vadhan, Salil
    STOC'14: PROCEEDINGS OF THE 46TH ANNUAL 2014 ACM SYMPOSIUM ON THEORY OF COMPUTING, 2014, : 1 - 10
  • [6] FINGERPRINTING CODES AND THE PRICE OF APPROXIMATE DIFFERENTIAL PRIVACY
    Bun, Mark
    Ullman, Jonathan
    Vadhan, Salil
    SIAM JOURNAL ON COMPUTING, 2018, 47 (05) : 1888 - 1938
  • [7] Optimizing Batch Linear Queries under Exact and Approximate Differential Privacy
    Yuan, Ganzhao
    Zhang, Zhenjie
    Winslett, Marianne
    Xiao, Xiaokui
    Yang, Yin
    Hao, Zhifeng
    ACM TRANSACTIONS ON DATABASE SYSTEMS, 2015, 40 (02):
  • [8] Sparse Mobile Crowdsensing With Differential and Distortion Location Privacy
    Wang, Leye
    Zhang, Daqing
    Yang, Dingqi
    Lim, Brian Y.
    Han, Xiao
    Ma, Xiaojuan
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2020, 15 : 2735 - 2749
  • [9] On Sparse Linear Regression in the Local Differential Privacy Model
    Wang, Di
    Xu, Jinhui
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2021, 67 (02) : 1182 - 1200
  • [10] Differential Privacy for Free? Harnessing the Noise in Approximate Homomorphic Encryption
    Ogilvie, Tabitha
    TOPICS IN CRYPTOLOGY, CT-RSA 2024, 2024, 14643 : 292 - 315