THE COST OF PRIVACY: OPTIMAL RATES OF CONVERGENCE FOR PARAMETER ESTIMATION WITH DIFFERENTIAL PRIVACY

被引:55
|
作者
Cai, T. Tony [1 ]
Wang, Yichen [1 ]
Zhang, Linjun [2 ]
机构
[1] Univ Penn, Wharton Sch, Dept Stat, Philadelphia, PA 19104 USA
[2] Rutgers State Univ, Dept Stat, New Brunswick, NJ USA
来源
ANNALS OF STATISTICS | 2021年 / 49卷 / 05期
关键词
High-dimensional data; differential privacy; mean estimation; linear regression; mini- max optimality; STATISTICAL ESTIMATION;
D O I
10.1214/21-AOS2058
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Privacy-preserving data analysis is a rising challenge in contemporary statistics, as the privacy guarantees of statistical methods are often achieved at the expense of accuracy. In this paper, we investigate the tradeoff between statistical accuracy and privacy in mean estimation and linear regression, under both the classical low-dimensional and modern high-dimensional settings. A primary focus is to establish minimax optimality for statistical estimation with the (s, 8)-differential privacy constraint. By refining the "tracing adversary" technique for lower bounds in the theoretical computer science literature, we improve existing minimax lower bound for low-dimensional mean estimation and establish new lower bounds for high-dimensional mean estimation and linear regression problems. We also design differentially private algorithms that attain the minimax lower bounds up to logarithmic factors. In particular, for high-dimensional linear regression, a novel private iterative hard thresholding algorithm is proposed. The numerical performance of differentially private algorithms is demonstrated by simulation studies and applications to real data sets.
引用
收藏
页码:2825 / 2850
页数:26
相关论文
共 50 条
  • [1] Privacy-preserving Statistical Estimation with Optimal Convergence Rates [Extended Abstract]
    Smith, Adam
    STOC 11: PROCEEDINGS OF THE 43RD ACM SYMPOSIUM ON THEORY OF COMPUTING, 2011, : 813 - 821
  • [2] GEOMETRIZING RATES OF CONVERGENCE UNDER LOCAL DIFFERENTIAL PRIVACY CONSTRAINTS
    Rohde, Angelika
    Steinberger, Lukas
    ANNALS OF STATISTICS, 2020, 48 (05): : 2646 - 2670
  • [3] Minimax rates of convergence for sliced inverse regression with differential privacy
    Zhao, Wenbiao
    Zhu, Xuehu
    Zhu, Lixing
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2025, 201
  • [4] Optimal Distribution of Privacy Budget in Differential Privacy
    Bkakria, Anis
    Tasidou, Aimilia
    Cuppens-Boulahia, Nora
    Cuppens, Frederic
    Bouattour, Fatma
    Ben Fredj, Feten
    RISKS AND SECURITY OF INTERNET AND SYSTEMS, 2019, 11391 : 222 - 236
  • [5] Optimal Algorithms for Mean Estimation under Local Differential Privacy
    Asi, Hilal
    Feldman, Vitaly
    Talwar, Kunal
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [6] Minimax Optimal Estimation of Approximate Differential Privacy on Neighboring Databases
    Liu, Xiyang
    Oh, Sewoong
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [7] Instance-optimal Mean Estimation Under Differential Privacy
    Huang, Ziyue
    Liang, Yuting
    Yi, Ke
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [8] The Optimal Mechanism in Differential Privacy
    Geng, Quan
    Viswanath, Pramod
    2014 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2014, : 2371 - 2375
  • [9] Optimal Schemes for Discrete Distribution Estimation under Local Differential Privacy
    Ye, Min
    Barg, Alexander
    2017 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2017, : 759 - 763
  • [10] Optimal Schemes for Discrete Distribution Estimation Under Locally Differential Privacy
    Ye, Min
    Barg, Alexander
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2018, 64 (08) : 5662 - 5676