Private measures, random walks, and synthetic data

被引:1
|
作者
Boedihardjo, March [1 ]
Strohmer, Thomas [2 ]
Vershynin, Roman [3 ]
机构
[1] Michigan State Univ, Dept Math, E Lansing, MI 48864 USA
[2] Univ Calif Davis, Dept Math, Davis, CA USA
[3] Univ Calif Irvine, Dept Math, Irvine, CA 92697 USA
关键词
Random walks; Differential privacy; Synthetic data;
D O I
10.1007/s00440-024-01279-z
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Differential privacy is a mathematical concept that provides an information-theoretic security guarantee. While differential privacy has emerged as a de facto standard for guaranteeing privacy in data sharing, the known mechanisms to achieve it come with some serious limitations. Utility guarantees are usually provided only for a fixed, a priori specified set of queries. Moreover, there are no utility guarantees for more complex-but very common-machine learning tasks such as clustering or classification. In this paper we overcome some of these limitations. Working with metric privacy, a powerful generalization of differential privacy, we develop a polynomial-time algorithm that creates a private measure from a data set. This private measure allows us to efficiently construct private synthetic data that are accurate for a wide range of statistical analysis tools. Moreover, we prove an asymptotically sharp min-max result for private measures and synthetic data in general compact metric spaces, for any fixed privacy budget epsilon \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varepsilon $$\end{document} bounded away from zero. A key ingredient in our construction is a new superregular random walk, whose joint distribution of steps is as regular as that of independent random variables, yet which deviates from the origin logarithmically slowly.
引用
收藏
页码:569 / 611
页数:43
相关论文
共 50 条
  • [31] Random random walks on the integers mod n
    Dai, JJ
    Hildebrand, MV
    STATISTICS & PROBABILITY LETTERS, 1997, 35 (04) : 371 - 379
  • [32] EINSTEIN RELATION FOR RANDOM WALKS IN RANDOM ENVIRONMENT
    Guo, Xiaoqin
    ANNALS OF PROBABILITY, 2016, 44 (01) : 324 - 359
  • [33] Random Walks and Bisections in Random Circulant Graphs
    Mans, Bernard
    Shparlinski, Igor E.
    LATIN 2012: THEORETICAL INFORMATICS, 2012, 7256 : 542 - 555
  • [34] On the speed of Random Walks among Random Conductances
    Berger, Noam
    Salvi, Michele
    ALEA-LATIN AMERICAN JOURNAL OF PROBABILITY AND MATHEMATICAL STATISTICS, 2013, 10 (02): : 1063 - 1083
  • [35] Collisions of random walks in dynamic random environments
    Halberstam, Noah
    Hutchcroft, Tom
    ELECTRONIC JOURNAL OF PROBABILITY, 2022, 27
  • [36] Shuffle Differential Private Data Aggregation for Random Population
    Wang, Shaowei
    Luo, Xuandi
    Qian, Yuqiu
    Zhu, Youwen
    Chen, Kongyang
    Chen, Qi
    Xin, Bangzhou
    Yang, Wei
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2023, 34 (05) : 1667 - 1681
  • [37] MULTIPLE RANDOM WALKS IN RANDOM REGULAR GRAPHS
    Cooper, Colin
    Frieze, Alan
    Radzik, Tomasz
    SIAM JOURNAL ON DISCRETE MATHEMATICS, 2009, 23 (04) : 1738 - 1761
  • [38] Random walks on a fractal solid
    Kozak, JJ
    JOURNAL OF STATISTICAL PHYSICS, 2000, 101 (1-2) : 405 - 414
  • [39] Counting trees with random walks
    Iacobelli, Giulio
    Figueiredo, Daniel R.
    Barbosa, Valmir C.
    EXPOSITIONES MATHEMATICAE, 2019, 37 (01) : 96 - 102
  • [40] Return of Fibonacci random walks
    Neunhaeuserer, Joerg
    STATISTICS & PROBABILITY LETTERS, 2017, 121 : 51 - 53