Private measures, random walks, and synthetic data

被引:1
|
作者
Boedihardjo, March [1 ]
Strohmer, Thomas [2 ]
Vershynin, Roman [3 ]
机构
[1] Michigan State Univ, Dept Math, E Lansing, MI 48864 USA
[2] Univ Calif Davis, Dept Math, Davis, CA USA
[3] Univ Calif Irvine, Dept Math, Irvine, CA 92697 USA
关键词
Random walks; Differential privacy; Synthetic data;
D O I
10.1007/s00440-024-01279-z
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Differential privacy is a mathematical concept that provides an information-theoretic security guarantee. While differential privacy has emerged as a de facto standard for guaranteeing privacy in data sharing, the known mechanisms to achieve it come with some serious limitations. Utility guarantees are usually provided only for a fixed, a priori specified set of queries. Moreover, there are no utility guarantees for more complex-but very common-machine learning tasks such as clustering or classification. In this paper we overcome some of these limitations. Working with metric privacy, a powerful generalization of differential privacy, we develop a polynomial-time algorithm that creates a private measure from a data set. This private measure allows us to efficiently construct private synthetic data that are accurate for a wide range of statistical analysis tools. Moreover, we prove an asymptotically sharp min-max result for private measures and synthetic data in general compact metric spaces, for any fixed privacy budget epsilon \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varepsilon $$\end{document} bounded away from zero. A key ingredient in our construction is a new superregular random walk, whose joint distribution of steps is as regular as that of independent random variables, yet which deviates from the origin logarithmically slowly.
引用
收藏
页码:569 / 611
页数:43
相关论文
共 50 条
  • [41] Random Walks on the Folded Hypercube
    Chen, Hong
    Li, Xiaoyan
    Lin, Cheng-Kuan
    JOURNAL OF INTERNET TECHNOLOGY, 2019, 20 (06): : 1987 - 1994
  • [42] Dynamical properties of random walks
    Messaoudi, Ali
    Valle, Glauco
    STOCHASTICS AND DYNAMICS, 2019, 19 (03)
  • [43] Deterministic walks in random environments
    Bunimovich, LA
    PHYSICA D-NONLINEAR PHENOMENA, 2004, 187 (1-4) : 20 - 29
  • [44] RANDOM WALKS AND CHEMICAL NETWORKS
    Malyshev, V. A.
    Pirogov, S. A.
    Rybko, A. N.
    MOSCOW MATHEMATICAL JOURNAL, 2004, 4 (02) : 441 - 453
  • [45] Random walks on spatial networks
    Dou Fei-Ling
    Hu Yan-Qing
    Li Yong
    Fan Ying
    Di Zeng-Ru
    ACTA PHYSICA SINICA, 2012, 61 (17)
  • [46] Lamplighter Random Walks on Fractals
    Takashi Kumagai
    Chikara Nakamura
    Journal of Theoretical Probability, 2018, 31 : 68 - 92
  • [47] Random walks for image segmentation
    Grady, Leo
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2006, 28 (11) : 1768 - 1783
  • [48] Random walks on generalized lattices
    Salvatori, M
    MONATSHEFTE FUR MATHEMATIK, 1996, 121 (1-2): : 145 - 161
  • [49] Simplicial branching random walks
    Rosenthal R.
    Journal of Applied and Computational Topology, 2024, 8 (6) : 1751 - 1791
  • [50] Random walks through poetry
    Choi, Jeanne Devautour
    Boury, Samuel
    DIGITAL CREATIVITY, 2022, 33 (02) : 157 - 170