INFERENCE USING NOISY DEGREES: DIFFERENTIALLY PRIVATE β-MODEL AND SYNTHETIC GRAPHS

被引:69
作者
Karwa, Vishesh [1 ]
Slavkovic, Aleksandra [2 ]
机构
[1] Carnegie Mellon Univ, Dept Stat, Pittsburgh, PA 15213 USA
[2] Penn State Univ, Dept Stat, University Pk, PA 16802 USA
基金
新加坡国家研究基金会; 美国国家科学基金会;
关键词
Degree sequence; differential privacy; beta-model; existence of MLE; measurement error; NUMBER;
D O I
10.1214/15-AOS1358
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The beta-model of random graphs is an exponential family model with the degree sequence as a sufficient statistic. In this paper, we contribute three key results. First, we characterize conditions that lead to a quadratic time algorithm to check for the existence of MLE of the beta-model, and show that the MLE never exists for the degree partition beta-model. Second, motivated by privacy problems with network data, we derive a differentially private estimator of the parameters of beta-model, and show it is consistent and asymptotically normally distributed-it achieves the same rate of convergence as the non-private estimator. We present an efficient algorithm for the private estimator that can be used to release synthetic graphs. Our techniques can also be used to release degree distributions and degree partitions accurately and privately, and to perform inference from noisy degrees arising from contexts other than privacy. We evaluate the proposed estimator on real graphs and compare it with a current algorithm for releasing degree distributions and find that it does significantly better. Finally, our paper addresses shortcomings of current approaches to a fundamental problem of how to perform valid statistical inference from data released by privacy mechanisms, and lays a foundational groundwork on how to achieve optimal and private statistical inference in a principled manner by modeling the privacy mechanism; these principles should be applicable to a class of models beyond the beta-model.
引用
收藏
页码:87 / 112
页数:26
相关论文
共 49 条
  • [1] [Anonymous], PREPRINT
  • [2] How likely is an i.i.d. degree sequence to be graphical?
    Arratia, R
    Liggett, TM
    [J]. ANNALS OF APPLIED PROBABILITY, 2005, 15 (1B) : 652 - 670
  • [3] Barndorff-Nielsen O., 1978, Information and Exponential Families in Statistical Theory
  • [4] Bhattacharya A, 2006, ELECTRON J COMB, V13
  • [5] A Sequential Importance Sampling Algorithm for Generating Random Graphs with Prescribed Degrees
    Blitzstein, Joseph
    Diaconis, Persi
    [J]. INTERNET MATHEMATICS, 2011, 6 (04) : 489 - 522
  • [6] Carroll R.J., 2006, Monographs on Statistics and Applied Probability, V105, P455, DOI [10.1201/9781420010138, DOI 10.1201/9781420010138]
  • [7] RANDOM GRAPHS WITH A GIVEN DEGREE SEQUENCE
    Chatterjee, Sourav
    Diaconis, Persi
    Sly, Allan
    [J]. ANNALS OF APPLIED PROBABILITY, 2011, 21 (04) : 1400 - 1435
  • [8] Duchi JC., 2013, PREPRINT
  • [9] Dwork C, 2006, LECT NOTES COMPUT SC, V4004, P486
  • [10] Calibrating noise to sensitivity in private data analysis
    Dwork, Cynthia
    McSherry, Frank
    Nissim, Kobbi
    Smith, Adam
    [J]. THEORY OF CRYPTOGRAPHY, PROCEEDINGS, 2006, 3876 : 265 - 284