Differentially Private Topological Data Analysis

被引:0
作者
Kang, Taegyu [1 ]
Kim, Sehwan [2 ]
Sohn, Jinwon [1 ]
Awan, Jordan [1 ]
机构
[1] Purdue Univ, Dept Stat, W Lafayette, IN 47907 USA
[2] Harvard Med Sch, Harvard Pilgrim Hlth Care Inst, Dept Populat Med, Boston, MA 02215 USA
关键词
& Ccaron; ech complex; Distance to a measure; Exponential mechanism; Persistence diagram; Persistent homology; PERSISTENT HOMOLOGY; CONVERGENCE; DISTANCE;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper is the first to attempt differentially private (DP) topological data analysis (TDA), producing near-optimal private persistence diagrams. We analyze the sensitivity of persistence diagrams in terms of the bottleneck distance, and we show that the commonly used & Ccaron;ech complex has sensitivity that does not decrease as the sample size n increases. This makes it challenging for the persistence diagrams of & Ccaron;ech complexes to be privatized. As an alternative, we show that the persistence diagram obtained by the L 1 -distance to measure (DTM) has sensitivity O (1 /n ) . Based on the sensitivity analysis, we propose using the exponential mechanism whose utility function is defined in terms of the bottleneck distance of the L 1 -DTM persistence diagrams. We also derive upper and lower bounds of the accuracy of our privacy mechanism; the obtained bounds indicate that the privacy error of our mechanism is near-optimal. We demonstrate the performance of our privatized persistence diagrams through simulations as well as on a real data set tracking human movement.
引用
收藏
页数:42
相关论文
共 54 条
[1]   Deep Learning with Differential Privacy [J].
Abadi, Martin ;
Chu, Andy ;
Goodfellow, Ian ;
McMahan, H. Brendan ;
Mironov, Ilya ;
Talwar, Kunal ;
Zhang, Li .
CCS'16: PROCEEDINGS OF THE 2016 ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2016, :308-318
[2]   DTM-Based Filtrations [J].
Anai, Hirokazu ;
Chazal, Frederic ;
Glisse, Marc ;
Ike, Yuichi ;
Inakoshi, Hiroya ;
Tinarrage, Raphael ;
Umeda, Yuhei .
TOPOLOGICAL DATA ANALYSIS, ABEL SYMPOSIUM 2018, 2020, 15 :33-66
[3]   Privacy-Preserving Parametric Inference: A Case for Robust Statistics [J].
Avella-Medina, Marco .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2021, 116 (534) :969-983
[4]  
Awan J, 2023, J MACH LEARN RES, V24, P1
[5]  
Awan J, 2023, Arxiv, DOI arXiv:2208.06236
[6]  
Awan Jordan, 2019, PR MACH LEARN RES, V97
[7]  
Balle B, 2022, J ROY STAT SOC B, V84, P37, DOI 10.1111/rssb.12455
[8]  
Betthauser Leo, 2022, arXiv
[9]   DISTANCE FUNCTIONS, CRITICAL POINTS, AND THE TOPOLOGY OF RANDOM CECH COMPLEXES [J].
Bobrowski, Omer ;
Adler, Robert J. .
HOMOLOGY HOMOTOPY AND APPLICATIONS, 2014, 16 (02) :311-344
[10]  
Bredon GlenE., 1997, Topology and Geometry