A Convergent Differentially Private k-Means Clustering Algorithm

被引:13
作者
Lu, Zhigang [1 ]
Shen, Hong [1 ,2 ]
机构
[1] Univ Adelaide, Sch Comp Sci, Adelaide, SA, Australia
[2] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou, Peoples R China
来源
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2019, PT I | 2019年 / 11439卷
基金
澳大利亚研究理事会; 国家重点研发计划;
关键词
Differential privacy; Adversarial machine learning; k-means clustering;
D O I
10.1007/978-3-030-16148-4_47
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Preserving differential privacy (DP) for the iterative clustering algorithms has been extensively studied in the interactive and the non-interactive settings. However, existing interactive differentially private clustering algorithms suffer from a non-convergence problem, i.e., these algorithms may not terminate without a predefined number of iterations. This problem severely impacts the clustering quality and the efficiency of the algorithm. To resolve this problem, we propose a novel iterative approach in the interactive settings which controls the orientation of the centroids movement over the iterations to ensure the convergence by injecting DP noise in a selected area. We prove that, in the expected case, our approach converges to the same centroids as Lloyd's algorithm in at most twice the iterations of Lloyd's algorithm. We perform experimental evaluations on real-world datasets to show that our algorithm outperforms the state-of-the-art of the interactive differentially private clustering algorithms with a guaranteed convergence and better clustering quality to meet the same DP requirement.
引用
收藏
页码:612 / 624
页数:13
相关论文
共 50 条
  • [1] Differentially Private K-Means Clustering
    Su, Dong
    Cao, Jianneng
    Li, Ninghui
    Bertino, Elisa
    Jin, Hongxia
    CODASPY'16: PROCEEDINGS OF THE SIXTH ACM CONFERENCE ON DATA AND APPLICATION SECURITY AND PRIVACY, 2016, : 26 - 37
  • [2] Optimal Differentially Private Algorithms for k-Means Clustering
    Huang, Zhiyi
    Liu, Jinyan
    PODS'18: PROCEEDINGS OF THE 37TH ACM SIGMOD-SIGACT-SIGAI SYMPOSIUM ON PRINCIPLES OF DATABASE SYSTEMS, 2018, : 395 - 408
  • [3] Differentially Private k-Means Clustering With Convergence Guarantee
    Lu, Zhigang
    Shen, Hong
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2021, 18 (04) : 1541 - 1552
  • [4] Differentially Private K-Means Clustering and a Hybrid Approach to Private Optimization
    Su, Dong
    Cao, Jianneng
    Li, Ninghui
    Bertino, Elisa
    Lyu, Min
    Jin, Hongxia
    ACM TRANSACTIONS ON PRIVACY AND SECURITY, 2017, 20 (04)
  • [5] DIFFERENTIALLY PRIVATE COMPRESSIVE K-MEANS
    Schellekens, V.
    Chatalic, A.
    Houssiau, F.
    de Montjoye, Y. -A.
    Jacques, L.
    Gribonval, R.
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 7933 - 7937
  • [6] Locally Private k-Means Clustering
    Stemmer, Uri
    JOURNAL OF MACHINE LEARNING RESEARCH, 2021, 22
  • [7] Differentially Private K-Means Clustering Applied to Meter Data Analysis and Synthesis
    Ravi, Nikhil
    Scaglione, Anna
    Kadam, Sachin
    Gentz, Reinhard
    Peisert, Sean
    Lunghino, Brent
    Levijarvi, Emmanuel
    Shumavon, Aram
    IEEE TRANSACTIONS ON SMART GRID, 2022, 13 (06) : 4801 - 4814
  • [8] Coresets for Differentially Private K-Means Clustering and Applications to Privacy in Mobile Sensor Networks
    Feldman, Dan
    Xiang, Chongyuan
    Zhu, Ruihao
    Rus, Daniela
    2017 16TH ACM/IEEE INTERNATIONAL CONFERENCE ON INFORMATION PROCESSING IN SENSOR NETWORKS (IPSN), 2017, : 3 - 15
  • [9] Utility-efficient differentially private K-means clustering based on cluster merging
    Ni, Tianjiao
    Qiao, Minghao
    Chen, Zhili
    Zhang, Shun
    Zhong, Hong
    NEUROCOMPUTING, 2021, 424 : 205 - 214
  • [10] Practical multi-party private collaborative k-means clustering
    Zhang, En
    Li, Huimin
    Huang, Yuchen
    Hong, Shuangxi
    Zhao, Le
    Ji, Congmin
    NEUROCOMPUTING, 2022, 467 : 256 - 265