A Convergent Differentially Private k-Means Clustering Algorithm

被引:13
作者
Lu, Zhigang [1 ]
Shen, Hong [1 ,2 ]
机构
[1] Univ Adelaide, Sch Comp Sci, Adelaide, SA, Australia
[2] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou, Peoples R China
来源
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2019, PT I | 2019年 / 11439卷
基金
澳大利亚研究理事会; 国家重点研发计划;
关键词
Differential privacy; Adversarial machine learning; k-means clustering;
D O I
10.1007/978-3-030-16148-4_47
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Preserving differential privacy (DP) for the iterative clustering algorithms has been extensively studied in the interactive and the non-interactive settings. However, existing interactive differentially private clustering algorithms suffer from a non-convergence problem, i.e., these algorithms may not terminate without a predefined number of iterations. This problem severely impacts the clustering quality and the efficiency of the algorithm. To resolve this problem, we propose a novel iterative approach in the interactive settings which controls the orientation of the centroids movement over the iterations to ensure the convergence by injecting DP noise in a selected area. We prove that, in the expected case, our approach converges to the same centroids as Lloyd's algorithm in at most twice the iterations of Lloyd's algorithm. We perform experimental evaluations on real-world datasets to show that our algorithm outperforms the state-of-the-art of the interactive differentially private clustering algorithms with a guaranteed convergence and better clustering quality to meet the same DP requirement.
引用
收藏
页码:612 / 624
页数:13
相关论文
共 50 条
  • [21] Weighted k-Means Algorithm Based Text Clustering
    Chen, Xiuguo
    Yin, Wensheng
    Tu, Pinghui
    Zhang, Hengxi
    IEEC 2009: FIRST INTERNATIONAL SYMPOSIUM ON INFORMATION ENGINEERING AND ELECTRONIC COMMERCE, PROCEEDINGS, 2009, : 51 - +
  • [22] Hierarchical hesitant fuzzy K-means clustering algorithm
    CHEN Na
    XU Ze-shui
    XIA Mei-mei
    Applied Mathematics:A Journal of Chinese Universities, 2014, (01) : 1 - 17
  • [23] Hierarchical hesitant fuzzy K-means clustering algorithm
    Na Chen
    Ze-shui Xu
    Mei-mei Xia
    Applied Mathematics-A Journal of Chinese Universities, 2014, 29 : 1 - 17
  • [24] Clustering With Constraints: Feasibility Issues and the k-Means Algorithm
    Davidson, Ian
    Ravi, S. S.
    PROCEEDINGS OF THE FIFTH SIAM INTERNATIONAL CONFERENCE ON DATA MINING, 2005, : 138 - 149
  • [25] Enhanced Parallel Implementation of the K-Means Clustering Algorithm
    Baydoun, Mohammed
    Dawi, Mohammad
    Ghaziri, Hassan
    2016 3RD INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTATIONAL TOOLS FOR ENGINEERING APPLICATIONS (ACTEA), 2016, : 7 - 11
  • [26] Colour Constancy using K-means Clustering Algorithm
    Hussain, Md. Akmol
    Akbari, Akbar Sheikh
    Ghaffari, Ahmad
    2016 9TH INTERNATIONAL CONFERENCE ON DEVELOPMENTS IN ESYSTEMS ENGINEERING (DESE 2016), 2016, : 283 - 288
  • [27] K-means clustering algorithm in kernel function space
    Liang, JZ
    Wang, JY
    Xu, XB
    PROGRESS IN INTELLIGENCE COMPUTATION & APPLICATIONS, 2005, : 642 - 646
  • [28] Modified K-means Algorithm for Big Data Clustering
    Sengupta, Debapriya
    Roy, Sayantan Singha
    Ghosh, Sarbani
    Dasgupta, Ranjan
    PROCEEDINGS 2017 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI), 2017, : 1443 - 1448
  • [29] Cluster center initialization algorithm for K-means clustering
    Khan, SS
    Ahmad, A
    PATTERN RECOGNITION LETTERS, 2004, 25 (11) : 1293 - 1302
  • [30] On the Optimality of k-means Clustering
    Dalton, Lori A.
    2013 IEEE INTERNATIONAL WORKSHOP ON GENOMIC SIGNAL PROCESSING AND STATISTICS (GENSIPS 2013), 2013, : 70 - 71