Research on K-Means Clustering Algorithm Over Encrypted Data

被引:1
|
作者
Wang, Chen [1 ]
Wang, Andi [1 ]
Liu, Xinyu [1 ]
Xu, Jian [1 ]
机构
[1] Northeastern Univ, Software Coll, Shenyang 110169, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
K-means algorithm; Privacy-preserving clustering; Homomorphic encryption; Security protocol;
D O I
10.1007/978-3-030-37352-8_16
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Aiming at the privacy-preserving problem in data mining process, this paper proposes an improved K-Means algorithm over encrypted data, called HK-means++ that uses the idea of homomorphic encryption to solve the encrypted data multiplication problems, distance calculation problems and the comparison problems. Then apply these security protocols to the improved clustering algorithm framework. To prevent the leakage of privacy while calculating the distance between the sample points and the center points, it prevents the attacker from inferring the cluster grouping of the user by hiding the cluster center. To some extent, it would reduce the risk of leakage of private data in the cluster mining process. It is well known that the traditional K-Means algorithm is too dependent on the initial value. In this paper, we focus on solving the problem to reduce the number of iterations, and improve the clustering efficiency. The experimental results demonstrate that our proposed, HK-Means algorithm has good clustering performance and the running time is also reduced.
引用
收藏
页码:182 / 191
页数:10
相关论文
共 50 条
  • [1] Research on k-means Clustering Algorithm An Improved k-means Clustering Algorithm
    Shi Na
    Liu Xumin
    Guan Yong
    2010 THIRD INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY AND SECURITY INFORMATICS (IITSI 2010), 2010, : 63 - 67
  • [2] Review on the Research of K-means Clustering Algorithm in Big Data
    Chen Jie
    Zhang Jiyue
    Wu Junhui
    Wu Yusheng
    Si Huiping
    Lin Kaiyan
    2020 IEEE THE 3RD INTERNATIONAL CONFERENCE ON ELECTRONICS AND COMMUNICATION ENGINEERING (ICECE), 2020, : 107 - 111
  • [3] IMPROVEMENT IN K-MEANS CLUSTERING ALGORITHM FOR DATA CLUSTERING
    Rajeswari, K.
    Acharya, Omkar
    Sharma, Mayur
    Kopnar, Mahesh
    Karandikar, Kiran
    1ST INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION CONTROL AND AUTOMATION ICCUBEA 2015, 2015, : 367 - 369
  • [4] Research and Improvement on K-Means Clustering Algorithm
    Wang, Xue-mei
    Wang, Jin-bo
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION APPLICATIONS (ICCIA 2012), 2012, : 1138 - 1141
  • [5] Research on Improved K-means Clustering Algorithm
    Zhang, Yinsheng
    Shan, Huilin
    Li, Jiaqiang
    Zhou, Jie
    MEMS, NANO AND SMART SYSTEMS, PTS 1-6, 2012, 403-408 : 1977 - 1980
  • [6] Research on improved K-means clustering algorithm
    Zhang, Yinsheng
    Shan, Huilin
    Li, Jiaqiang
    Zhou, Jie
    Advanced Materials Research, 2012, 403-408 : 1977 - 1980
  • [7] On K-means Data Clustering Algorithm with Genetic Algorithm
    Kapil, Shruti
    Chawla, Meenu
    Ansari, Mohd Dilshad
    2016 FOURTH INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND GRID COMPUTING (PDGC), 2016, : 202 - 206
  • [8] Soil data clustering by using K-means and fuzzy K-means algorithm
    Hot, Elma
    Popovic-Bugarin, Vesna
    2015 23RD TELECOMMUNICATIONS FORUM TELFOR (TELFOR), 2015, : 890 - 893
  • [9] Outsourced k-Means Clustering over Encrypted Data Under Multiple Keys in Spark Framework
    Rong, Hong
    Wang, Huimei
    Liu, Jian
    Hao, Jialu
    Xian, Ming
    SECURITY AND PRIVACY IN COMMUNICATION NETWORKS, SECURECOMM 2017, 2018, 238 : 67 - 87
  • [10] Hybridization of Chaos and Flower Pollination Algorithm over K-Means for data clustering
    Kaur, Arvinder
    Pal, Saibal Kumar
    Singh, Amrit Pal
    APPLIED SOFT COMPUTING, 2020, 97 (97)