Research on K-Means Clustering Algorithm Over Encrypted Data

被引:1
|
作者
Wang, Chen [1 ]
Wang, Andi [1 ]
Liu, Xinyu [1 ]
Xu, Jian [1 ]
机构
[1] Northeastern Univ, Software Coll, Shenyang 110169, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
K-means algorithm; Privacy-preserving clustering; Homomorphic encryption; Security protocol;
D O I
10.1007/978-3-030-37352-8_16
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Aiming at the privacy-preserving problem in data mining process, this paper proposes an improved K-Means algorithm over encrypted data, called HK-means++ that uses the idea of homomorphic encryption to solve the encrypted data multiplication problems, distance calculation problems and the comparison problems. Then apply these security protocols to the improved clustering algorithm framework. To prevent the leakage of privacy while calculating the distance between the sample points and the center points, it prevents the attacker from inferring the cluster grouping of the user by hiding the cluster center. To some extent, it would reduce the risk of leakage of private data in the cluster mining process. It is well known that the traditional K-Means algorithm is too dependent on the initial value. In this paper, we focus on solving the problem to reduce the number of iterations, and improve the clustering efficiency. The experimental results demonstrate that our proposed, HK-Means algorithm has good clustering performance and the running time is also reduced.
引用
收藏
页码:182 / 191
页数:10
相关论文
共 50 条
  • [41] The global k-means clustering algorithm
    Likas, A
    Vlassis, N
    Verbeek, JJ
    PATTERN RECOGNITION, 2003, 36 (02) : 451 - 461
  • [42] Improved K-means clustering algorithm
    Zhang, Zhe
    Zhang, Junxi
    Xue, Huifeng
    CISP 2008: FIRST INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOL 5, PROCEEDINGS, 2008, : 169 - 172
  • [43] A k-means based clustering algorithm
    Bloisi, Domenico Daniele
    Locchi, Luca
    COMPUTER VISION SYSTEMS, PROCEEDINGS, 2008, 5008 : 109 - 118
  • [44] A Novel K-Means based Clustering Algorithm for Big Data
    Sinha, Ankita
    Jana, Prasanta K.
    2016 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2016, : 1875 - 1879
  • [45] Improvement of K-Means Algorithm for Accelerated Big Data Clustering
    Wu, Chunqiong
    Yan, Bingwen
    Yu, Rongrui
    Huang, Zhangshu
    Yu, Baoqin
    Yu, Yanliang
    Chen, Na
    Zhou, Xiukao
    INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGIES AND SYSTEMS APPROACH, 2021, 14 (02) : 99 - 119
  • [46] AN INITIALIZATION METHOD OF K-MEANS CLUSTERING ALGORITHM FOR MIXED DATA
    Li, Taoying
    Jin, Zhihong
    Chen, Yan
    Ebonzo, Angelo Dan Menga
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2014, 10 (05): : 1873 - 1883
  • [47] An initialization method of K-means clustering algorithm for mixed data
    Li, Taoying, 1873, ICIC International (10):
  • [48] An improved K-means clustering algorithm
    Huang, Xiuchang
    Su, Wei
    Journal of Networks, 2014, 9 (01) : 161 - 167
  • [49] An Enhancement of K-means Clustering Algorithm
    Gu, Jirong
    Zhou, Jieming
    Chen, Xianwei
    2009 INTERNATIONAL CONFERENCE ON BUSINESS INTELLIGENCE AND FINANCIAL ENGINEERING, PROCEEDINGS, 2009, : 237 - 240
  • [50] Adaptive K-Means clustering algorithm
    Chen, Hailin
    Wu, Xiuqing
    Hu, Junhua
    MIPPR 2007: PATTERN RECOGNITION AND COMPUTER VISION, 2007, 6788