Research on K-Means Clustering Algorithm Over Encrypted Data

被引：1

作者：

Wang, Chen ^{[1
]}

Wang, Andi ^{[1
]}

Liu, Xinyu ^{[1
]}

Xu, Jian ^{[1
]}

机构：

[1] Northeastern Univ, Software Coll, Shenyang 110169, Peoples R China

来源：

CYBERSPACE SAFETY AND SECURITY, PT II | 2019年 / 11983卷

基金：

中国国家自然科学基金;

关键词：

K-means algorithm; Privacy-preserving clustering; Homomorphic encryption; Security protocol;

D O I：

10.1007/978-3-030-37352-8_16

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Aiming at the privacy-preserving problem in data mining process, this paper proposes an improved K-Means algorithm over encrypted data, called HK-means++ that uses the idea of homomorphic encryption to solve the encrypted data multiplication problems, distance calculation problems and the comparison problems. Then apply these security protocols to the improved clustering algorithm framework. To prevent the leakage of privacy while calculating the distance between the sample points and the center points, it prevents the attacker from inferring the cluster grouping of the user by hiding the cluster center. To some extent, it would reduce the risk of leakage of private data in the cluster mining process. It is well known that the traditional K-Means algorithm is too dependent on the initial value. In this paper, we focus on solving the problem to reduce the number of iterations, and improve the clustering efficiency. The experimental results demonstrate that our proposed, HK-Means algorithm has good clustering performance and the running time is also reduced.

引用

页码：182 / 191

页数：10

共 50 条

[41] The global k-means clustering algorithm
Likas, A
Vlassis, N
Verbeek, JJ
PATTERN RECOGNITION, 2003, 36 (02) : 451 - 461
[42] Improved K-means clustering algorithm
Zhang, Zhe
Zhang, Junxi
Xue, Huifeng
CISP 2008: FIRST INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOL 5, PROCEEDINGS, 2008, : 169 - 172
[43] A k-means based clustering algorithm
Bloisi, Domenico Daniele
Locchi, Luca
COMPUTER VISION SYSTEMS, PROCEEDINGS, 2008, 5008 : 109 - 118
[44] A Novel K-Means based Clustering Algorithm for Big Data
Sinha, Ankita
Jana, Prasanta K.
2016 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2016, : 1875 - 1879
[45] Improvement of K-Means Algorithm for Accelerated Big Data Clustering
Wu, Chunqiong
Yan, Bingwen
Yu, Rongrui
Huang, Zhangshu
Yu, Baoqin
Yu, Yanliang
Chen, Na
Zhou, Xiukao
INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGIES AND SYSTEMS APPROACH, 2021, 14 (02) : 99 - 119
[46] AN INITIALIZATION METHOD OF K-MEANS CLUSTERING ALGORITHM FOR MIXED DATA
Li, Taoying
Jin, Zhihong
Chen, Yan
Ebonzo, Angelo Dan Menga
INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2014, 10 (05): : 1873 - 1883
[47] An initialization method of K-means clustering algorithm for mixed data
Li, Taoying, 1873, ICIC International (10):
[48] An improved K-means clustering algorithm
Huang, Xiuchang
Su, Wei
Journal of Networks, 2014, 9 (01) : 161 - 167
[49] An Enhancement of K-means Clustering Algorithm
Gu, Jirong
Zhou, Jieming
Chen, Xianwei
2009 INTERNATIONAL CONFERENCE ON BUSINESS INTELLIGENCE AND FINANCIAL ENGINEERING, PROCEEDINGS, 2009, : 237 - 240
[50] Adaptive K-Means clustering algorithm
Chen, Hailin
Wu, Xiuqing
Hu, Junhua
MIPPR 2007: PATTERN RECOGNITION AND COMPUTER VISION, 2007, 6788

← 1 2 3 4 5 →