Improved research to k-means initial cluster centers

被引:15
作者
Zhang Min [1 ]
Duan Kai-fei [1 ]
机构
[1] Dalian Univ, Coll Informat & Engn, Dalian, Peoples R China
来源
2015 NINTH INTERNATIONAL CONFERENCE ON FRONTIER OF COMPUTER SCIENCE AND TECHNOLOGY FCST 2015 | 2015年
关键词
clustering; K-means plus plus algorithm; initial cluster centers; variance;
D O I
10.1109/FCST.2015.61
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
K-means in the field of clustering analysis algorithms is a kind of more traditional algorithm. It exists many shortcomings. For example, K value is easily affected by manmade subjective factors, and the algorithm is easy to fall into a local optimal solution, and the clustering result is not stable, etc; And K-means++ algorithm as the classic improved algorithm of K-means algorithm, but there is still a phenomenon of unstable cluster center. This paper is a kind of improvement aimed at the shortcoming of K-means++ algorithm, which introduces the concept of the variance in probability and mathematical statistics. Variance reflects the degree of density between samples and other samples. In the K-means++ algorithm when you select the first initial clustering center, you need to select minimum variance of sample points, which is in the position of the largest sample density, then you select the next cluster centers based on the weight method of D2 which is described in the K-means++ algorithm. Experimental results show the accuracy is higher and stability is better.
引用
收藏
页码:348 / 352
页数:5
相关论文
共 11 条
[1]  
[Anonymous], 2007, 18 ANN ACM SIAM S DI
[2]  
Chen Xingshu, 2015, J SICHUAN U ENG SCI, P13
[3]  
Han L.-b., 2010, COMPUTER ENG APPL, V46
[4]  
Han Lingbo, 2010, COMPUTER ENG APPL, P150
[5]   Initialization for K-means clustering using Voronoi diagram [J].
Reddy, Damodar ;
Jana, Prasanta K. .
2ND INTERNATIONAL CONFERENCE ON COMPUTER, COMMUNICATION, CONTROL AND INFORMATION TECHNOLOGY (C3IT-2012), 2012, 4 :395-400
[6]   Clustering algorithms research [J].
Sun, Ji-Gui ;
Liu, Jie ;
Zhao, Lian-Yu .
Ruan Jian Xue Bao/Journal of Software, 2008, 19 (01) :48-61
[7]  
Tong Xuejiao, 2011, COMPUTER ENG DESIGN, P2788
[8]  
Tong Xuejiao, 2011, COMPUTER ENG DESIGN, P2721
[9]  
Xie Juanying, 2012, J NW U NATURAL SCI E, P570
[10]  
Zhai Donghai, 2014, APPL RES COMPUTERS, P713