Density K-means : A New Algorithm for Centers Initialization for K-means

被引:0
作者
Lan, Xv [1 ]
Li, Qian [2 ]
Zheng, Yi [1 ]
机构
[1] Natl Def Univ, Coll Comp, Changsha 410073, Hunan, Peoples R China
[2] Minzu Univ China, Sch Econ, Beijing 100083, Peoples R China
来源
PROCEEDINGS OF 2015 6TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE | 2015年
关键词
K-means; Initial cluster centers; Density peaks;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
K-means is one of the most significant clustering algorithms in data mining. It performs well in many cases, especially in the massive data sets. However, the result of clustering by K-means largely depends upon the initial centers, which makes K-means difficult to reach global optimum. In this paper, we developed a novel algorithm based on finding density peaks to optimize the initial centers for K-means. In the experiment, together with our algorithm, nine different clustering algorithms were extensively compared on four well-known test data sets. According to our experimental results, the performance of our algorithm is significantly better than other eight algorithms, which indicates that it is a valuable method to select initial center for K-means.
引用
收藏
页码:958 / 961
页数:4
相关论文
共 16 条
[1]  
Arai Kohei, 2007, Reports of the Faculty of Science and Engineering, Saga University, V36, P25
[2]   Effect of a cash transfer programme for schooling on prevalence of HIV and herpes simplex type 2 in Malawi: a cluster randomised trial [J].
Baird, Sarah J. ;
Garfein, Richard S. ;
McIntosh, Craig T. ;
Oezler, Berk .
LANCET, 2012, 379 (9823) :1320-1329
[3]  
Bradley P. S., 1998, Proceedings Fourth International Conference on Knowledge Discovery and Data Mining, P9
[4]   k*-means:: A new generalized k-means clustering algorithm [J].
Cheung, YM .
PATTERN RECOGNITION LETTERS, 2003, 24 (15) :2883-2893
[5]  
FORGY EW, 1965, BIOMETRICS, V21, P768
[6]   Clustering by passing messages between data points [J].
Frey, Brendan J. ;
Dueck, Delbert .
SCIENCE, 2007, 315 (5814) :972-976
[7]   Data clustering: 50 years beyond K-means [J].
Jain, Anil K. .
PATTERN RECOGNITION LETTERS, 2010, 31 (08) :651-666
[8]  
Kaufinan L., 2009, FINDING GROUPS DATA, V344
[9]   Cluster center initialization algorithm for K-means clustering [J].
Khan, SS ;
Ahmad, A .
PATTERN RECOGNITION LETTERS, 2004, 25 (11) :1293-1302
[10]  
MacQueen J., 1967, 5 BERK S MATH STAT P, V1, P281, DOI DOI 10.1007/S11665-016-2173-6