Initializing K-means Clustering Using Affinity Propagation

被引:23
作者
Zhu, Yan [1 ]
Yu, Jian [1 ]
Jia, Caiyan [1 ]
机构
[1] Beijing Jiaotong Univ, Dept Comp Sci, Beijing 100044, Peoples R China
来源
HIS 2009: 2009 NINTH INTERNATIONAL CONFERENCE ON HYBRID INTELLIGENT SYSTEMS, VOL 1, PROCEEDINGS | 2009年
关键词
k-means; k-centers; affinity propagation; convergence;
D O I
10.1109/HIS.2009.73
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
K-means clustering is widely used due to its fast convergence, but it is sensitive to the initial condition. Therefore, many methods of initializing K-means clustering have been proposed in the literatures. Compared with K-means clustering, a novel clustering algorithm called affinity propagation (AP clustering) has been developed by Frey and Dueck, which can produce a good set of cluster exemplars with fast speed. Taking the convergence property of K-means and the good performance of affinity propagation, we presented a new clustering strategy which can produce much lower squared error than AP and standard K-means: initializing K-means clustering using cluster exemplars produced by AP. Numerical experiments indicated that such combined method outperforms not only AP and original K-means clustering, but also K-means clustering with sophisticated initial conditions designed by various methods.
引用
收藏
页码:338 / 343
页数:6
相关论文
共 12 条
[1]  
Arthur D, 2007, PROCEEDINGS OF THE EIGHTEENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, P1027
[2]  
BRADLEY PS, 1998, P 15 INT C MACH LEAR, P91
[3]   Iterative optimization and simplification of hierarchical clusterings [J].
Fisher, D .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 1996, 4 :147-179
[4]  
Frey B., 2007, MULTI DATABASE RETRI, Vvol. 315, ppp, DOI [DOI 10.1126/SCIENCE.1136800, 10.1126/science.1136800]
[5]  
JIANG W, 2007, AFFINITY PROPAGATION
[6]  
Kaufman L., 1990, Finding Groups in Data, V447, P467, DOI DOI 10.1002/9780470316801
[7]  
LIKAS A, 2002, PATTERN RECOGN, V2, P451
[8]  
MacQueen J., 1967, 5 BERK S MATH STAT P, P281, DOI DOI 10.1007/S11665-016-2173-6
[9]   An empirical comparison of four initialization methods for the K-Means algorithm [J].
Peña, JM ;
Lozano, JA ;
Larrañaga, P .
PATTERN RECOGNITION LETTERS, 1999, 20 (10) :1027-1040
[10]   K-MEANS-TYPE ALGORITHMS - A GENERALIZED CONVERGENCE THEOREM AND CHARACTERIZATION OF LOCAL OPTIMALITY [J].
SELIM, SZ ;
ISMAIL, MA .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1984, 6 (01) :81-87