Learning Assignment Order of Instances for the Constrained K-Means Clustering Algorithm

被引:29
作者
Hong, Yi [1 ]
Kwong, Sam [1 ]
机构
[1] City Univ Hong Kong, Dept Comp Sci, Kowloon, Hong Kong, Peoples R China
来源
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS | 2009年 / 39卷 / 02期
关键词
Constrained K-means clustering algorithm (Cop-Kmeans); ensemble learning; instance-level constraints;
D O I
10.1109/TSMCB.2008.2006641
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The sensitivity of the constrained K-means clustering algorithm (Cop-Kmeans) to the assignment order of instances is studied, and a novel assignment order learning method for Cop-Kmeans, termed as clustering Uncertainty-based Assignment order Learning Algorithm (UALA), is proposed in this correspondence paper. The main idea of UALA is to rank all instances in the data set according to their clustering uncertainties calculated by using the ensembles of multiple clustering algorithms. Experimental results on several real data sets with artificial instance-level constraints demonstrate that UALA can identify a good assignment order of instances for Cop-Kmeans. In addition, the effects of ensemble sizes on the performance of UALA are analyzed, and the generalization property of Cop-Kmeans is also studied.
引用
收藏
页码:568 / 574
页数:7
相关论文
共 24 条
[1]  
Angelova A, 2005, PROC CVPR IEEE, P494
[2]  
[Anonymous], 2005, THESIS U TEXAS AUSTI
[3]  
[Anonymous], UCI Repository of machine learning databases
[4]  
[Anonymous], P SIAM INT C DAT MIN
[5]  
Bar-Hillel AB, 2005, J MACH LEARN RES, V6, P937
[6]  
BASU S, 2004, P SIAM INT C DAT MIN
[7]  
BASU S, 2005, P ACM SIGKDD INT C K, P59
[8]   Derived 12-lead electrocardiogram in the assessment of ST-segment deviation and cardiac rhythm [J].
Chantad, D ;
Krittayaphong, R ;
Komoltri, C .
JOURNAL OF ELECTROCARDIOLOGY, 2006, 39 (01) :7-12
[9]  
COHN D, 2003, TR20031892 CORN U
[10]  
Duda R. O., 2000, Pattern classification