Clustering aggregation by probability accumulation

Cited by: 78
Authors
Wang, Xi [1]
Yang, Chunyu [1]
Zhou, Jie [1]
Affiliations
[1] Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China
Keywords
Clustering aggregation; Evidence accumulation; Probability accumulation; Retrieval
DOI
10.1016/j.patcog.2008.09.013
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Since a large number of clustering algorithms exist, aggregating different clustered partitions into a single consolidated one to obtain better results has become an important problem. Fred and Jain's evidence accumulation algorithm constructs a co-association matrix from the original partition labels and then applies a minimum spanning tree to this matrix to obtain the combined clustering. In this paper, we propose a novel clustering aggregation scheme, probability accumulation. In this algorithm, the construction of correlation matrices takes the cluster sizes of the original clusterings into consideration. An alternate improved algorithm with additional pre- and post-processing is also proposed. Experimental results on both synthetic and real data sets show that the proposed algorithms perform better than evidence accumulation, as well as some other methods. © 2008 Elsevier Ltd. All rights reserved.
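The evidence accumulation baseline mentioned in the abstract can be sketched roughly as follows. This is a minimal illustration, not the paper's method: it covers only the co-association step attributed to Fred and Jain, since the abstract does not give the probability accumulation formula. The function names `co_association` and `single_link_clusters` and the fixed threshold are assumptions made for the example. In place of building the minimum spanning tree explicitly, the sketch takes connected components of the thresholded matrix, which yields the same single-link grouping as cutting weak tree edges.

```python
def co_association(partitions):
    """Fraction of partitions in which each pair of points shares a cluster.

    `partitions` is a list of label lists, one per base clustering
    (hypothetical input format chosen for this sketch).
    """
    n = len(partitions[0])
    m = len(partitions)
    counts = [[0] * n for _ in range(n)]
    for labels in partitions:
        for i in range(n):
            for j in range(n):
                if labels[i] == labels[j]:
                    counts[i][j] += 1
    return [[counts[i][j] / m for j in range(n)] for i in range(n)]


def single_link_clusters(C, threshold):
    """Group points linked by co-association >= threshold (union-find)."""
    n = len(C)
    parent = list(range(n))

    def find(x):
        # Follow parent pointers to the root, compressing the path.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for i in range(n):
        for j in range(i + 1, n):
            if C[i][j] >= threshold:
                parent[find(i)] = find(j)

    # Relabel roots as consecutive integers in order of first appearance.
    roots = {}
    return [roots.setdefault(find(i), len(roots)) for i in range(n)]


# Three base clusterings of four points (made-up labels for illustration).
partitions = [[0, 0, 1, 1], [0, 0, 0, 1], [1, 1, 0, 0]]
C = co_association(partitions)
combined = single_link_clusters(C, 0.5)
```

With these made-up labels, points 0 and 1 co-occur in every partition and points 2 and 3 in two of three, so a threshold of 0.5 separates the two pairs. According to the abstract, probability accumulation differs by taking cluster sizes into account when building the matrix; that weighting is not reproduced here.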
Pages: 668-675
Page count: 8
Related papers
17 in total
[1] [Anonymous], P SIAM INT C DAT MIN
[2] [Anonymous], 2004, ASS COMPUTING MACHIN
[3] Bhatia, SK; Deogun, JS. Conceptual clustering in information retrieval [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 1998, 28 (03): 427-436
[4] BIAN Zhao-qi, 2000, Pattern recognition
[5] Carpineto C, 1996, MACH LEARN, V24, P95
[6] Duda R. O., 2000, Pattern classification
[7] Dudoit, S; Fridlyand, J. Bagging to improve the accuracy of a clustering procedure [J]. BIOINFORMATICS, 2003, 19 (09): 1090-1099
[8] Fred, ALN; Jain, AK. Combining multiple clusterings using evidence accumulation [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2005, 27 (06): 835-850
[9] Fred ALN, 2002, INT C PATT RECOG, P276, DOI 10.1109/ICPR.2002.1047450
[10] FRED ALN, 2002, P JOINT IAPR INT WOR, P442