On voting-based consensus of cluster ensembles

被引:101
作者
Ayad, Hanan G. [1 ]
Kamel, Mohamed S. [1 ]
机构
[1] Univ Waterloo, Dept Elect & Comp Engn, Waterloo, ON N2L 3G1, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Clustering; Cluster ensembles; Voting-based consensus; MODELS;
D O I
10.1016/j.patcog.2009.11.012
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Voting-based consensus clustering refers to a distinct class of consensus methods in which the cluster label mismatch problem is explicitly addressed The voting problem is defined as the problem of finding the optimal relabeling of a given partition with respect to a reference partition It is commonly formulated as a weighted bipartite matching problem. In this paper, we present a more general formulation of the voting problem as a regression problem with multiple-response and multiple-input variables We show that a recently introduced cumulative voting scheme is a special case corresponding to a linear regression method We use a randomized ensemble generation technique, where an overproduced number of clusters is randomly selected for each ensemble partition. We apply an information theoretic algorithm for extracting the consensus clustering from the aggregated ensemble representation and for estimating the number of clusters We apply it in conjunction with bipartite matching and cumulative voting. We present empirical evidence showing substantial improvements in clustering accuracy, stability, and estimation of the true number of clusters based on cumulative voting The improvements are achieved in comparison to consensus algorithms based on bipartite matching, which perform very poorly with the chosen ensemble generation technique, and also to other recent consensus algorithms (C) 2009 Elsevier Ltd All rights reserved
引用
收藏
页码:1943 / 1953
页数:11
相关论文
共 33 条
[1]  
[Anonymous], NIPS
[2]  
[Anonymous], P 4 IEEE INT C DAT M
[3]  
[Anonymous], 1985, J CLASSIF
[4]  
[Anonymous], CLUE CLUSTER ENSEMBL
[5]  
[Anonymous], THESIS U MASSACHUSET
[6]  
[Anonymous], P INT C SYST MAN CYB
[7]  
[Anonymous], 329 U WASH DEP STAT
[8]  
[Anonymous], ADV NEURAL INFORM PR
[9]  
[Anonymous], P 4 INT C INT SYST M
[10]  
[Anonymous], Pattern Recognition with Fuzzy Objective Function Algorithms