Consensus clusterings

被引:159
作者
Nguyen, Nam [1 ]
Caruana, Rich [1 ]
机构
[1] Cornell Univ, Dept Comp Sci, Ithaca, NY 14853 USA
来源
ICDM 2007: PROCEEDINGS OF THE SEVENTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING | 2007年
关键词
D O I
10.1109/ICDM.2007.73
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we address the Problem of combining multiple clusterings without access to the underlying features of the data. This process is known in the literature as clustering ensembles, clustering aggregation, or consensus clustering. Consensus clustering yields a stable and robust final clustering that is in agreement with multiple clusterings. We find that an iterative EM-like method is remarkably effective for this problem. We present an iterative algorithm and its variations for finding clustering consensus. An extensive empirical study compares our proposed algorithms with eleven other consensus clustering methods on four data sets using three different clustering performance metrics. The experimental results show that the new ensemble clustering methods produce clusterings that are as good as, and often better than, these other methods.
引用
收藏
页码:607 / 612
页数:6
相关论文
共 19 条
  • [1] [Anonymous], P SIAM INT C DAT MIN
  • [2] [Anonymous], 2002, J. Mach. Learn. Res
  • [3] [Anonymous], 2006, P IEEE INT C DAT MIN
  • [4] *BIOG HEX, 1998, ENV DAT
  • [5] Boulis C, 2004, LECT NOTES ARTIF INT, V3202, P63
  • [6] Elkan Charles, 2003, ICML
  • [7] FERN XZ, 2003, P 20 ICML
  • [8] Fred A.L.N., 2002, P 16 INT C PATT REC
  • [9] GIONIS A, 2005, P 21 ICDM
  • [10] COMPARING PARTITIONS
    HUBERT, L
    ARABIE, P
    [J]. JOURNAL OF CLASSIFICATION, 1985, 2 (2-3) : 193 - 218