Partition Selection Approach for Hierarchical Clustering Based on Clustering Ensemble

被引:0
作者
Vega-Pons, Sandro [1 ]
Ruiz-Shulcloper, Jose [1 ]
机构
[1] Adv Technol Applicat Ctr CENATAV, Havana, Cuba
来源
PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS | 2010年 / 6419卷
关键词
Hierarchical clustering; partition selection; clustering ensemble; cluster validity index; FIND;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hierarchical clustering algorithms are widely used in many fields of investigation. They provide a hierarchy of partitions of the same dataset. However, in many practical problems, the selection of a representative level (partition) in the hierarchy is needed. The classical approach to do so is by using a cluster validity index to select the best partition according to the criterion imposed by this index. In this paper, we present a new approach based on the clustering ensemble philosophy. The representative level is defined here as the consensus partition in the hierarchy. In the consensus computation process, we take into account the similarity between partitions and information from the evaluation of partitions with different cluster validity indexes. An experimental comparison on several datasets shows the superiority of the proposed approach with respect to the classical approach.
引用
收藏
页码:525 / 532
页数:8
相关论文
共 10 条
  • [1] Bakir GH, 2004, ADV NEUR IN, V16, P449
  • [2] Everitt B. S., 2001, CLUSTER ANAL
  • [3] Frank A., 2010, UCI machine learning repository, V213
  • [4] Combining multiple clusterings using evidence accumulation
    Fred, ALN
    Jain, AK
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2005, 27 (06) : 835 - 850
  • [5] SEP/COP: An efficient method to find the best partition in hierarchical clustering based on a new cluster validity index
    Gurrutxaga, Ibai
    Albisua, Inaki
    Arbelaitz, Olatz
    Martin, Jose I.
    Muguerza, Javier
    Perez, Jesus M.
    Perona, Inigo
    [J]. PATTERN RECOGNITION, 2010, 43 (10) : 3364 - 3373
  • [6] Data clustering: A review
    Jain, AK
    Murty, MN
    Flynn, PJ
    [J]. ACM COMPUTING SURVEYS, 1999, 31 (03) : 264 - 323
  • [7] AN EXAMINATION OF PROCEDURES FOR DETERMINING THE NUMBER OF CLUSTERS IN A DATA SET
    MILLIGAN, GW
    COOPER, MC
    [J]. PSYCHOMETRIKA, 1985, 50 (02) : 159 - 179
  • [8] STREHL A, 2002, J MACH LEARN RES, V3, P583, DOI DOI 10.1162/153244303321897735
  • [9] Weighted partition consensus via kernels
    Vega-Pons, Sandro
    Correa-Morris, Jyrko
    Ruiz-Shulcloper, Jose
    [J]. PATTERN RECOGNITION, 2010, 43 (08) : 2712 - 2724
  • [10] Xu R., 2009, IEEE Press Series on Computational Intelligence