Weighted partition consensus via kernels

被引:76
作者
Vega-Pons, Sandro [1 ]
Correa-Morris, Jyrko [2 ]
Ruiz-Shulcloper, Jose [1 ]
机构
[1] CENATAV, Adv Technol Applicat Ctr, Havana, Cuba
[2] Univ Havana, Dept Appl Math, Fac Math, Havana, Cuba
关键词
Cluster ensemble; Kernel function; Similarity measure; Clustering validity index; Consensus partition; VALIDATION;
D O I
10.1016/j.patcog.2010.03.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The combination of multiple clustering results (clustering ensemble) has emerged as an important procedure to improve the quality of clustering solutions. In this paper we propose a new cluster ensemble method based on kernel functions, which introduces the Partition Relevance Analysis step. This step has the goal of analyzing the set of partition in the cluster ensemble and extract valuable information that can improve the quality of the combination process. Besides, we propose a new similarity measure between partitions proving that it is a kernel function. A new consensus function is introduced using this similarity measure and based on the idea of finding the median partition. Related to this consensus function, some theoretical results that endorse the suitability of our methods are proven. Finally, we conduct a numerical experimentation to show the behavior of our method on several databases by making a comparison with simple clustering algorithms as well as to other cluster ensemble methods. (C) 2010 Elsevier Ltd. All rights reserved.
引用
收藏
页码:2712 / 2724
页数:13
相关论文
共 25 条
  • [1] ALRAZGAN M, 2007, SIAM INT C DAT MIN S, P258
  • [2] Bakir GH, 2004, LECT NOTES COMPUT SC, V3175, P253
  • [3] BARTLETT B, 1995, AUST J PUBLIC HEALTH, V19, P3
  • [4] FORMIN S, 1999, ELEMENTS THEORY FUNC
  • [5] Fred A., 2001, Multiple Classifier Systems. Second International Workshop, MCS 2001. Proceedings (Lecture Notes in Computer Science Vol.2096), P309
  • [6] FRED A., 2008, SUPERVISED UNSUPERVI, P3
  • [7] Combining multiple clusterings using evidence accumulation
    Fred, ALN
    Jain, AK
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2005, 27 (06) : 835 - 850
  • [8] GARTNER T, 2008, SERIES MACHINE PERCE
  • [9] On clustering validation techniques
    Halkidi, M
    Batistakis, Y
    Vazirgiannis, M
    [J]. JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2001, 17 (2-3) : 107 - 145
  • [10] Computational cluster validation in post-genomic data analysis
    Handl, J
    Knowles, J
    Kell, DB
    [J]. BIOINFORMATICS, 2005, 21 (15) : 3201 - 3212