Weighted Spectral Cluster Ensemble

被引:15
作者
Yousefnezhad, Muhammad [1 ]
Zhang, Daoqiang [1 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Dept Comp Sci & Technol, Nanjing, Jiangsu, Peoples R China
来源
2015 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM) | 2015年
关键词
cluster ensemble; spectral clustering; normalized modularity; weighted evidence accumulation clustering;
D O I
10.1109/ICDM.2015.145
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Clustering explores meaningful patterns in the non-labeled data sets. Cluster Ensemble Selection (CES) is a new approach, which can combine individual clustering results for increasing the performance of the final results. Although CES can achieve better final results in comparison with individual clustering algorithms and cluster ensemble methods, its performance can be dramatically affected by its consensus diversity metric and thresholding procedure. There are two problems in CES: 1) most of the diversity metrics is based on heuristic Shannon's entropy and 2) estimating threshold values are really hard in practice. The main goal of this paper is proposing a robust approach for solving the above mentioned problems. Accordingly, this paper develops a novel framework for clustering problems, which is called Weighted Spectral Cluster Ensemble (WSCE), by exploiting some concepts from community detection arena and graph based clustering. Under this framework, a new version of spectral clustering, which is called Two Kernels Spectral Clustering, is used for generating graphs based individual clustering results. Further, by using modularity, which is a famous metric in the community detection, on the transformed graph representation of individual clustering results, our approach provides an effective diversity estimation for individual clustering results. Moreover, this paper introduces a new approach for combining the evaluated individual clustering results without the procedure of thresholding. Experimental study on varied data sets demonstrates that the prosed approach achieves superior performance to state-of-the-art methods.
引用
收藏
页码:549 / 558
页数:10
相关论文
共 20 条
  • [1] Alizadeh H., 2015, INTELLIGENT DATA ANA, V19
  • [2] Cluster ensemble selection based on a new cluster stability measure
    Alizadeh, Hosein
    Minaei-Bidgoli, Behrouz
    Parvin, Hamid
    [J]. INTELLIGENT DATA ANALYSIS, 2014, 18 (03) : 389 - 408
  • [3] [Anonymous], 2009, 15 ACM C KNOWL DISC
  • [4] Azimi J, 2009, 21ST INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-09), PROCEEDINGS, P992
  • [5] Blake C.L., 1998, UCI REPOSITORY MACHI
  • [6] Chen YD, 2014, PR MACH LEARN RES, V32, P1566
  • [7] Clauset A, 2004, PHYS REV E, V70, DOI 10.1103/PhysRevE.70.066111
  • [8] Fern XZ, 2008, STAT ANAL DATA MIN, P128, DOI DOI 10.1002/SAM.10008
  • [9] FRED A., 2008, SUPERVISED UNSUPERVI, P3
  • [10] Combining multiple clusterings using evidence accumulation
    Fred, ALN
    Jain, AK
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2005, 27 (06) : 835 - 850