A comprehensive study of clustering ensemble weighting based on cluster quality and diversity

Cited by: 60
Authors
Nazari, Ahmad [1 ]
Dehghan, Ayob [1 ]
Nejatian, Samad [2 ,3 ]
Rezaie, Vahideh [3 ,4 ]
Parvin, Hamid [1 ,5 ]
Affiliations
[1] Islamic Azad Univ, Yasooj Branch, Dept Comp Engn, Yasuj, Iran
[2] Islamic Azad Univ, Yasooj Branch, Dept Elect Engn, Yasuj, Iran
[3] Islamic Azad Univ, Yasooj Branch, Young Researchers & Elite Club, Yasuj, Iran
[4] Islamic Azad Univ, Yasooj Branch, Dept Math, Yasuj, Iran
[5] Islamic Azad Univ, Young Researchers & Elite Club, Nourabad Mamasani Branch, Nourabad, Mamasani, Iran
Keywords
Data clustering; Clustering ensemble; Consensus function; Weighting; COMBINING MULTIPLE CLUSTERINGS; TRANSFER DISTANCE; SELECTION; CONSENSUS; PARTITIONS;
DOI
10.1007/s10044-017-0676-x
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Clustering, a major task in data mining, discovers hidden patterns in unlabeled datasets. Finding the best clustering is also considered one of the most challenging problems in data mining. Because of the complexity of the problem and the weaknesses of individual base clustering algorithms, much research has been directed toward ensemble clustering methods. Ensemble clustering aggregates a pool of base clusterings and produces an output clustering, also called the consensus clustering. The consensus clustering is usually better than the outputs of the individual base clustering algorithms. However, low-quality base clusterings weaken the resulting consensus clustering. Although some studies select a subset of high-quality base clusterings using a clustering assessment metric, selection at the cluster level has so far been ignored. This paper proposes a new clustering ensemble framework based on cluster-level weighting. The reliability of a cluster is defined as the degree of certainty that the given ensemble has about that cluster, computed from the amount of accretion of that cluster by the ensemble. The final ensemble is then created by selecting the best clusters and assigning each selected cluster a weight based on its reliability. Next, the paper proposes a cluster-level weighted co-association matrix in place of the traditional co-association matrix. Finally, two consensus functions are introduced and used to produce the consensus partition. In the reported experiments, the proposed framework outperforms state-of-the-art clustering ensemble methods.
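The idea of a cluster-level weighted co-association matrix can be illustrated with a minimal Python sketch. The abstract does not give the paper's reliability formula, so the `cluster_reliability` function below is a hypothetical stand-in (the average, over all base clusterings, of the largest fraction of a cluster's members that stay together), and the function names and the `threshold` parameter are assumptions made for illustration only. This is not the authors' implementation.

```python
def cluster_members(labels):
    """Map each cluster label to the set of object indices it contains."""
    clusters = {}
    for idx, lab in enumerate(labels):
        clusters.setdefault(lab, set()).add(idx)
    return clusters


def cluster_reliability(members, ensemble):
    """Hypothetical reliability score (an assumption, not the paper's measure).

    For every base clustering, take the largest fraction of `members`
    that lands in one single cluster, then average over the ensemble.
    """
    scores = []
    for labels in ensemble:
        counts = {}
        for i in members:
            counts[labels[i]] = counts.get(labels[i], 0) + 1
        scores.append(max(counts.values()) / len(members))
    return sum(scores) / len(scores)


def weighted_co_association(ensemble, threshold=0.0):
    """Build a co-association matrix in which each cluster votes with its weight.

    Clusters whose reliability falls below `threshold` are skipped, which
    mimics cluster-level selection. With all weights equal to 1 and
    threshold 0 this reduces to the traditional co-association matrix.
    """
    n = len(ensemble[0])
    m = len(ensemble)
    matrix = [[0.0] * n for _ in range(n)]
    for labels in ensemble:
        for members in cluster_members(labels).values():
            weight = cluster_reliability(members, ensemble)
            if weight < threshold:
                continue  # drop clusters the ensemble is uncertain about
            for i in members:
                for j in members:
                    matrix[i][j] += weight / m
    return matrix


# Three base clusterings of six objects (toy data, invented for this sketch).
ensemble = [
    [0, 0, 0, 1, 1, 1],
    [0, 0, 1, 1, 2, 2],
    [0, 0, 0, 0, 1, 1],
]
matrix = weighted_co_association(ensemble)
```

In this toy ensemble, objects 0 and 1 are grouped together by every base clustering and receive a higher weighted co-association than objects 2 and 3, which are only grouped together by less stable clusters. A consensus partition could then be obtained by applying any similarity-based clustering method to `matrix`; the two consensus functions used in the paper are not described in the abstract and are not reproduced here.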
Pages: 133-145
Number of pages: 13
Related papers
50 in total
  • [21] Clustering Ensemble Algorithm with Cluster Connection Based on Wisdom of Crowds
    Zhang H.
    Gao Y.
    Chen Y.
    Wang Z.
    Gao, Yukun (821566504@qq.com), 2018, Science Press (55) : 2611 - 2619
  • [22] DICLENS: Divisive Clustering Ensemble with Automatic Cluster Number
    Mimaroglu, Selim
    Aksehirli, Emin
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2012, 9 (02) : 408 - 420
  • [23] Enhanced Ensemble Clustering via Fast Propagation of Cluster-Wise Similarities
    Huang, Dong
    Wang, Chang-Dong
    Peng, Hongxing
    Lai, Jianhuang
    Kwoh, Chee-Keong
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2021, 51 (01) : 508 - 520
  • [24] Meta-cluster Based Consensus Clustering with Local Weighting and Random Walking
    He, Nannan
    Huang, Dong
    INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING: BIG DATA AND MACHINE LEARNING, PT II, 2019, 11936 : 266 - 277
  • [25] A cluster-weighted clustering ensemble algorithm based on member selection
    Xu, Sen
    Gao, Ting
    Xu, Xiu-Fang
    Xu, He-Yang
    Guo, Nai-Xuan
    Bian, Xue-Sheng
    Hua, Xiaopeng
    Chen, Zhi-Yuan
    Kongzhi yu Juece/Control and Decision, 2024, 39 (12) : 4136 - 4140
  • [26] A Weighted Object-Cluster Association-Based Ensemble Method for Clustering Undergraduate Students
    Chau Thi Ngoc Vo
    Phung Hua Nguyen
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2018, PT I, 2018, 10751 : 587 - 598
  • [27] A Link-Based Cluster Ensemble Approach for Categorical Data Clustering
    Iam-On, Natthakan
    Boongoen, Tossapon
    Garrett, Simon
    Price, Chris
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2012, 24 (03) : 413 - 425
  • [28] An ensemble hierarchical clustering algorithm based on merits at cluster and partition levels
    Huang, Qirui
    Gao, Rui
    Akhavan, Hoda
    PATTERN RECOGNITION, 2023, 136
  • [29] Parameter-free ensemble clustering with dynamic weighting mechanism
    Xie, Fangyuan
    Nie, Feiping
    Yu, Weizhong
    Li, Xuelong
    PATTERN RECOGNITION, 2024, 151
  • [30] From Ensemble Clustering to Subspace Clustering: Cluster Structure Encoding
    Tao, Zhiqiang
    Li, Jun
    Fu, Huazhu
    Kong, Yu
    Fu, Yun
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (05) : 2670 - 2681