An ensemble hierarchical clustering algorithm based on merits at cluster and partition levels

Cited by: 20
Authors
Huang, Qirui [1 ]
Gao, Rui [2 ]
Akhavan, Hoda [3 ]
Affiliations
[1] Nanyang Inst Technol, Sch Informat Engn, Nanyang 473004, Henan, Peoples R China
[2] Dongying Vocat Inst, Acad Affairs Off, Dongying 257000, Shandong, Peoples R China
[3] Amirkabir Univ Technol, Comp Engn & Informat Technol Dept, Tehran, Iran
Keywords
Ensemble clustering; Cluster consensus; Hyper-cluster; Merit level; Robustness measure; QUALITY; PREDICTION; DIVERSITY; CRITERION; SELECTION
DOI
10.1016/j.patcog.2022.109255
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Ensemble clustering has emerged as a way of combining several basic clustering algorithms to achieve a high-quality final clustering. However, this technique is challenging due to complexities in the primary clusters such as overlapping, vagueness, instability and uncertainty. Typically, ensemble clustering uses all the primary clusters in the partitions for consensus, where the merits of a cluster or a partition can be considered to improve the quality of the consensus. In general, the robustness of a partition may be measured as poor even though it contains some high-quality clusters. Inspired by the evaluation of clusters and partitions, this paper proposes an ensemble hierarchical clustering algorithm based on the cluster consensus selection approach. Here, the selection of a subset of primary clusters from partitions based on their merit level is emphasized. Merit level is defined using a development of the Normalized Mutual Information measure. Clusters of basic clustering algorithms that satisfy the predefined threshold of this measure are selected to participate in the final consensus. In addition, the consensus of the selected primary clusters to create the final clusters is performed based on the clusters clustering technique. In this technique, the selected primary clusters are re-clustered to create hyper-clusters. Finally, the final clusters are formed by assigning instances to the hyper-clusters with the highest similarity. Here, an innovative criterion based on merit and cluster size for defining similarity is presented. The performance of the proposed algorithm has been demonstrated by extensive experiments on real-world datasets from the UCI repository in comparison with state-of-the-art algorithms such as CPDM, ENMI, IDEA, CFTLC and SSCEN. © 2022 Elsevier Ltd. All rights reserved.
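The pipeline outlined in the abstract (score each primary cluster, keep those above a threshold, re-cluster them into hyper-clusters, then assign instances) can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the merit of a cluster is approximated as the mean NMI between its in/out indicator and every base partition, hyper-clusters are built by average-linkage agglomeration on Jaccard similarity, and an instance's similarity to a hyper-cluster uses merit only (the abstract's criterion also involves cluster size, whose exact form is not given there). The function names are hypothetical.

```python
from collections import Counter
from itertools import combinations
from math import log

def nmi(a, b):
    """Normalized mutual information between two label sequences."""
    n = len(a)
    pa, pb, pab = Counter(a), Counter(b), Counter(zip(a, b))
    entropy = lambda c: -sum(v / n * log(v / n) for v in c.values())
    mi = sum(v / n * log(v * n / (pa[x] * pb[y])) for (x, y), v in pab.items())
    denom = (entropy(pa) * entropy(pb)) ** 0.5
    return mi / denom if denom > 0 else 0.0

def clusters_of(partition):
    """Split one partition (a label list) into clusters (frozensets of indices)."""
    groups = {}
    for i, label in enumerate(partition):
        groups.setdefault(label, set()).add(i)
    return [frozenset(g) for g in groups.values()]

def merit(cluster, partitions):
    """ASSUMED merit: mean NMI of the cluster's in/out indicator vs. each partition."""
    n = len(partitions[0])
    indicator = [i in cluster for i in range(n)]
    return sum(nmi(indicator, p) for p in partitions) / len(partitions)

def jaccard(a, b):
    return len(a & b) / len(a | b)

def hyper_clusters(clusters, k):
    """ASSUMED re-clustering: average-linkage agglomeration until k groups remain."""
    groups = [[c] for c in clusters]
    while len(groups) > k:
        def link(pair):
            g, h = groups[pair[0]], groups[pair[1]]
            return sum(jaccard(a, b) for a in g for b in h) / (len(g) * len(h))
        i, j = max(combinations(range(len(groups)), 2), key=link)
        merged = groups.pop(j)  # j > i, so index i stays valid
        groups[i].extend(merged)
    return groups

def consensus(partitions, k, threshold):
    """Select clusters by merit, build hyper-clusters, assign each instance."""
    n = len(partitions[0])
    scored = [(c, merit(c, partitions)) for p in partitions for c in clusters_of(p)]
    # Identical clusters from different partitions collapse into one dict entry.
    selected = {c: m for c, m in scored if m >= threshold}
    if not selected:
        raise ValueError("no cluster reaches the merit threshold")
    groups = hyper_clusters(list(selected), k)

    def score(i, g):
        # ASSUMED similarity: merit of the group's clusters containing i,
        # averaged over the group size.
        return sum(selected[c] for c in g if i in c) / len(g)

    return [max(range(len(groups)), key=lambda h: score(i, groups[h]))
            for i in range(n)]

if __name__ == "__main__":
    base = [[0, 0, 0, 1, 1, 1], [1, 1, 1, 0, 0, 0], [0, 0, 1, 1, 1, 1]]
    print(consensus(base, k=2, threshold=0.7))
```

In this toy input, the third partition misplaces one instance; its two clusters receive a lower assumed merit than the clusters shared by the first two partitions, so a threshold between the two merit values filters them out before consensus.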
Pages: 17
Related papers
50 in total
  • [41] An evolutionary many-objective algorithm based on decomposition and hierarchical clustering selection
    Sun, Yuehong
    Xiao, Kelian
    Wang, Siqiong
    Lv, Qiuyue
    APPLIED INTELLIGENCE, 2022, 52 (08) : 8464 - 8509
  • [42] A Scalable Cluster-based Hierarchical Hardware Accelerator for a Cortically Inspired Algorithm
    Dey, Sumon
    Baker, Lee
    Schabel, Joshua
    Li, Weifu
    Franzon, Paul D.
    ACM JOURNAL ON EMERGING TECHNOLOGIES IN COMPUTING SYSTEMS, 2021, 17 (04)
  • [43] Developing ensemble clustering through similarity measures: A semi-supervised hierarchical clustering learning
    Wang, Dandan
    Li, Qi
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2024, 36 (16)
  • [44] Modulation Format Identification Technology Based on a Searching Cluster Boundary Clustering Algorithm
    Yan, Xu
    Cao, Changqing
    Zhang, Wenrui
    Feng, Zhejun
    Zeng, Xiaodong
    Wu, Zengyan
    JOURNAL OF LIGHTWAVE TECHNOLOGY, 2023, 41 (01) : 105 - 113
  • [45] Ensemble Clustering Based Dimensional Reduction
    Abddallah, Loai
    Yousef, Malik
    DATABASE AND EXPERT SYSTEMS APPLICATIONS: DEXA 2018 INTERNATIONAL WORKSHOPS, 2018, 903 : 115 - 125
  • [46] Clustering Ensemble Based on Fuzzy Matrix Self-Enhancement
    Ji, Xia
    Sun, Jiawei
    Peng, Jianhua
    Pang, Yue
    Zhou, Peng
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2025, 37 (01) : 148 - 161
  • [47] Ensemble Based Support Vector Clustering
    Pu, Fei
    2017 2ND INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION ENGINEERING (ICRAE), 2017, : 496 - 500
  • [48] Ensemble clustering based on dense representation
    Zhou, Jie
    Zheng, Hongchan
    Pan, Lulu
    NEUROCOMPUTING, 2019, 357 : 66 - 76
  • [49] Adaptive Ensemble Clustering With Boosting BLS-Based Autoencoder
    Shi, Yifan
    Yang, Kaixiang
    Yu, Zhiwen
    Chen, C. L. Philip
    Zeng, Huanqiang
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (12) : 12369 - 12383
  • [50] A degree-distribution based hierarchical agglomerative clustering algorithm for protein complexes identification
    Yu, Liang
    Gao, Lin
    Li, Kui
    Zhao, Yi
    Chiu, David K. Y.
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2011, 35 (05) : 298 - 307