A novel member enhancement-based clustering ensemble algorithm

被引:0
|
作者
He, Yulin [1 ,2 ,3 ]
Yang, Jin [2 ]
Cheng, Yingchao [1 ]
Du, Xueqin [2 ]
Huang, Joshua Zhexue [1 ,2 ]
机构
[1] Guangdong Lab Artificial Intelligence & Digital Ec, Shenzhen, Peoples R China
[2] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen, Peoples R China
[3] Guangdong Lab Artificial Intelligence & Digital Ec, Shenzhen 518107, Peoples R China
基金
中国国家自然科学基金;
关键词
ensemble clustering; heterocluster; homocluster; MMD; neighborhood density; COMBINING MULTIPLE CLUSTERINGS; SELECTION; PARTITIONS; STABILITY; QUALITY;
D O I
10.1002/cpe.7992
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Clustering ensemble is a popular approach for identifying data clusters that combines the clustering results from multiple base clustering algorithms to produce more accurate and robust data clusters. However, the performance of clustering ensemble algorithms is highly dependent on the quality of clustering members. To address this problem, this paper proposes a member enhancement-based clustering ensemble (MECE) algorithm that selects the ensemble members by considering their distribution consistency. MECE has two main components, called heterocluster splitting and homocluster merging. The first component estimates two probability density functions (p.d.f.s) estimated on the sample points of an heterocluster and represents them using a Gaussian distribution and a Gaussian mixture model. If the random numbers generated by these two p.d.f.s have different probability distributions, the heterocluster is then split into smaller clusters. The second component merges the clusters that have high neighborhood densities into a homocluster, where the neighborhood density is measured using a novel evaluation criterion. In addition, a co-association matrix is presented, which serves as a summary for the ensemble of diverse clusters. A series of experiments were conducted to evaluate the feasibility and effectiveness of the proposed ensemble member generation algorithm. Results show that the proposed MECE algorithm can select high quality ensemble members and as a result yield the better clusterings than six state-of-the-art ensemble clustering algorithms, that is, cluster-based similarity partitioning algorithm (CSPA), meta-clustering algorithm (MCLA), hybrid bipartite graph formulation (HBGF), evidence accumulation clustering (EAC), locally weighted evidence accumulation (LWEA), and locally weighted graph partition (LWGP). Specifically, MECE algorithm has the nearly 23% higher average NMI, 27% higher average ARI, 15% higher average FMI, and 10% higher average purity than CSPA, MCLA, HBGF, EAC, LWEA, and LWGA algorithms. The experimental results demonstrate that MECE algorithm is a valid approach to deal with the clustering ensemble problems.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] DenMG: Density-Based Member Generation for Ensemble Clustering
    Du, Xueqin
    He, Yulin
    Fournier-Viger, Philippe
    Huang, Joshua Zhexue
    51ST INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING WORKSHOPS PROCEEDINGS, ICPP 2022, 2022,
  • [2] Clustering Ensemble Based on Fuzzy Matrix Self-Enhancement
    Ji, Xia
    Sun, Jiawei
    Peng, Jianhua
    Pang, Yue
    Zhou, Peng
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2025, 37 (01) : 148 - 161
  • [3] An ensemble hierarchical clustering algorithm based on merits at cluster and partition levels
    Huang, Qirui
    Gao, Rui
    Akhavan, Hoda
    PATTERN RECOGNITION, 2023, 136
  • [4] An ensemble agglomerative hierarchical clustering algorithm based on clusters clustering technique and the novel similarity measurement
    Li, Teng
    Rezaeipanah, Amin
    El Din, ElSayed M. Tag
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (06) : 3828 - 3842
  • [5] An Ensemble Clustering Framework Based on Hierarchical Clustering Ensemble Selection and Clusters Clustering
    Li, Wenjun
    Wang, Zikang
    Sun, Wei
    Bahrami, Sara
    CYBERNETICS AND SYSTEMS, 2023, 54 (05) : 741 - 766
  • [6] Ensemble clustering algorithm based on rapid simulated annealing
    Li H.
    Zhang Z.
    Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2019, 45 (08): : 1646 - 1652
  • [7] Co-Clustering Ensemble Based on Bilateral K-Means Algorithm
    Yang, Hui
    Peng, Han
    Zhu, Jianyong
    Nie, Feiping
    IEEE ACCESS, 2020, 8 : 51285 - 51294
  • [8] LWMC: A Locally Weighted Meta-Clustering Algorithm for Ensemble Clustering
    Huang, Dong
    Wang, Chang-Dong
    Lai, Jian-Huang
    NEURAL INFORMATION PROCESSING, ICONIP 2017, PT V, 2017, 10638 : 167 - 176
  • [9] Dependability-based cluster weighting in clustering ensemble
    Najafi, Fatemeh
    Parvin, Hamid
    Mirzaie, Kamal
    Nejatian, Samad
    Rezaie, Vahideh
    STATISTICAL ANALYSIS AND DATA MINING, 2020, 13 (02) : 151 - 164
  • [10] Design of SSVEP Enhancement-Based Brain Computer Interface
    Lin, Bor-Shing
    Wang, Hsiao-An
    Huang, Yao-Kuang
    Wang, Yu-Lin
    Lin, Bor-Shyh
    IEEE SENSORS JOURNAL, 2021, 21 (13) : 14330 - 14338