A novel member enhancement-based clustering ensemble algorithm

被引:0
作者
He, Yulin [1 ,2 ,3 ]
Yang, Jin [2 ]
Cheng, Yingchao [1 ]
Du, Xueqin [2 ]
Huang, Joshua Zhexue [1 ,2 ]
机构
[1] Guangdong Lab Artificial Intelligence & Digital Ec, Shenzhen, Peoples R China
[2] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen, Peoples R China
[3] Guangdong Lab Artificial Intelligence & Digital Ec, Shenzhen 518107, Peoples R China
基金
中国国家自然科学基金;
关键词
ensemble clustering; heterocluster; homocluster; MMD; neighborhood density; COMBINING MULTIPLE CLUSTERINGS; SELECTION; PARTITIONS; STABILITY; QUALITY;
D O I
10.1002/cpe.7992
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Clustering ensemble is a popular approach for identifying data clusters that combines the clustering results from multiple base clustering algorithms to produce more accurate and robust data clusters. However, the performance of clustering ensemble algorithms is highly dependent on the quality of clustering members. To address this problem, this paper proposes a member enhancement-based clustering ensemble (MECE) algorithm that selects the ensemble members by considering their distribution consistency. MECE has two main components, called heterocluster splitting and homocluster merging. The first component estimates two probability density functions (p.d.f.s) estimated on the sample points of an heterocluster and represents them using a Gaussian distribution and a Gaussian mixture model. If the random numbers generated by these two p.d.f.s have different probability distributions, the heterocluster is then split into smaller clusters. The second component merges the clusters that have high neighborhood densities into a homocluster, where the neighborhood density is measured using a novel evaluation criterion. In addition, a co-association matrix is presented, which serves as a summary for the ensemble of diverse clusters. A series of experiments were conducted to evaluate the feasibility and effectiveness of the proposed ensemble member generation algorithm. Results show that the proposed MECE algorithm can select high quality ensemble members and as a result yield the better clusterings than six state-of-the-art ensemble clustering algorithms, that is, cluster-based similarity partitioning algorithm (CSPA), meta-clustering algorithm (MCLA), hybrid bipartite graph formulation (HBGF), evidence accumulation clustering (EAC), locally weighted evidence accumulation (LWEA), and locally weighted graph partition (LWGP). Specifically, MECE algorithm has the nearly 23% higher average NMI, 27% higher average ARI, 15% higher average FMI, and 10% higher average purity than CSPA, MCLA, HBGF, EAC, LWEA, and LWGA algorithms. The experimental results demonstrate that MECE algorithm is a valid approach to deal with the clustering ensemble problems.
引用
收藏
页数:23
相关论文
共 50 条
  • [41] A fuzzy clustering ensemble based on cluster clustering and iterative Fusion of base clusters
    Mojarad, Musa
    Nejatian, Samad
    Parvin, Hamid
    Mohammadpoor, Majid
    APPLIED INTELLIGENCE, 2019, 49 (07) : 2567 - 2581
  • [42] A New Self Adaptive Fuzzy Unsupervised Clustering Ensemble Based On Spectral Clustering
    Lahmar, Ines
    Zaier, Aida
    Yahia, Mohamed
    Bouallegue, Ridha
    PROCEEDINGS OF THE 2020 17TH INTERNATIONAL MULTI-CONFERENCE ON SYSTEMS, SIGNALS & DEVICES (SSD 2020), 2020, : 1 - 5
  • [43] Anchor-based fast spectral ensemble clustering
    Zhang, Runxin
    Hang, Shuaijun
    Sun, Zhensheng
    Nie, Feiping
    Wang, Rong
    Li, Xuelong
    INFORMATION FUSION, 2025, 113
  • [44] Ensemble classification based on supervised clustering for credit scoring
    Xiao, Hongshan
    Xiao, Zhi
    Wang, Yu
    APPLIED SOFT COMPUTING, 2016, 43 : 73 - 86
  • [45] Ensemble clustering based approach for software architecture recovery
    Puchala S.P.R.
    Chhabra J.K.
    Rathee A.
    International Journal of Information Technology, 2022, 14 (4) : 2013 - 2019
  • [46] Tumor Clustering based on Hybrid Cluster Ensemble Framework
    Yu, Zhiwen
    You, Jane
    Chen, Hantao
    Li, Le
    Wang, Xiaowei
    2012 INTERNATIONAL CONFERENCE ON COMPUTERIZED HEALTHCARE (ICCH), 2012, : 99 - +
  • [47] Clustering ensemble selection based on the extended Jaccard measure
    Khalili, Hajar
    Rabbani, Mohsen
    Akbari, Ebrahim
    TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2021, 29 (04) : 2215 - 2231
  • [48] Social Media User Partitioning Based on Ensemble Clustering
    Yu Wendong
    Li Hong
    Pan Na
    Liu Zhenzhen
    2016 13TH INTERNATIONAL CONFERENCE ON SERVICE SYSTEMS AND SERVICE MANAGEMENT, 2016,
  • [49] A hybrid ensemble pruning approach based on consensus clustering and multi-objective evolutionary algorithm for sentiment classification
    Onan, Aytug
    Korukoglu, Serdar
    Bulut, Hasan
    INFORMATION PROCESSING & MANAGEMENT, 2017, 53 (04) : 814 - 833
  • [50] A Novel Clustering Algorithm for Ad Hoc Network
    Gao, Li
    Mu, Dejun
    Wang, Yuexian
    Zhang, Guoqing
    Zhang, Li
    ICIEA: 2009 4TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, VOLS 1-6, 2009, : 440 - +