A novel member enhancement-based clustering ensemble algorithm

被引:0
|
作者
He, Yulin [1 ,2 ,3 ]
Yang, Jin [2 ]
Cheng, Yingchao [1 ]
Du, Xueqin [2 ]
Huang, Joshua Zhexue [1 ,2 ]
机构
[1] Guangdong Lab Artificial Intelligence & Digital Ec, Shenzhen, Peoples R China
[2] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen, Peoples R China
[3] Guangdong Lab Artificial Intelligence & Digital Ec, Shenzhen 518107, Peoples R China
基金
中国国家自然科学基金;
关键词
ensemble clustering; heterocluster; homocluster; MMD; neighborhood density; COMBINING MULTIPLE CLUSTERINGS; SELECTION; PARTITIONS; STABILITY; QUALITY;
D O I
10.1002/cpe.7992
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Clustering ensemble is a popular approach for identifying data clusters that combines the clustering results from multiple base clustering algorithms to produce more accurate and robust data clusters. However, the performance of clustering ensemble algorithms is highly dependent on the quality of clustering members. To address this problem, this paper proposes a member enhancement-based clustering ensemble (MECE) algorithm that selects the ensemble members by considering their distribution consistency. MECE has two main components, called heterocluster splitting and homocluster merging. The first component estimates two probability density functions (p.d.f.s) estimated on the sample points of an heterocluster and represents them using a Gaussian distribution and a Gaussian mixture model. If the random numbers generated by these two p.d.f.s have different probability distributions, the heterocluster is then split into smaller clusters. The second component merges the clusters that have high neighborhood densities into a homocluster, where the neighborhood density is measured using a novel evaluation criterion. In addition, a co-association matrix is presented, which serves as a summary for the ensemble of diverse clusters. A series of experiments were conducted to evaluate the feasibility and effectiveness of the proposed ensemble member generation algorithm. Results show that the proposed MECE algorithm can select high quality ensemble members and as a result yield the better clusterings than six state-of-the-art ensemble clustering algorithms, that is, cluster-based similarity partitioning algorithm (CSPA), meta-clustering algorithm (MCLA), hybrid bipartite graph formulation (HBGF), evidence accumulation clustering (EAC), locally weighted evidence accumulation (LWEA), and locally weighted graph partition (LWGP). Specifically, MECE algorithm has the nearly 23% higher average NMI, 27% higher average ARI, 15% higher average FMI, and 10% higher average purity than CSPA, MCLA, HBGF, EAC, LWEA, and LWGA algorithms. The experimental results demonstrate that MECE algorithm is a valid approach to deal with the clustering ensemble problems.
引用
收藏
页数:23
相关论文
共 50 条
  • [21] Ensemble Clustering Based Dimensional Reduction
    Abddallah, Loai
    Yousef, Malik
    DATABASE AND EXPERT SYSTEMS APPLICATIONS: DEXA 2018 INTERNATIONAL WORKSHOPS, 2018, 903 : 115 - 125
  • [22] Ensemble Based Support Vector Clustering
    Pu, Fei
    2017 2ND INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION ENGINEERING (ICRAE), 2017, : 496 - 500
  • [23] Ensemble clustering based on dense representation
    Zhou, Jie
    Zheng, Hongchan
    Pan, Lulu
    NEUROCOMPUTING, 2019, 357 : 66 - 76
  • [24] A Novel Fuzzy Clustering Algorithm Based on Similarity of Attribute Space
    Shi Weifeng
    Zhuo Jinbao
    Lan Ying
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2019, 41 (11) : 2722 - 2728
  • [25] Ensemble clustering based on Evidence theory
    Wang, Xueen
    Han, Deqiang
    Han, Chongzhao
    2017 20TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION), 2017, : 759 - 767
  • [26] CSLSEP: an ensemble pruning algorithm based on clustering soft label and sorting for facial expression recognition
    Huang, Shisong
    Li, Danyang
    Zhang, Zhuhong
    Wu, Yating
    Tang, Yumei
    Chen, Xing
    Wu, Yiqing
    MULTIMEDIA SYSTEMS, 2023, 29 (03) : 1463 - 1479
  • [27] A Novel Ensemble Clustering Approach with Internal Weighting Strategy
    Zhao, Wenfei
    Lian, Cheng
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 2521 - 2526
  • [28] A new Ensemble Clustering Algorithm using a Reconstructed Mapping Coefficient
    Cao, Tuoqia
    Chang, Dongxia
    Zhao, Yao
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2020, 14 (07) : 2957 - 2980
  • [29] Ensemble Partial Least Squares Algorithm Based on Variable Clustering for Quantitative Infrared Spectrometric Analysis
    Bi Yi-Ming
    Chu Guo-Hai
    Wu Ji-Zhong
    Yuan Kai-Long
    Wu Jian
    Liao Fu
    Xia Jun
    Zhang Guang-Xin
    Zhou Guo-Jun
    CHINESE JOURNAL OF ANALYTICAL CHEMISTRY, 2015, 43 (07) : 1086 - U62
  • [30] STABLE CLUSTERING ENSEMBLE BASED ON EVIDENCE THEORY
    Fu, Haijie
    Yue, Xiaodong
    Liu, Wei
    Denoeux, Thierry
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 2046 - 2050