DenMG: Density-Based Member Generation for Ensemble Clustering

被引:0
作者
Du, Xueqin [1 ]
He, Yulin [1 ,2 ]
Fournier-Viger, Philippe [1 ]
Huang, Joshua Zhexue [1 ,2 ]
机构
[1] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen, Peoples R China
[2] Shenzhen Univ, Guangdong Lab Artificial Intelligence & Digital E, Shenzhen, Peoples R China
来源
51ST INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING WORKSHOPS PROCEEDINGS, ICPP 2022 | 2022年
基金
中国国家自然科学基金;
关键词
ensemble clustering; MMD; homocluster; heterocluster; neighborhood density; COMBINING MULTIPLE CLUSTERINGS;
D O I
10.1145/3547276.3548520
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Ensemble clustering is a popular approach for identifying clusters in data, which combines results from multiple clustering algorithms to obtain more accurate and robust clusters. However, the performance of ensemble clustering algorithms greatly depends on the quality of its members. Based on this observation, this paper proposes a density-based member generation (DenMG) algorithm that selects ensemble members by considering the distribution consistency. DenMG has two main components, which split sample points from a heterocluster and merge sample points to form a homocluster, respectively. The first component estimates two probability density functions ( p.d.f.s) based on an heterocluster's sample points, and represents them using a Gaussian distribution and a Gaussian mixture model. If random numbers generated by these two p.d.f.s are deemed to have different probability distributions, the heterocluster is split into smaller clusters. The second component merges clusters that have high neighborhood densities into a homocluster. This is done using an opposite-oriented criterion that measures neighborhood density. A series of experiments were conducted to demonstrate the feasibility and effectiveness of the proposed ensemble member generation algorithm. Results show that the proposed algorithm can generate high quality ensemble members and as a result yield better clustering than five state-of-the-art ensemble clustering algorithms.
引用
收藏
页数:7
相关论文
共 20 条
[1]   Cluster ensemble selection based on a new cluster stability measure [J].
Alizadeh, Hosein ;
Minaei-Bidgoli, Behrouz ;
Parvin, Hamid .
INTELLIGENT DATA ANALYSIS, 2014, 18 (03) :389-408
[2]  
[Anonymous], 2011, REPRODUCING KERNEL H
[3]  
Fern X. Z., 2003, P 20 INT C MACH LEAR, P186, DOI DOI 10.5555/3041838.3041862
[4]  
Fern X.Z., 2004, P 21 INT C MACH LEAR
[5]   Path-based clustering for grouping of smooth curves and texture segmentation [J].
Fischer, B ;
Buhmann, JM .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2003, 25 (04) :513-518
[6]   Combining multiple clusterings using evidence accumulation [J].
Fred, ALN ;
Jain, AK .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2005, 27 (06) :835-850
[7]  
Golalipour K, 2021, ENG APPL ARTIF INTEL, V104, DOI 10.1016/j.engappai.2021.104388
[8]  
[He Yulin 何玉林], 2021, [应用数学, Mathematics Applicata], V34, P284
[9]   Locally Weighted Ensemble Clustering [J].
Huang, Dong ;
Wang, Chang-Dong ;
Lai, Jian-Huang .
IEEE TRANSACTIONS ON CYBERNETICS, 2018, 48 (05) :1460-1473
[10]   Combining multiple clusterings via crowd agreement estimation and multi-granularity link analysis [J].
Huang, Dong ;
Lai, Jian-Huang ;
Wang, Chang-Dong .
NEUROCOMPUTING, 2015, 170 :240-250