Combined Gaussian Mixture Model and Pathfinder Algorithm for Data Clustering

被引:3
作者
Huang, Huajuan [1 ,2 ]
Liao, Zepeng [1 ]
Wei, Xiuxi [1 ]
Zhou, Yongquan [1 ,2 ]
机构
[1] Guangxi Minzu Univ, Coll Artificial Intelligence, Nanning 530006, Peoples R China
[2] Guangxi Key Lab Hybrid Computat & IC Design Anal, Nanning 530006, Peoples R China
基金
中国国家自然科学基金;
关键词
clustering; Gaussian Mixture Models; metaheuristic algorithm; pathfinder algorithm; LEADERSHIP;
D O I
10.3390/e25060946
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
Data clustering is one of the most influential branches of machine learning and data analysis, and Gaussian Mixture Models (GMMs) are frequently adopted in data clustering due to their ease of implementation. However, there are certain limitations to this approach that need to be acknowledged. GMMs need to determine the cluster numbers manually, and they may fail to extract the information within the dataset during initialization. To address these issues, a new clustering algorithm called PFA-GMM has been proposed. PFA-GMM is based on GMMs and the Pathfinder algorithm (PFA), and it aims to overcome the shortcomings of GMMs. The algorithm automatically determines the optimal number of clusters based on the dataset. Subsequently, PFA-GMM considers the clustering problem as a global optimization problem for getting trapped in local convergence during initialization. Finally, we conducted a comparative study of our proposed clustering algorithm against other well-known clustering algorithms using both synthetic and real-world datasets. The results of our experiments indicate that PFA-GMM outperformed the competing approaches.
引用
收藏
页数:20
相关论文
共 41 条
[1]   Automatic clustering method to segment COVID-19 CT images [J].
Abd Elaziz, Mohamed ;
Al-qaness, Mohammed A. A. ;
Zaid, Esraa Osama Abo ;
Lu, Songfeng ;
Ibrahim, Rehab Ali ;
Ewees, Ahmed A. .
PLOS ONE, 2021, 16 (01)
[2]   A Review of the Modification Strategies of the Nature Inspired Algorithms for Feature Selection Problem [J].
Abu Khurma, Ruba ;
Aljarah, Ibrahim ;
Sharieh, Ahmad ;
Abd Elaziz, Mohamed ;
Damasevicius, Robertas ;
Krilavicius, Tomas .
MATHEMATICS, 2022, 10 (03)
[3]   FCM - THE FUZZY C-MEANS CLUSTERING-ALGORITHM [J].
BEZDEK, JC ;
EHRLICH, R ;
FULL, W .
COMPUTERS & GEOSCIENCES, 1984, 10 (2-3) :191-203
[4]  
Bilmes JA., 1998, A gentle tutorial of the em algorithm and its application to parameter estimation for gaussian mixture and hidden markov models, V4, P126, DOI DOI 10.1080/0042098032000136147
[5]   SOLUTION OF THE ASSIGNMENT PROBLEM [H] [J].
CARPANETO, G ;
TOTH, P .
ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 1980, 6 (01) :104-111
[6]   Cutoff for Exact Recovery of Gaussian Mixture Models [J].
Chen, Xiaohui ;
Yang, Yun .
IEEE TRANSACTIONS ON INFORMATION THEORY, 2021, 67 (06) :4223-4238
[7]   CLUSTER SEPARATION MEASURE [J].
DAVIES, DL ;
BOULDIN, DW .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1979, 1 (02) :224-227
[8]  
De Luca G, 2004, SKEW-ELLIPTICAL DISTRIBUTIONS AND THEIR APPLICATIONS: A JOURNEY BEYOND NORMALITY, P205
[9]   A practical tutorial on the use of nonparametric statistical tests as a methodology for comparing evolutionary and swarm intelligence algorithms [J].
Derrac, Joaquin ;
Garcia, Salvador ;
Molina, Daniel ;
Herrera, Francisco .
SWARM AND EVOLUTIONARY COMPUTATION, 2011, 1 (01) :3-18
[10]  
Doval D., 1999, STEP '99. Proceedings Ninth International Workshop Software Technology and Engineering Practice, P73, DOI 10.1109/STEP.1999.798481