A hybrid chimp optimization algorithm and generalized normal distribution algorithm with opposition-based learning strategy for solving data clustering problems

被引:0
作者
Sayed Pedram Haeri Boroujeni
Elnaz Pashaei
机构
[1] School of Computing, Clemson University, Clemson, 29634, SC
[2] Department of Medical and Molecular Genetics, Indiana University School of Medicine, Indianapolis
关键词
Chimp optimization algorithm; Data clustering; Generalized normal distribution algorithm; K-means; Meta-heuristic optimization algorithm; Opposition-based learning;
D O I
10.1007/s42044-023-00160-x
中图分类号
学科分类号
摘要
This paper focuses on connectivity-based data clustering for categorizing similar and dissimilar data into distinct groups. Although classical clustering algorithms such as K-means are efficient techniques, they often trap in local optima and have a slow convergence rate in solving high-dimensional problems. To address these issues, many successful meta-heuristic optimization algorithms and intelligence-based methods have been introduced to attain the optimal solution in a reasonable time. In this study, we attempt to conceptualize a powerful approach using the three main components: Chimp Optimization Algorithm (ChOA), Generalized Normal Distribution Algorithm (GNDA), and Opposition-Based Learning (OBL) method. First, two versions of ChOA with two different dynamic coefficients and seven chaotic maps, entitled ChOA(I) and ChOA(II), are presented to achieve the best possible result for data clustering purposes. Second, a novel combination of ChOA and GNDA algorithms with the OBL strategy is devised to solve the major shortcomings of the original algorithms. Lastly, the proposed ChOAGNDA method is a Selective Opposition (SO) algorithm based on ChOA and GNDA, which can be used to tackle large and complex real-world optimization problems, particularly data clustering applications. In this study, eight benchmark datasets, including five datasets of the UCI machine learning repository and three challenging shape datasets, are used to investigate the general performance of the proposed method. The results are evaluated against several popular and recent state-of-the-art clustering techniques. Experimental results illustrate that the proposed work significantly outperforms other existing methods in terms of the Sum of Intra-Cluster Distances (SICD), Error Rate (ER), and convergence rate. © The Author(s), under exclusive licence to Springer Nature Switzerland AG 2023.
引用
收藏
页码:65 / 101
页数:36
相关论文
共 80 条
[31]  
Pedram Haeri Boroujeni S., Pashaei E., Data clustering using chimp optimization algorithm, ICCKE 2021—11th Int. Conf. Comput. Eng. Knowl., (2021)
[32]  
Wolpert D.H., Macready W.G., No free lunch theorems for optimization, IEEE Trans. Evol. Comput, (1997)
[33]  
Das P., Das D.K., Dey S., A new class topper optimization algorithm with an application to data clustering, IEEE Trans. Emerg. Top. Comput, (2020)
[34]  
Taherdangkoo M., Yazdi M., Bagheri M.H., A powerful and efficient evolutionary optimization algorithm based on stem cells algorithm for data clustering, Open Comput. Sci, (2012)
[35]  
Hatamlou A., Heart: a novel optimization algorithm for cluster analysis, Prog. Artif. Intell, (2014)
[36]  
Zadeh M.R.D., Fathian M., Gholamian M.R., A new method for clustering based on development of imperialist competitive algorithm, China Commun, (2014)
[37]  
Naik A., Use of teaching learning based optimization for data clustering, Lect. Notes Electr. Eng, (2020)
[38]  
Hatamlou A., Abdullah S., Hatamlou M., Data clustering using big bang-big crunch algorithm, Commun. Comput. Inf. Sci, (2011)
[39]  
Hatamlou A., Abdullah S., Nezamabadi-Pour H., Application of gravitational search algorithm on data clustering, Lect. Notes Comput. Sci. (Including Subser. Lect. Notes Artif. Intell. Lect. Notes Bioinformatics), (2011)
[40]  
Hatamlou A., Black hole: a new heuristic optimization approach for data clustering, Inf. Sci. (NY), (2013)