A hybrid chimp optimization algorithm and generalized normal distribution algorithm with opposition-based learning strategy for solving data clustering problems

被引:0
作者
Sayed Pedram Haeri Boroujeni
Elnaz Pashaei
机构
[1] School of Computing, Clemson University, Clemson, 29634, SC
[2] Department of Medical and Molecular Genetics, Indiana University School of Medicine, Indianapolis
关键词
Chimp optimization algorithm; Data clustering; Generalized normal distribution algorithm; K-means; Meta-heuristic optimization algorithm; Opposition-based learning;
D O I
10.1007/s42044-023-00160-x
中图分类号
学科分类号
摘要
This paper focuses on connectivity-based data clustering for categorizing similar and dissimilar data into distinct groups. Although classical clustering algorithms such as K-means are efficient techniques, they often trap in local optima and have a slow convergence rate in solving high-dimensional problems. To address these issues, many successful meta-heuristic optimization algorithms and intelligence-based methods have been introduced to attain the optimal solution in a reasonable time. In this study, we attempt to conceptualize a powerful approach using the three main components: Chimp Optimization Algorithm (ChOA), Generalized Normal Distribution Algorithm (GNDA), and Opposition-Based Learning (OBL) method. First, two versions of ChOA with two different dynamic coefficients and seven chaotic maps, entitled ChOA(I) and ChOA(II), are presented to achieve the best possible result for data clustering purposes. Second, a novel combination of ChOA and GNDA algorithms with the OBL strategy is devised to solve the major shortcomings of the original algorithms. Lastly, the proposed ChOAGNDA method is a Selective Opposition (SO) algorithm based on ChOA and GNDA, which can be used to tackle large and complex real-world optimization problems, particularly data clustering applications. In this study, eight benchmark datasets, including five datasets of the UCI machine learning repository and three challenging shape datasets, are used to investigate the general performance of the proposed method. The results are evaluated against several popular and recent state-of-the-art clustering techniques. Experimental results illustrate that the proposed work significantly outperforms other existing methods in terms of the Sum of Intra-Cluster Distances (SICD), Error Rate (ER), and convergence rate. © The Author(s), under exclusive licence to Springer Nature Switzerland AG 2023.
引用
收藏
页码:65 / 101
页数:36
相关论文
共 80 条
[1]  
Soleimani M., Forouzanfar Z., Soltani M., Harandi M.J., Imbalanced multiclass medical data classification based on learning automata and neural network, EAI Endorsed Trans. AI Robot, 24, (2023)
[2]  
Mehrabi N., Boroujeni S.P.H., Age estimation based on facial images using hybrid features and particle swarm optimization, ICCKE 2021—11th Int. Conf. Comput. Eng. Knowl., (2021)
[3]  
Jaros R., Byrtus R., Dohnal J., Danys L., Baros J., Koziorek J., Zmij P., Martinek R., Advanced signal processing methods for condition monitoring, Arch. Comp. Methods Eng, 30, 3, pp. 1553-1577, (2023)
[4]  
Mehta V., Bawa S., Singh J., Stamantic clustering: combining statistical and semantic features for clustering of large text datasets, Expert Syst. Appl, (2021)
[5]  
Basar M.A., Hosen M.F., Paul B.K., Hasan M.R., Shamim S.M., Bhuyian T., Identification of drug and protein-protein interaction network among stress and depression: a bioinformatics approach, Inform. Med. Unlocked, 1, 37, (2023)
[6]  
Ezugwu A.E., Ikotun A.M., Oyelade O.O., Abualigah L., Agushaka J.O., Eke C.I., Akinyelu A.A., A comprehensive survey of clustering algorithms: state-of-the-art machine learning applications, taxonomy, challenges, and future research prospects, Eng. Appl. Artif. Intell, 1, 110, (2022)
[7]  
Catalyurek U., Devine K., Faraj M., Gottesburen L., Heuer T., Meyerhenke H., Sanders P., Schlag S., Schulz C., Seemaier D., Wagner D., More recent advances in (hyper) graph partitioning, ACM Comput. Surv, 55, 12, pp. 1-38, (2023)
[8]  
Sun G., Han R., Deng L., Li C., Yang G., Hierarchical structure-based joint operations algorithm for global optimization, Swarm Evol. Comput, 1, 79, (2023)
[9]  
Zhu J., Ma X., Martinez L., Zhan J., A probabilistic linguistic three-way decision method with regret theory via fuzzy c-means clustering algorithm, IEEE Trans. Fuzzy Syst, (2023)
[10]  
Bhattacharjee P., Mitra P., A survey of density based clustering algorithms, Front. Comput. Sci, (2021)