Bi-objective feature selection in high-dimensional datasets using improved binary chimp optimization algorithm

被引:5
作者
Al-qudah, Nour Elhuda A. [1 ]
Abed-alguni, Bilal H. [1 ]
Barhoush, Malek [2 ]
机构
[1] Yarmouk Univ, Dept Comp Sci, Irbid, Jordan
[2] Yarmouk Univ, Dept Informat Technol Cybersecur Program, Irbid, Jordan
基金
英国科研创新办公室;
关键词
Chimp optimization algorithm; Opposition-based learning; High-dimensional datasets; Feature selection; Levy flight; beta-Hill climbing algorithm; CUCKOO SEARCH; MODELS;
D O I
10.1007/s13042-024-02308-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The machine learning process in high-dimensional datasets is far more complicated than in low-dimensional datasets. In high-dimensional datasets, Feature Selection (FS) is necessary to decrease the complexity of learning. However, FS in high-dimensional datasets is a complex process that requires the combination of several search techniques. The Chimp Optimization Algorithm, known as ChOA, is a new meta-heuristic method inspired by the chimps' individual intellect and sexual incentive in cooperative hunting. It is basically employed in solving complex continuous optimization problems, while its binary version is frequently utilized in solving difficult binary optimization problems. Both versions of ChOA are subject to premature convergence and are incapable of effectively solving high-dimensional optimization problems. This paper proposes the Binary Improved ChOA Algorithm (BICHOA) for solving the bi-objective, high-dimensional FS problems (i.e., high-dimensional FS problems that aim to maximize the classifier's accuracy and minimize the number of selected features from a dataset). BICHOA improves the performance of ChOA using four new exploration and exploitation techniques. First, it employs the opposition-based learning approach to initially create a population of diverse binary feasible solutions. Second, it incorporates the L & eacute;vy mutation function in the main probabilistic update function of ChOA to boost its searching and exploring capabilities. Third, it uses an iterative exploration technique based on an exploratory local search method called the beta-hill climbing algorithm. Finally, it employs a new binary time-varying transfer function to calculate binary feasible solutions from the continuous feasible solutions generated by the update equations of the ChOA and beta-hill climbing algorithms. BICHOA's performance was assessed and compared against six machine learning classifiers, five integer programming methods, and nine efficient popular optimization algorithms using 25 real-world high-dimensional datasets from various domains. According to the overall experimental findings, BICHOA scored the highest accuracy, best objective value, and fewest selected features for each of the 25 real-world high-dimensional datasets. Besides, the reliability of the experimental findings was established using Friedman and Wilcoxon statistical tests.
引用
收藏
页码:6107 / 6148
页数:42
相关论文
共 100 条
[31]  
Brownlee J., 2020, MACHINE LEARNING MAS
[32]  
Cai D., 2010, Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining, P333, DOI DOI 10.1145/1835804.1835848
[33]   Exploration and Exploitation in Evolutionary Algorithms: A Survey [J].
Crepinsek, Matej ;
Liu, Shih-Hsi ;
Mernik, Marjan .
ACM COMPUTING SURVEYS, 2013, 45 (03)
[34]   Intrusion Detection System for IoT Based on Deep Learning and Modified Reptile Search Algorithm [J].
Dahou, Abdelghani ;
Abd Elaziz, Mohamed ;
Chelloug, Samia Allaoua ;
Awadallah, Mohammed A. ;
Al-Betar, Mohammed Azmi ;
Al-qaness, Mohammed A. A. ;
Forestiero, Agostino .
COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
[35]   A novel algorithm for global optimization: Rat Swarm Optimizer [J].
Dhiman, Gaurav ;
Garg, Meenakshi ;
Nagar, Atulya ;
Kumar, Vijay ;
Dehghani, Mohammad .
JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 12 (08) :8457-8482
[36]  
Dong Guozhu., 2018, Feature engineering for machine learning and data analytics
[37]   Time-varying hierarchical chains of salps with random weight networks for feature selection [J].
Faris, Hossam ;
Heidari, Ali Asghar ;
Al-Zoubi, Ala' M. ;
Mafarja, Majdi ;
Aljarah, Ibrahim ;
Eshtay, Mohammed ;
Mirjalili, Seyedali .
EXPERT SYSTEMS WITH APPLICATIONS, 2020, 140
[38]  
Fukunaga Keinosuke., 2013, Introduction to statistical pattern recognition
[39]   Cuckoo search algorithm: a metaheuristic approach to solve structural optimization problems [J].
Gandomi, Amir Hossein ;
Yang, Xin-She ;
Alavi, Amir Hossein .
ENGINEERING WITH COMPUTERS, 2013, 29 (01) :17-35
[40]  
Gibbons J.D., 1993, NONPARAMETRIC STAT, P31