Efficient Feature Selection in High Dimensional Data Based on Enhanced Binary Chimp Optimization Algorithms and Machine Learning

被引:7
作者
Farid Ayeche
Adel Alti
机构
[1] LMETR,Faculty of Technologies, Technology Department
[2] University Ferhat Abbas Sétif-1,Department of Management Information Systems
[3] College of Business Qassim University,Faculty of Sciences, Computer Science Department
[4] LRSD,undefined
[5] University Ferhat Abbas Sétif-1,undefined
来源
Human-Centric Intelligent Systems | 2023年 / 3卷 / 4期
关键词
Feature selection; BChimp; Machine learning; Dimensionality reduction; Relevancy; Classification accuracy;
D O I
10.1007/s44230-023-00048-w
中图分类号
学科分类号
摘要
Feature selection with the highest performance accuracy is the biggest win for multidimensional data. The Chimpanzee Optimization Algorithm (ChOA) serves as a crucial technique for dealing with multidimensional global optimization issues. However, ChOA often lacks fast convergence and good selection of sensitive attributes leading to poor performance. To address these issues, most significant features were selected using two variants of ChOA called BChimp1 and BChimp2 (BChimp1 and BChimp are available at : https://www.mathworks.com/matlabcentral/fileexchange/133267-binary-chimpoptimization-algorithm-forfeatures-selection. September 22, 202). BChimp1 selects the optimal solution from the four best possible solutions and it applies a stochastic crossover on four moving solutions to deeply speed-up convergence level. BChimp2 uses the sigmoid function to select the significant features. Then, these features were trained using six-well known classifiers. The proposed techniques tend to select the most significant features, speed up the convergence rate and decrease training time for high-dimensional data. 23 standard datasets with six well-known classifiers were employed to assess the performance of BChimp1 and BChimp2. Experimental results validate the efficiency of BChimp1 and BChimp2 in enhancing accuracy by 83.83% and 82.02%, and reducing dimensionality by 42.77% and 72.54%, respectively. However, time-evaluation results of BChimp1 and BChimp2 in all datasets showed fast convergence and surpassed current optimization algorithms such as PSO, GWA, GOA, and GA.
引用
收藏
页码:558 / 587
页数:29
相关论文
共 50 条
[31]   An enhanced Harris hawk optimizer based on extreme learning machine for feature selection [J].
Alzaqebah, Abdullah ;
Al-Kadi, Omar ;
Aljarah, Ibrahim .
PROGRESS IN ARTIFICIAL INTELLIGENCE, 2023, 12 (01) :77-97
[32]   Efficient feature selection filters for high-dimensional data [J].
Ferreira, Artur J. ;
Figueiredo, Mario A. T. .
PATTERN RECOGNITION LETTERS, 2012, 33 (13) :1794-1804
[33]   A contrast based feature selection algorithm for high-dimensional datasets in machine learning [J].
Cao, Chunxu ;
Zhang, Qiang ;
Deng, Yuhui .
INFORMATION SCIENCES, 2025, 717
[34]   Metaheuristic-Based Feature Selection Methods for Diagnosing Sarcopenia with Machine Learning Algorithms [J].
Lee, Jaehyeong ;
Yoon, Yourim ;
Kim, Jiyoun ;
Kim, Yong-Hyuk .
BIOMIMETICS, 2024, 9 (03)
[35]   Mrmr plus and Cfs plus feature selection algorithms for high-dimensional data [J].
Angulo, Adrian Pino ;
Shin, Kilho .
APPLIED INTELLIGENCE, 2019, 49 (05) :1954-1967
[36]   Mrmr+ and Cfs+ feature selection algorithms for high-dimensional data [J].
Adrian Pino Angulo ;
Kilho Shin .
Applied Intelligence, 2019, 49 :1954-1967
[37]   Machine learning algorithms in production: A guideline for efficient data source selection [J].
Stanula, Patrick ;
Ziegenbein, Amina ;
Metternich, Joachim .
6TH CIRP GLOBAL WEB CONFERENCE - ENVISAGING THE FUTURE MANUFACTURING, DESIGN, TECHNOLOGIES AND SYSTEMS IN INNOVATION ERA (CIRPE 2018), 2018, 78 :261-266
[38]   Feature Selection Algorithms for High-dimensional Unbalanced Medical Data [J].
Liu, Jiaxuan ;
Li, Daiwei ;
Ren, Lijuan ;
Zhang, Haiqing ;
Tang, Xin ;
Xiang, Xiaoming .
2024 4TH INTERNATIONAL CONFERENCE ON INDUSTRIAL AUTOMATION, ROBOTICS AND CONTROL ENGINEERING, IARCE, 2024, :511-514
[39]   A Machine Learning-Based Wrapper Method for Feature Selection [J].
Patel, Damodar ;
Saxena, Amit ;
Wang, John .
INTERNATIONAL JOURNAL OF DATA WAREHOUSING AND MINING, 2024, 20 (01)
[40]   Feature Selection in High Dimensional Data: A Review [J].
Silaich, Sarita ;
Gupta, Suneet .
THIRD CONGRESS ON INTELLIGENT SYSTEMS, CIS 2022, VOL 1, 2023, 608 :703-717