A Novel Wrapper-Based Optimization Algorithm for the Feature Selection and Classification

被引:10
作者
Talpur, Noureen [1 ]
Abdulkadir, Said Jadid [1 ]
Hasan, Mohd Hilmi [1 ]
Alhussian, Hitham [1 ]
Alwadain, Ayed [2 ]
机构
[1] Univ Teknol PETRONAS, Comp & Informat Sci Dept, Seri Iskandar 32610, Perak, Malaysia
[2] King Saud Univ, Community Coll, Comp Sci Dept, Riyadh 145111, Saudi Arabia
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2023年 / 74卷 / 03期
关键词
Machine learning; optimization; feature selection; classification; medical data; GAS SOLUBILITY OPTIMIZATION; SEARCH;
D O I
10.32604/cmc.2023.034025
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Machine learning (ML) practices such as classification have played a very important role in classifying diseases in medical science. Since medical science is a sensitive field, the pre-processing of medical data requires careful handling to make quality clinical decisions. Generally, medical data is considered high-dimensional and complex data that contains many irrelevant and redundant features. These factors indirectly upset the disease prediction and classification accuracy of any ML model. To address this issue, various data pre-processing methods called Feature Selection (FS) techniques have been presented in the literature. However, the majority of such techniques frequently suffer from local minima issues due to large solution space. Thus, this study has proposed a novel wrapper-based Sand Cat Swarm Optimization (SCSO) technique as an FS approach to find optimum features from ten benchmark medical datasets. The SCSO algorithm replicates the hunting and searching strategies of the sand cat while having the advantage of avoiding local optima and finding the ideal solution with minimal control variables. Moreover, K-Nearest Neighbor (KNN) classifier was used to evaluate the effectiveness of the features identified by the proposed SCSO algorithm. The performance of the proposed SCSO algorithm was compared with six state-of-the-art and recent wrapper-based optimization algorithms using the validation metrics of classification accuracy, optimum feature size, and computational cost in seconds. The simulation results on the benchmark medical datasets revealed that the proposed SCSO-KNN approach has outperformed comparative algorithms with an average classification accuracy of 93.96% by selecting 14.2 features within 1.91 s. Additionally, the Wilcoxon rank test was used to perform the significance analysis between the proposed SCSOKNN method and six other algorithms for a p-value less than 5.00E-02. The findings revealed that the proposed algorithm produces better outcomes with an average p-value of 1.82E-02. Moreover, potential future directions are also suggested as a result of the study's promising findings.
引用
收藏
页码:5799 / 5820
页数:22
相关论文
共 78 条
[1]   Tiki-taka algorithm: a novel metaheuristic inspired by football playing style [J].
Ab Rashid, Mohd Fadzil Faisae .
ENGINEERING COMPUTATIONS, 2021, 38 (01) :313-343
[2]   Boosting COVID-19 Image Classification Using MobileNetV3 and Aquila Optimizer Algorithm [J].
Abd Elaziz, Mohamed ;
Dahou, Abdelghani ;
Alsaleh, Naser A. ;
Elsheikh, Ammar H. ;
Saba, Amal I. ;
Ahmadein, Mahmoud .
ENTROPY, 2021, 23 (11)
[3]   Fitness Dependent Optimizer: Inspired by the Bee Swarming Reproductive Process [J].
Abdullah, Jaza Mahmood ;
Rashid, Tarik Ahmed .
IEEE ACCESS, 2019, 7 :43473-43486
[4]   Aquila Optimizer: A novel meta-heuristic optimization algorithm [J].
Abualigah, Laith ;
Yousri, Dalia ;
Abd Elaziz, Mohamed ;
Ewees, Ahmed A. ;
Al-qaness, Mohammed A. A. ;
Gandomi, Amir H. .
COMPUTERS & INDUSTRIAL ENGINEERING, 2021, 157 (157)
[5]   Brain epilepsy seizure detection using bio-inspired krill herd and artificial alga optimized neural network approaches [J].
Abugabah, Ahed ;
AlZubi, Ahmad Ali ;
Al-Maitah, Mohammed ;
Alarifi, Abdulaziz .
JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 12 (03) :3317-3328
[6]   Chaotic gaining sharing knowledge-based optimization algorithm: an improved metaheuristic algorithm for feature selection [J].
Agrawal, Prachi ;
Ganesh, Talari ;
Mohamed, Ali Wagdy .
SOFT COMPUTING, 2021, 25 (14) :9505-9528
[7]   S-shaped and V-shaped gaining-sharing knowledge-based algorithm for feature selection [J].
Agrawal, Prachi ;
Ganesh, Talari ;
Oliva, Diego ;
Mohamed, Ali Wagdy .
APPLIED INTELLIGENCE, 2022, 52 (01) :81-112
[8]   Metaheuristic Algorithms on Feature Selection: A Survey of One Decade of Research (2009-2019) [J].
Agrawal, Prachi ;
Abutarboush, Hattan F. ;
Ganesh, Talari ;
Mohamed, Ali Wagdy .
IEEE ACCESS, 2021, 9 :26766-26791
[9]   A novel binary gaining-sharing knowledge-based optimization algorithm for feature selection [J].
Agrawal, Prachi ;
Ganesh, Talari ;
Mohamed, Ali Wagdy .
NEURAL COMPUTING & APPLICATIONS, 2021, 33 (11) :5989-6008
[10]   A novel metaheuristic optimization algorithm: the monarchy metaheuristic [J].
Ahmia, Ibtissam ;
Aider, Meziane .
TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2019, 27 (01) :362-376