A novel firefly algorithm approach for efficient feature selection with COVID-19 dataset

被引:24
作者
Bacanin, Nebojsa [1 ]
Venkatachalam, K. [2 ]
Bezdan, Timea [1 ]
Zivkovic, Miodrag [1 ]
Abouhawwash, Mohamed [3 ,4 ]
机构
[1] Singidunum Univ, Danijelova 32, Belgrade 11000, Serbia
[2] Univ Hradec Kralove, Fac Sci, Dept Appl Cybernet, Hradec Kralove 50003, Czech Republic
[3] Mansoura Univ, Fac Sci, Dept Math, Mansoura 35516, Egypt
[4] Michigan State Univ, Dept Computat Math Sci & Engn CMSE, E Lansing, MI 48824 USA
关键词
Firefly algorithm; Swarm intelligence; Quasi-reflection-based learning; Feature selection; Genetic operators; COVID-19; dataset; PARTICLE SWARM OPTIMIZATION; INTERNET;
D O I
10.1016/j.micpro.2023.104778
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Feature selection is one of the most important challenges in machine learning and data science. This process is usually performed in the data preprocessing phase, where the data is transformed to a proper format for further operations by machine learning algorithm. Many real-world datasets are highly dimensional with many irrelevant, even redundant features. These kinds of features do not improve classification accuracy and can even shrink down performance of a classifier. The goal of feature selection is to find optimal (or sub-optimal) subset of features that contain relevant information about the dataset from which machine learning algorithms can derive useful conclusions. In this manuscript, a novel version of firefly algorithm (FA) is proposed and adapted for feature selection challenge. Proposed method significantly improves performance of the basic FA, and also outperforms other state-of-the-art metaheuristics for both, benchmark bound-constrained and practical feature selection tasks. Method was first validated on standard unconstrained benchmarks and later it was applied for feature selection by using 21 standard University of California, Irvine (UCL) datasets. Moreover, presented approach was also tested for relatively novel COVID-19 dataset for predicting patients health, and one microcontroller microarray dataset. Results obtained in all practical simulations attest robustness and efficiency of proposed algorithm in terms of convergence, solutions' quality and classification accuracy. More precisely, the proposed approach obtained the best classification accuracy on 13 out of 21 total datasets, significantly outperforming other competitor methods.
引用
收藏
页数:21
相关论文
共 59 条
[1]  
Babatunde OluleyeH., 2014, A genetic algorithm-based feature selection
[2]   Feature Selection in Machine Learning by Hybrid Sine Cosine Metaheuristics [J].
Bacanin, Nebojsa ;
Petrovic, Aleksandar ;
Zivkovic, Miodrag ;
Bezdan, Timea ;
Antonijevic, Milos .
ADVANCES IN COMPUTING AND DATA SCIENCES, PT I, 2021, 1440 :604-616
[3]   Monarch Butterfly Optimization Based Convolutional Neural Network Design [J].
Bacanin, Nebojsa ;
Bezdan, Timea ;
Tuba, Eva ;
Strumberger, Ivana ;
Tuba, Milan .
MATHEMATICS, 2020, 8 (06)
[4]   Firefly Algorithm for Cardinality Constrained Mean-Variance Portfolio Optimization Problem with Entropy Diversity Constraint [J].
Bacanin, Nebojsa ;
Tuba, Milan .
SCIENTIFIC WORLD JOURNAL, 2014,
[5]   An Overview of Evolutionary Algorithms for Parameter Optimization [J].
Baeck, Thomas ;
Schwefel, Hans-Paul .
EVOLUTIONARY COMPUTATION, 1993, 1 (01) :1-23
[6]   A memetic algorithm using emperor penguin and social engineering optimization for medical data classification [J].
Baliarsingh, Santos Kumar ;
Ding, Weiping ;
Vipsita, Swati ;
Bakshi, Sambit .
APPLIED SOFT COMPUTING, 2019, 85
[7]  
Bezdan T., 2021, 2021 29 TELECOMMUNIC, P1
[8]  
Bezdan T., 2021, P 7 C ENG COMPUTER B, P1, DOI DOI 10.1145/3459960.3459974
[9]   Feature Selection by Hybrid Brain Storm Optimization Algorithm for COVID-19 Classification [J].
BEZDAN, T. I. M. E. A. ;
ZIVKOVIC, M. I. O. D. R. A. G. ;
BACANIN, N. E. B. O. J. S. A. ;
CHHABRA, A. M. I. T. ;
SURESH, M. U. T. H. U. S. A. M. Y. .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2022, 29 (06) :515-529
[10]   Comparative Study between the Improved Implementation of 3 Classic Mutation Operators for Genetic Algorithms [J].
Cazacu, Razvan .
10TH INTERNATIONAL CONFERENCE INTERDISCIPLINARITY IN ENGINEERING, INTER-ENG 2016, 2017, 181 :634-640