A novel hybrid approach for feature selection enhancement: COVID-19 case study

被引:2
作者
Limam, Hela [1 ,2 ]
Hasni, Oumaima [2 ]
Ben Alaya, Ines [3 ]
机构
[1] Univ Tunis El Manar, Inst Super Informat, Tunis, Tunisia
[2] Inst Super Gest Tunis, Lab BestMod, Tunis, Tunisia
[3] Tunis El Manar Univ, Higher Inst Med Technol Tunis, Lab Biophys & Med Technol, Tunis, Tunisia
关键词
Hybrid feature selection; mutual information; backward feature elimination; COVID-19; MUTUAL INFORMATION; ALGORITHM; FILTER;
D O I
10.1080/10255842.2022.2112185
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Feature selection is a promising Artificial Intelligence technique for screening, analysing, predicting, and tracking current COVID-19 patients and likely future patients. Significant applications are developed to track data of confirmed, recovered, and death cases. In this work, we propose a new feature selection method based on a new way of hybridization between filter and wrapper methods. The proposed approach is expected to achieve high classification accuracy with a small feature subset. Specifically, the main contribution of this work is a four steps-based approach organized as follows: First, we remove consecutively duplicate and constant features. Then, we select the highest-ranked feature with Mutual Information. In the last step, we run the 'Backward Feature Elimination' algorithm to delete features from the active subset until a stopping criterion based on the degradation of classification performance is met. We applied the proposed approach to a COVID-19 dataset to test its ability to find the relevant feature for characterizing the disease, such as new cases infected with the virus, people vaccinated, and the number of deaths, to better assess the situation. For evaluation purposes, experiments are conducted at the first stage on the COVID-19 dataset, then on six benchmark datasets that have a high dimensional and large size. The method performance is tracked and measured on these datasets and a comparison with many approaches is provided.
引用
收藏
页码:1183 / 1197
页数:15
相关论文
共 42 条
[1]   COVID-19 image classification using deep learning: Advances, challenges and opportunities [J].
Aggarwal, Priya ;
Mishra, Narendra Kumar ;
Fatimah, Binish ;
Singh, Pushpendra ;
Gupta, Anubha ;
Joshi, Shiv Dutt .
COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 144
[2]  
AHA DW, 1991, MACH LEARN, V6, P37, DOI 10.1007/BF00153759
[3]  
Ali Rasha H., 2021, Journal of Physics: Conference Series, V1879, DOI 10.1088/1742-6596/1879/2/022083
[4]  
[Anonymous], 2000, ESIT
[5]   Two hybrid wrapper-filter feature selection algorithms applied to high-dimensional microarray experiments [J].
Apolloni, Javier ;
Leguizamon, Guillermo ;
Alba, Enrique .
APPLIED SOFT COMPUTING, 2016, 38 :922-932
[6]   USING MUTUAL INFORMATION FOR SELECTING FEATURES IN SUPERVISED NEURAL-NET LEARNING [J].
BATTITI, R .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (04) :537-550
[7]   Selection of relevant features and examples in machine learning [J].
Blum, AL ;
Langley, P .
ARTIFICIAL INTELLIGENCE, 1997, 97 (1-2) :245-271
[8]   Hybrid of binary gravitational search algorithm and mutual information for feature selection in intrusion detection systems [J].
Bostani, Hamid ;
Sheikhan, Mansour .
SOFT COMPUTING, 2017, 21 (09) :2307-2324
[9]  
Caruana R., 2003, Journal of Machine Learning Research, V3, P1245, DOI 10.1162/153244303322753652
[10]   Estimating optimal feature subsets using efficient estimation of high-dimensional mutual information [J].
Chow, TWS ;
Huang, D .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2005, 16 (01) :213-224