Nested ensemble selection: An effective hybrid feature selection method

被引:6
作者
Kamalov, Firuz [1 ]
Sulieman, Hana [2 ]
Moussa, Sherif [1 ]
Reyes, Jorge Avante [1 ]
Safaraliev, Murodbek [3 ]
机构
[1] Canadian Univ Dubai, Dept Elect Engn, Dubai, U Arab Emirates
[2] Amer Univ Sharjah, Dept Math & Stat, Sharjah, U Arab Emirates
[3] Ural Fed Univ, Dept Automated Elect Syst, Ekaterinburg, Russia
关键词
Feature selection; Ensemble selection; Random forest; Synthetic data; Machine learning; Filter method; Wrapper method; GENE SELECTION;
D O I
10.1016/j.heliyon.2023.e19686
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
It has been shown that while feature selection algorithms are able to distinguish between relevant and irrelevant features, they fail to differentiate between relevant and redundant and correlated features. To address this issue, we propose a highly effective approach, called Nested Ensemble Selection (NES), that is based on a combination of filter and wrapper methods. The proposed feature selection algorithm differs from the existing filter-wrapper hybrid methods in its simplicity and efficiency as well as precision. The new algorithm is able to separate the relevant variables from the irrelevant as well as the redundant and correlated features. Furthermore, we provide a robust heuristic for identifying the optimal number of selected features which remains one of the greatest challenges in feature selection. Numerical experiments on synthetic and real-life data demonstrate the effectiveness of the proposed method. The NES algorithm achieves perfect precision on the synthetic data and near optimal accuracy on the real-life data. The proposed method is compared against several popular algorithms including mRMR, Boruta, genetic, recursive feature elimination, Lasso, and Elastic Net. The results show that NES significantly outperforms the benchmarks algorithms especially on multi-class datasets.
引用
收藏
页数:13
相关论文
共 55 条
[1]   A Review of the Modification Strategies of the Nature Inspired Algorithms for Feature Selection Problem [J].
Abu Khurma, Ruba ;
Aljarah, Ibrahim ;
Sharieh, Ahmad ;
Abd Elaziz, Mohamed ;
Damasevicius, Robertas ;
Krilavicius, Tomas .
MATHEMATICS, 2022, 10 (03)
[2]  
Ado A., 2021, 11 ANN INT C IND ENG, P4664
[3]   A hybrid fuzzy feature selection algorithm for high-dimensional regression problems: An mRMR-based framework [J].
Aghaeipoor, Fatemeh ;
Javidi, Mohammad Masoud .
EXPERT SYSTEMS WITH APPLICATIONS, 2020, 162
[4]   Gene selection for microarray data classification based on Gray Wolf Optimizer enhanced with TRIZ-inspired operators [J].
Alomari, Osama Ahmad ;
Makhadmeh, Sharif Naser ;
Al-Betar, Mohammed Azmi ;
Alyasseri, Zaid Abdi Alkareem ;
Abu Doush, Iyad ;
Abasi, Ammar Kamal ;
Awadallah, Mohammed A. ;
Abu Zitar, Raed .
KNOWLEDGE-BASED SYSTEMS, 2021, 223
[5]   A framework for feature selection through boosting [J].
Alsahaf, Ahmad ;
Petkov, Nicolai ;
Shenoy, Vikram ;
Azzopardi, George .
EXPERT SYSTEMS WITH APPLICATIONS, 2022, 187
[6]   An Effective Filter Method Towards the Performance Improvement of FF-SVM Algorithm [J].
Alshamlan, Hala M. .
IEEE ACCESS, 2021, 9 :140835-140840
[7]   A two-layer feature selection method using Genetic Algorithm and Elastic Net [J].
Amini, Fatemeh ;
Hu, Guiping .
EXPERT SYSTEMS WITH APPLICATIONS, 2021, 166
[8]  
[Anonymous], 2017, Ann Data Sci, DOI DOI 10.1007/S40745-017-0116-1
[9]   mRMR-PSO: A Hybrid Feature Selection Technique with a Multiobjective Approach for Sign Language Recognition [J].
BansalnAff, Sandhya Rani ;
Wadhawan, Savita ;
Goel, Rajeev .
ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2022, 47 (08) :10365-10380