Feature library-assisted surrogate model for evolutionary wrapper-based feature selection and classification

被引:5
|
作者
Guo, Hainan [1 ]
Ma, Junnan [2 ]
Wang, Ruiqi [2 ]
Zhou, Yu [2 ]
机构
[1] Shenzhen Univ, Coll Management, Shenzhen 518052, Guangdong, Peoples R China
[2] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518052, Guangdong, Peoples R China
关键词
Feature selection; Surrogate model; High-dimensional data; Classification; PARTICLE SWARM OPTIMIZATION; DIFFERENTIAL EVOLUTION; GENETIC ALGORITHM;
D O I
10.1016/j.asoc.2023.110241
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, wrapper-based feature selection (FS) using evolutionary algorithms has been widely studied due to its ability to search for and evaluate subsets of features based on populations. However, these methods often suffer from a high computational cost and a long computation time, mainly due to the process of evaluating the feature subsets according to the classification performance. In order to tackle this problem, this paper presents a feature library-assisted surrogate model (FL-SM), which aims to reduce the computational cost but maintain a good prediction accuracy. Unlike the existing surrogate models used in FS, the proposed method focuses on the feature level instead of the sample level: an FL is built by collecting the scores of all the features during the evolutionary search. Specifically, each solution (subset candidate) is pre-evaluated based on the FL using only simple operations to decide whether or not it deserves to be evaluated by the classifier, improving the efficiency of the FS algorithm. Meanwhile, because not evaluating a certain number of solutions may lead to inaccurate solution selection during the evolutionary search, dynamic individual selection criteria are proposed. In addition, an adaptive FL update operator is proposed to handle the dynamics of the evolved population; it ensures the real-time validity of the FL. Furthermore, we incorporate the proposed FL-SM into some state-of-the-art single-and multi-objective evolutionary FS methods. The experimental results on benchmark datasets show that with good flexibility and extendibility, FL-SM can effectively reduce the computational cost of wrapper-based FS and still obtain high-quality feature subsets. Among the five algorithms tested, the average computation time reduction was 34.87%; at the same time, there was no significant difference in the classification accuracy for 80% of the tests, and our method even improved the classification accuracy for 6% of the tests.& COPY; 2023 Elsevier B.V. All rights reserved.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] A Novel Wrapper-Based Optimization Algorithm for the Feature Selection and Classification
    Talpur, Noureen
    Abdulkadir, Said Jadid
    Hasan, Mohd Hilmi
    Alhussian, Hitham
    Alwadain, Ayed
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 74 (03): : 5799 - 5820
  • [2] Wrapper-Based Feature Selection to Classify Flatfoot Disease
    Miguel-Andres, Israel
    Ramos-Frutos, Jorge
    Sharawi, Marwa
    Oliva, Diego
    Reyes-Davila, Elivier
    Casas-Ordaz, Angel
    Perez-Cisneros, Marco
    Zapotecas-Martinez, Saul
    IEEE ACCESS, 2024, 12 : 22433 - 22447
  • [3] A wrapper-based feature selection for improving performance of intrusion detection systems
    Samadi Bonab, Maryam
    Ghaffari, Ali
    Soleimanian Gharehchopogh, Farhad
    Alemi, Payam
    INTERNATIONAL JOURNAL OF COMMUNICATION SYSTEMS, 2020, 33 (12)
  • [4] Wrapper-based feature selection: how important is the wrapped classifier?
    Bajer, Drazen
    Dudjak, Mario
    Zoric, Bruno
    PROCEEDINGS OF 2020 INTERNATIONAL CONFERENCE ON SMART SYSTEMS AND TECHNOLOGIES (SST 2020), 2020, : 97 - 105
  • [5] Evolutionary Feature Selection: A Novel Wrapper Feature Selection Architecture Based on Evolutionary Strategies
    Dubey, Aaryan
    Inoue, Alexandre Hoppe
    Fernandes Birmann, Pedro Terra
    da Silva, Sammuel Ramos
    PROCEEDINGS OF THE 2022 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'22), 2022, : 359 - 366
  • [6] Improving performance for classification with incomplete data using wrapper-based feature selection
    Tran C.T.
    Zhang M.
    Andreae P.
    Xue B.
    Evolutionary Intelligence, 2016, 9 (3) : 81 - 94
  • [7] Performance Evaluation of Wrapper-Based Feature Selection Techniques for Medical Datasets
    Kewat, Anil
    Srivastava, P. N.
    Kumhar, Dharamdas
    ADVANCES IN COMPUTING AND INTELLIGENT SYSTEMS, ICACM 2019, 2020, : 619 - 633
  • [8] Wrapper-based feature selection method for ADMET prediction using evolutionary computing
    Soto, Axel J.
    Cecchini, Rocio L.
    Vazquez, Gustavo E.
    Ponzoni, Ignacio
    EVOLUTIONARY COMPUTATION, MACHINE LEARNING AND DATA MINING IN BIOINFORMATICS, PROCEEDINGS, 2008, 4973 : 188 - 199
  • [9] Wrapper-Based Federated Feature Selection for IoT Environments
    Mahanipour, Afsaneh
    Khamfroush, Hana
    2023 INTERNATIONAL CONFERENCE ON COMPUTING, NETWORKING AND COMMUNICATIONS, ICNC, 2023, : 214 - 219
  • [10] Feature Subset Selection for High-Dimensional, Low Sampling Size Data Classification Using Ensemble Feature Selection With a Wrapper-Based Search
    Mandal, Ashis Kumar
    Nadim, MD.
    Saha, Hasi
    Sultana, Tangina
    Hossain, Md. Delowar
    Huh, Eui-Nam
    IEEE ACCESS, 2024, 12 : 62341 - 62357