Enhanced NSGA-II-based feature selection method for high-dimensional classification

被引:16
作者
Li, Min [1 ,2 ]
Ma, Huan [1 ]
Lv, Siyu [1 ]
Wang, Lei [1 ]
Deng, Shaobo [1 ]
机构
[1] Nanchang Inst Technol, Sch Informat Engn, Nanchang 330099, Peoples R China
[2] Nanchang Inst Technol, 289 Tianxiang Ave, Nanchang 330099, Peoples R China
基金
中国国家自然科学基金;
关键词
NSGA-II; Feature selection; Multi -objective optimization; High -dimensional data; Classification; MULTIOBJECTIVE FEATURE-SELECTION; PARTICLE SWARM OPTIMIZATION; GENETIC ALGORITHM;
D O I
10.1016/j.ins.2024.120269
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Feature selection in high-dimensional data faces significant challenges owing to large and discrete decision spaces. In this study, we propose a feature selection method based on the nondominated sorting genetic algorithm-II (NSGA-II) to enhance the performance of feature selection in highdimensional data. This study makes four contributions: 1) The sparse initialization strategy is used to sparsen the search space and accelerate the convergence speed of the algorithm; 2) the guided selection operator is employed to strike a balance between exploration and exploitation abilities; 3) an intra-population evolution-based mutation operator dynamically shrinks the search space; and 4) a greedy repair strategy is adopted to generate improved feature subsets. The proposed method was validated on 15 publicly available high-dimensional datasets and compared with eight competitive multi-objective feature selection methods. The results demonstrate that the proposed method can achieve superior classification accuracy in a shorter time, with a smaller subset of features containing less redundancy.
引用
收藏
页数:29
相关论文
共 50 条
  • [1] Alelyani S, 2014, CH CRC DATA MIN KNOW, P29
  • [2] An approach to feature selection for keystroke dynamics systems based on PSO and feature weighting
    Azevedo, Gabriel L. F. B. G.
    Cavalcanti, George D. C.
    Carvalho Filho, E. C. B.
    [J]. 2007 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-10, PROCEEDINGS, 2007, : 3577 - 3584
  • [3] Novel chaotic oppositional fruit fly optimization algorithm for feature selection applied on COVID 19 patients' health prediction
    Bacanin, Nebojsa
    Budimirovic, Nebojsa
    Venkatachalam, K.
    Strumberger, Ivana
    Alrasheedi, Adel Fahad
    Abouhawwash, Mohamed
    [J]. PLOS ONE, 2022, 17 (10):
  • [4] Bidgoli AA, 2019, IEEE C EVOL COMPUTAT, P1588, DOI [10.1109/CEC.2019.8790287, 10.1109/cec.2019.8790287]
  • [5] A hybrid feature selection approach for Microarray datasets using graph theoretic-based method
    Chamlal, Hasna
    Ouaderhman, Tayeb
    Rebbah, Fatima Ezzahra
    [J]. INFORMATION SCIENCES, 2022, 615 : 449 - 474
  • [6] A survey on feature selection methods
    Chandrashekar, Girish
    Sahin, Ferat
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2014, 40 (01) : 16 - 28
  • [7] Evolutionary Multitasking for Feature Selection in High-Dimensional Classification via Particle Swarm Optimization
    Chen, Ke
    Xue, Bing
    Zhang, Mengjie
    Zhou, Fengyu
    [J]. IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2022, 26 (03) : 446 - 460
  • [8] Evolutionary multi-objective optimization: A historical view of the field
    Coello Coello, Carlos A.
    [J]. IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE, 2006, 1 (01) : 28 - 36
  • [9] Improving the ranking quality of medical image retrieval using a genetic feature selection method
    da Silva, Sergio Francisco
    Ribeiro, Marcela Xavier
    Batista Neto, Joao do E. S.
    Traina-, Caetano, Jr.
    Traina, Agma J. M.
    [J]. DECISION SUPPORT SYSTEMS, 2011, 51 (04) : 810 - 820
  • [10] Normal-boundary intersection: A new method for generating the Pareto surface in nonlinear multicriteria optimization problems
    Das, I
    Dennis, JE
    [J]. SIAM JOURNAL ON OPTIMIZATION, 1998, 8 (03) : 631 - 657