Enhanced NSGA-II-based feature selection method for high-dimensional classification

被引:25
作者
Li, Min [1 ,2 ]
Ma, Huan [1 ]
Lv, Siyu [1 ]
Wang, Lei [1 ]
Deng, Shaobo [1 ]
机构
[1] Nanchang Inst Technol, Sch Informat Engn, Nanchang 330099, Peoples R China
[2] Nanchang Inst Technol, 289 Tianxiang Ave, Nanchang 330099, Peoples R China
基金
中国国家自然科学基金;
关键词
NSGA-II; Feature selection; Multi -objective optimization; High -dimensional data; Classification; MULTIOBJECTIVE FEATURE-SELECTION; PARTICLE SWARM OPTIMIZATION; GENETIC ALGORITHM;
D O I
10.1016/j.ins.2024.120269
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Feature selection in high-dimensional data faces significant challenges owing to large and discrete decision spaces. In this study, we propose a feature selection method based on the nondominated sorting genetic algorithm-II (NSGA-II) to enhance the performance of feature selection in highdimensional data. This study makes four contributions: 1) The sparse initialization strategy is used to sparsen the search space and accelerate the convergence speed of the algorithm; 2) the guided selection operator is employed to strike a balance between exploration and exploitation abilities; 3) an intra-population evolution-based mutation operator dynamically shrinks the search space; and 4) a greedy repair strategy is adopted to generate improved feature subsets. The proposed method was validated on 15 publicly available high-dimensional datasets and compared with eight competitive multi-objective feature selection methods. The results demonstrate that the proposed method can achieve superior classification accuracy in a shorter time, with a smaller subset of features containing less redundancy.
引用
收藏
页数:29
相关论文
共 50 条
[1]  
Alelyani S, 2014, CH CRC DATA MIN KNOW, P29
[2]   An approach to feature selection for keystroke dynamics systems based on PSO and feature weighting [J].
Azevedo, Gabriel L. F. B. G. ;
Cavalcanti, George D. C. ;
Carvalho Filho, E. C. B. .
2007 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-10, PROCEEDINGS, 2007, :3577-3584
[3]   Novel chaotic oppositional fruit fly optimization algorithm for feature selection applied on COVID 19 patients' health prediction [J].
Bacanin, Nebojsa ;
Budimirovic, Nebojsa ;
Venkatachalam, K. ;
Strumberger, Ivana ;
Alrasheedi, Adel Fahad ;
Abouhawwash, Mohamed .
PLOS ONE, 2022, 17 (10)
[4]  
Bidgoli AA, 2019, IEEE C EVOL COMPUTAT, P1588, DOI [10.1109/cec.2019.8790287, 10.1109/CEC.2019.8790287]
[5]   A hybrid feature selection approach for Microarray datasets using graph theoretic-based method [J].
Chamlal, Hasna ;
Ouaderhman, Tayeb ;
Rebbah, Fatima Ezzahra .
INFORMATION SCIENCES, 2022, 615 :449-474
[6]   A survey on feature selection methods [J].
Chandrashekar, Girish ;
Sahin, Ferat .
COMPUTERS & ELECTRICAL ENGINEERING, 2014, 40 (01) :16-28
[7]   Evolutionary Multitasking for Feature Selection in High-Dimensional Classification via Particle Swarm Optimization [J].
Chen, Ke ;
Xue, Bing ;
Zhang, Mengjie ;
Zhou, Fengyu .
IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2022, 26 (03) :446-460
[8]   Evolutionary multi-objective optimization: A historical view of the field [J].
Coello Coello, Carlos A. .
IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE, 2006, 1 (01) :28-36
[9]   Improving the ranking quality of medical image retrieval using a genetic feature selection method [J].
da Silva, Sergio Francisco ;
Ribeiro, Marcela Xavier ;
Batista Neto, Joao do E. S. ;
Traina-, Caetano, Jr. ;
Traina, Agma J. M. .
DECISION SUPPORT SYSTEMS, 2011, 51 (04) :810-820
[10]   Normal-boundary intersection: A new method for generating the Pareto surface in nonlinear multicriteria optimization problems [J].
Das, I ;
Dennis, JE .
SIAM JOURNAL ON OPTIMIZATION, 1998, 8 (03) :631-657