A binary individual search strategy-based bi-objective evolutionary algorithm for high-dimensional feature selection

被引:36
作者
Li, Tao [1 ,2 ]
Zhan, Zhi-Hui [3 ]
Xu, Jiu-Cheng [1 ]
Yang, Qiang [4 ]
Ma, Yuan-Yuan [1 ,2 ]
机构
[1] Henan Normal Univ, Coll Comp & Informat Engn, Xinxiang 453007, Peoples R China
[2] Key Lab Artificial Intelligence & Personalized Lea, Xinxiang, Henan, Peoples R China
[3] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510006, Peoples R China
[4] Nanjing Univ Informat Sci & Technol, Coll Artificial Intelligence, Nanjing 210000, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-objective optimization; Feature selection; Binary individual; Search strategy; Evolutionary Computation; PARTICLE SWARM OPTIMIZATION; GENETIC ALGORITHM; DIFFERENTIAL EVOLUTION; CLASSIFICATION;
D O I
10.1016/j.ins.2022.07.183
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Evolutionary computation is promising in tackling with the feature selection problem, but still has poor performance in obtaining good feature subset in high-dimensional problems. In order to efficiently obtain the optimal feature subset with higher classification accuracy and lower feature dimensions, a binary individual search strategy-based bi-objective evo-lutionary algorithm is proposed. The proposed algorithm has three advantages and contri-butions. Firstly, an improved fisher score is utilized to preprocess the feature space to remove the irrelevant and redundant features. It can decrease the feature dimensionality and compress the search space of feature subset effectively. Secondly, a binary individual search strategy is developed that contains a nearest neighbor binary individual crossover operator and an adaptive binary individual mutation operator, which can search the global optimal feature combination. Thirdly, enhanced population entropy and improved average convergence rate are adopted to monitor the correlation between the diversity of the pop-ulation and the convergence of optimization objectives. Promising experimental results on twelve high-dimensional datasets reveal that the proposed algorithm can obtain compet-itive classification accuracy and effectively reduce the size of feature subset compared with ten state-of-the-art evolutionary algorithms.(c) 2022 Elsevier Inc. All rights reserved.
引用
收藏
页码:651 / 673
页数:23
相关论文
共 50 条
[21]   Distributed Feature Selection for Multi-Class Classification Using ADMM [J].
Jung, Daniel .
IEEE CONTROL SYSTEMS LETTERS, 2021, 5 (03) :821-826
[22]  
Khan A., 2015, J. appl. res. technol, V13, P145
[23]   MLFS-CCDE: multi-objective large-scale feature selection by cooperative coevolutionary differential evolution [J].
Li, Haoran ;
He, Fazhi ;
Chen, Yilin ;
Pan, Yiteng .
MEMETIC COMPUTING, 2021, 13 (01) :1-18
[24]   Challenges of Feature Selection for Big Data Analytics [J].
Li J. ;
Liu H. .
1600, Institute of Electrical and Electronics Engineers Inc., United States (32) :9-15
[25]   Binary Differential Evolution Based on Individual Entropy for Feature Subset Optimization [J].
Li, Tao ;
Dong, Hongbin ;
Sun, Jing .
IEEE ACCESS, 2019, 7 :24109-24121
[26]   A particle swarm optimization based multiobjective memetic algorithm for high-dimensional feature selection [J].
Luo, Juanjuan ;
Zhou, Dongqing ;
Jiang, Lingling ;
Ma, Huadong .
MEMETIC COMPUTING, 2022, 14 (01) :77-93
[27]   A tribe competition-based genetic algorithm for feature selection in pattern classification [J].
Ma, Benteng ;
Xia, Yong .
APPLIED SOFT COMPUTING, 2017, 58 :328-338
[28]   Variable-Size Cooperative Coevolutionary Particle Swarm Optimization for Feature Selection on High-Dimensional Data [J].
Song, Xian-Fang ;
Zhang, Yong ;
Guo, Yi-Nan ;
Sun, Xiao-Yan ;
Wang, Yong-Li .
IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2020, 24 (05) :882-895
[29]   Feature Selection Using Fuzzy Neighborhood Entropy-Based Uncertainty Measures for Fuzzy Neighborhood Multigranulation Rough Sets [J].
Sun, Lin ;
Wang, Lanying ;
Ding, Weiping ;
Qian, Yuhua ;
Xu, Jiucheng .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2021, 29 (01) :19-33
[30]   A novel wrapper-based feature subset selection method using modified binary differential evolution algorithm [J].
Tarkhaneh, Omid ;
Thanh Thi Nguyen ;
Mazaheri, Samaneh .
INFORMATION SCIENCES, 2021, 565 :278-305