A feature-thresholds guided genetic algorithm based on a multi-objective feature scoring method for high-dimensional feature selection

被引:31
作者
Deng, Shaobo [1 ]
Li, Yulong [1 ]
Wang, Junke [1 ]
Cao, Rutun [1 ]
Li, Min [1 ]
机构
[1] Nanchang Inst Technol, Sch Informat Engn, 289 Tianxiang Rd, Nanchang, Jiangxi, Peoples R China
基金
美国国家科学基金会;
关键词
Gene selection; Feature selection; Genetic algorithm; Feature score; Feature threshold; FEATURE SUBSET-SELECTION; DIFFERENTIAL EVOLUTION; OPTIMIZATION;
D O I
10.1016/j.asoc.2023.110765
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The classical genetic algorithm utilizes random population initialization, an unguided crossover operator, and an unguided mutation operator for feature selection. However, this approach may be too stochastic and result in slow convergence. This paper proposes a hybrid feature selection algorithm named the Feature-Thresholds Guided Genetic Algorithm (FTGGA) to overcome this deficiency. FTGGA first employs ReliefF to filter out redundant features and retains crucial ones. Then, it generates a feature-thresholds set that contains all the feature thresholds. Each feature threshold represents the probability that the corresponding feature will be selected. The feature-thresholds set continuously updates to guide the iteration process of the genetic algorithm, accelerating its convergence. The experimental data demonstrates that FTGGA has a smaller feature subset and better classification accuracy compared to other algorithms.
引用
收藏
页数:14
相关论文
共 40 条
[1]   A new fusion of grey wolf optimizer algorithm with a two-phase mutation for feature selection [J].
Abdel-Basset, Mohamed ;
El-Shahat, Doaa ;
El-henawy, Ibrahim ;
de Albuquerque, Victor Hugo C. ;
Mirjalili, Seyedali .
EXPERT SYSTEMS WITH APPLICATIONS, 2020, 139
[2]  
Tran B, 2016, IEEE C EVOL COMPUTAT, P3801, DOI 10.1109/CEC.2016.7744271
[3]   Molecular classification of Crohn's disease and ulcerative colitis patients using transcriptional profiles in peripheral blood mononuclear cells [J].
Burczynski, ME ;
Peterson, RL ;
Twine, NC ;
Zuberek, KA ;
Brodeur, BJ ;
Casciotti, L ;
Maganti, V ;
Reddy, PS ;
Strahs, A ;
Immermann, F ;
Spinelli, W ;
Schwertschlag, U ;
Slager, AM ;
Cotreau, MM ;
Dorner, AJ .
JOURNAL OF MOLECULAR DIAGNOSTICS, 2006, 8 (01) :51-61
[4]   Feature selection in machine learning: A new perspective [J].
Cai, Jie ;
Luo, Jiawei ;
Wang, Shulin ;
Yang, Sheng .
NEUROCOMPUTING, 2018, 300 :70-79
[5]   Prognostic gene expression signatures can be measured in tissues collected in RNAlater preservative [J].
Chowdary, D ;
Lathrop, J ;
Skelton, J ;
Curtin, K ;
Briggs, T ;
Zhang, Y ;
Yu, J ;
Wang, YX ;
Mazumder, A .
JOURNAL OF MOLECULAR DIAGNOSTICS, 2006, 8 (01) :31-39
[6]   Ensemble feature selection using bi-objective genetic algorithm [J].
Das, Asit K. ;
Das, Sunanda ;
Ghosh, Arka .
KNOWLEDGE-BASED SYSTEMS, 2017, 123 :116-127
[7]   Hybrid gene selection approach using XGBoost and multi-objective genetic algorithm for cancer classification [J].
Deng, Xiongshi ;
Li, Min ;
Deng, Shaobo ;
Wang, Lei .
MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2022, 60 (03) :663-681
[8]   Time-varying hierarchical chains of salps with random weight networks for feature selection [J].
Faris, Hossam ;
Heidari, Ali Asghar ;
Al-Zoubi, Ala' M. ;
Mafarja, Majdi ;
Aljarah, Ibrahim ;
Eshtay, Mohammed ;
Mirjalili, Seyedali .
EXPERT SYSTEMS WITH APPLICATIONS, 2020, 140
[9]   Feature subset selection in large dimensionality domains [J].
Gheyas, Iffat A. ;
Smith, Leslie S. .
PATTERN RECOGNITION, 2010, 43 (01) :5-13
[10]  
Guyon I., 2003, Journal of Machine Learning Research, V3, P1157, DOI 10.1162/153244303322753616