Serial filter-wrapper feature selection method with elite guided mutation strategy on cancer gene expression data

Cited by: 1
Authors
Song, Yu-Wei [1]
Wang, Jie-Sheng [1]
Qi, Yu-Liang [1]
Wang, Yu-Cai [1]
Song, Hao-Ming [1]
Shang-Guan, Yi-Peng [1]
Affiliations
[1] Univ Sci & Technol Liaoning, Sch Elect & Informat Engn, Anshan, Liaoning, Peoples R China
Keywords
Feature selection; Cancer gene expression; Equilibrium optimizer; Parallel filter methods; Elite guided mutation strategies; Serial hybrid frameworks; Classification; Algorithm
DOI
10.1007/s10462-024-11029-1
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Many researchers now use cancer gene expression data to diagnose cancer subtypes, but such data are often high-dimensional, multi-sample, and multi-class. To address this, a hybrid serial filter-wrapper feature selection (FS) method based on an elite guided mutation strategy is proposed for cancer gene expression data. The method is divided into a preliminary screening stage and a combined modeling stage. In the preliminary screening stage, the thresholds of seven filter methods are determined by leave-one-out cross-validation, and the features selected by these seven filters are combined into two subsets using the logical "And" and "Or" operations. The union of the two subsets is retained as the output of the preliminary screening stage and passed to the equilibrium optimizer (EO) in the subsequent combined modeling stage. The resulting hybrid framework, which connects the parallel filter method designed in the first stage with an improved EO in the second stage, is named SFEMEO. To demonstrate the effectiveness and generalization of SFEMEO, it is compared with nine other basic algorithms on 10 UCI data sets: its classification accuracy is higher by 0.56% to 20.19%, and its optimal fitness is also greatly improved. When SFEMEO is compared with nine other intelligent optimization algorithms on ten cancer gene expression data sets, its accuracy is 3.73% to 18.13% higher than that of most algorithms, and its optimal fitness is relatively superior. In addition, the Wilcoxon rank-sum test was used to evaluate the results of SFEMEO and the other intelligent optimization algorithms, which supports the effectiveness of the proposed hybrid framework and its superiority in solving the FS problem for high-dimensional cancer gene expression data.
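The preliminary screening step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the filter names, scores, and thresholds below are hypothetical, three filters stand in for the seven used in the paper, and the thresholds are fixed by hand rather than chosen by cross-validation. The sketch only shows how per-filter selections might be combined with logical "And" and "Or".

```python
import numpy as np

def combine_filter_masks(scores, thresholds):
    """Combine per-filter feature selections with logical AND and OR.

    scores and thresholds are hypothetical inputs: one score array and one
    cut-off per filter method. A feature is selected by a filter when its
    score reaches that filter's threshold.
    """
    # One boolean row per filter, one column per feature.
    masks = np.array([scores[name] >= thresholds[name] for name in scores])
    and_subset = masks.all(axis=0)   # features every filter selects
    or_subset = masks.any(axis=0)    # features at least one filter selects
    # Union of the two subsets. With plain set semantics this equals the
    # "Or" subset, because the "And" subset is contained in it.
    reserved = and_subset | or_subset
    return and_subset, or_subset, reserved

# Toy scores for 5 features under 3 assumed filters.
scores = {
    "filter_a": np.array([0.9, 0.2, 0.7, 0.1, 0.6]),
    "filter_b": np.array([0.8, 0.6, 0.3, 0.2, 0.9]),
    "filter_c": np.array([0.7, 0.1, 0.8, 0.3, 0.55]),
}
thresholds = {name: 0.5 for name in scores}

and_subset, or_subset, reserved = combine_filter_masks(scores, thresholds)
print("And subset:", np.flatnonzero(and_subset).tolist())
print("Or subset:", np.flatnonzero(or_subset).tolist())
print("Reserved subset:", np.flatnonzero(reserved).tolist())
```

In a serial filter-wrapper setup, the reserved mask would then restrict the search space handed to the wrapper stage; the abstract does not specify how SFEMEO encodes that hand-off, so it is left out here.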
Pages: 49