FCM-SVM-RFE gene feature selection algorithm for leukemia classification from microarray gene expression data

被引:0
|
作者
Tang, YC [1 ]
Zhang, YQ [1 ]
Huang, Z [1 ]
机构
[1] Georgia State Univ, Dept Comp Sci, Atlanta, GA 30302 USA
关键词
microarray gene expression data analysis; gene selection; support vector machines; recursive feature elimination; fuzzy C-means clustering; ACUTE MYELOID-LEUKEMIA; CANCER; TUMOR;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Selecting the most possibly cancer-related genes from huge microarray gene expression data is an important bioinformatics research topic due to its significance to improve human's understandability of the inherent cancer-resulting mechanism. This is actually a feature selection problem. The huge number of genes makes it impossible to execute an exhaustive search. In this work, we propose a Recursive Feature Elimination (RFE) algorithm named FCM-SVM-RFE for the gene selection task. In each step, similar genes are grouped into clusters by the Fuzzy C-Means clustering algorithm, and then a Support Vector Machine (SVM) is modeled in each cluster-induced space, the genes which contribute large to the margin width of the SVM are selected to survive to the next step. This process is repeated until a pre-specified number of genes are selected. FCM-SVM-RFE is compared with SVM-RFE on AML/ALL microarray gene expression data. The experimental results show that FCM-SVM-RFE is more accurate than SVM-RFE to predict the unknown samples. More importantly, FCM-SVM-RFE can find some compact subsets of genes on each of which a SVM with perfect prediction accuracy can be modeled. These "most informative genes" are very helpful for biologists to efficiently and effectively find the inherent cancer-resulting mechanism.
引用
收藏
页码:97 / 101
页数:5
相关论文
共 50 条
  • [41] Distributed feature selection (DFS) strategy for microarray gene expression data to improve the classification performance
    Potharaju, Sai Prasad
    Sreedevi, M.
    CLINICAL EPIDEMIOLOGY AND GLOBAL HEALTH, 2019, 7 (02): : 171 - 176
  • [42] Microarray Gene Feature Classification based on LS-SVM
    Gao, Zhenbin
    INTERNATIONAL CONFERENCE ON FRONTIERS OF BIOLOGICAL SCIENCES AND ENGINEERING (FBSE 2018), 2019, 2058
  • [43] Incremental forward feature selection with application to microarray gene expression data
    Lee, Yuh-Jye
    Chang, Chien-Chung
    Chao, Chia-Huang
    JOURNAL OF BIOPHARMACEUTICAL STATISTICS, 2008, 18 (05) : 827 - 840
  • [44] Optimized gene selection and classification of cancer from microarray gene expression data using deep learning
    Shah, Shamveel Hussain
    Iqbal, Muhammad Javed
    Ahmad, Iftikhar
    Khan, Suleman
    Rodrigues, Joel J. P. C.
    NEURAL COMPUTING & APPLICATIONS, 2020,
  • [45] Feature selection and gene clustering from gene expression data
    Mitra, P
    Majumder, DD
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, 2004, : 343 - 346
  • [46] Gene expression data classification using genetic algorithm-based feature selection
    Sonmez, Oznur Sinem
    Dagtekin, Mustafa
    Ensari, Tolga
    TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2021, 29 (07) : 3165 - 3179
  • [47] Hybridization of Genetic and Quantum Algorithm for Gene Selection and Classification of Microarray Data
    Abderrahim, Allani
    Talbi, El-Ghazali
    Khaled, Mellouli
    2009 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL & DISTRIBUTED PROCESSING, VOLS 1-5, 2009, : 2226 - +
  • [48] A hybrid LDA and genetic algorithm for gene selection and classification of microarray data
    Bonilla Huerta, Edmundo
    Duval, Beatrice
    Hao, Jin-Kao
    NEUROCOMPUTING, 2010, 73 (13-15) : 2375 - 2383
  • [49] HYBRIDIZATION OF GENETIC AND QUANTUM ALGORITHM FOR GENE SELECTION AND CLASSIFICATION OF MICROARRAY DATA
    Abderrahim, Allani
    Talbi, El-Ghazali
    Khaled, Mellouli
    INTERNATIONAL JOURNAL OF FOUNDATIONS OF COMPUTER SCIENCE, 2012, 23 (02) : 431 - 444
  • [50] Spatial clustering based gene selection for gene expression analysis in microarray data classification
    Dhas, P. Edwin
    Lalitha, S.
    Govindaraj, Annalakshmi
    Jyoshna, B.
    AUTOMATIKA, 2024, 65 (01) : 152 - 158