FCM-SVM-RFE gene feature selection algorithm for leukemia classification from microarray gene expression data

被引:0
|
作者
Tang, YC [1 ]
Zhang, YQ [1 ]
Huang, Z [1 ]
机构
[1] Georgia State Univ, Dept Comp Sci, Atlanta, GA 30302 USA
关键词
microarray gene expression data analysis; gene selection; support vector machines; recursive feature elimination; fuzzy C-means clustering; ACUTE MYELOID-LEUKEMIA; CANCER; TUMOR;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Selecting the most possibly cancer-related genes from huge microarray gene expression data is an important bioinformatics research topic due to its significance to improve human's understandability of the inherent cancer-resulting mechanism. This is actually a feature selection problem. The huge number of genes makes it impossible to execute an exhaustive search. In this work, we propose a Recursive Feature Elimination (RFE) algorithm named FCM-SVM-RFE for the gene selection task. In each step, similar genes are grouped into clusters by the Fuzzy C-Means clustering algorithm, and then a Support Vector Machine (SVM) is modeled in each cluster-induced space, the genes which contribute large to the margin width of the SVM are selected to survive to the next step. This process is repeated until a pre-specified number of genes are selected. FCM-SVM-RFE is compared with SVM-RFE on AML/ALL microarray gene expression data. The experimental results show that FCM-SVM-RFE is more accurate than SVM-RFE to predict the unknown samples. More importantly, FCM-SVM-RFE can find some compact subsets of genes on each of which a SVM with perfect prediction accuracy can be modeled. These "most informative genes" are very helpful for biologists to efficiently and effectively find the inherent cancer-resulting mechanism.
引用
收藏
页码:97 / 101
页数:5
相关论文
共 50 条
  • [31] An Integrated Feature Selection Algorithm for Cancer Classification using Gene Expression Data
    Ahmed, Saeed
    Kabir, Muhammad
    Ali, Zakir
    Arif, Muhammad
    Ali, Farman
    Yu, Dong-Jun
    COMBINATORIAL CHEMISTRY & HIGH THROUGHPUT SCREENING, 2018, 21 (09) : 631 - 645
  • [32] A novel parallel feature rank aggregation algorithm for gene selection applied to microarray data classification
    Longkumer, Imtisenla
    Mazumder, Dilwar Hussain
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2024, 112
  • [33] Cancer Classification from Gene Expression Based Microarray Data Using SVM Ensemble
    Begum, Shemim
    Chakraborty, Debasis
    Sarkar, Ram
    2015 International Conference on Condition Assessment Techniques in Electrical Systems (CATCON), 2015, : 13 - 16
  • [34] Algorithm for Gene Selection from DNA-microarray data for Disease Classification
    Ikumi, Yoshida
    Chakraborty, Goutam
    TENCON 2010: 2010 IEEE REGION 10 CONFERENCE, 2010, : 460 - 465
  • [35] Kernel PCA and SVM-RFE Based Feature Selection for Classification of Dengue Microarray Dataset
    Octaria, Elke Annisa
    Siswantining, Titin
    Bustamam, Alhadi
    Sarwinda, Devvi
    SYMPOSIUM ON BIOMATHEMATICS 2019 (SYMOMATH 2019), 2020, 2264
  • [36] Correlated Based SVM-RFE as Feature Selection for Cancer Classification Using Microarray Databases
    Rustam, Z.
    Maghfirah, N.
    PROCEEDINGS OF THE 3RD INTERNATIONAL SYMPOSIUM ON CURRENT PROGRESS IN MATHEMATICS AND SCIENCES 2017 (ISCPMS2017), 2018, 2023
  • [37] Feature Selection and Classification in gene expression cancer data
    Pavithra, D.
    Lakshmanan, B.
    2017 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN DATA SCIENCE (ICCIDS), 2017,
  • [38] Feature Selection in Breast Cancer Gene Expression Data Using KAO and AOA with SVM Classification
    Yaqoob, Abrar
    Verma, Navneet Kumar
    JOURNAL OF MEDICAL SYSTEMS, 2025, 49 (01)
  • [39] SVM-based local search for gene selection and classification of microarray data
    Hernandez, Jose Crispin Hernandez
    Duval, Beatrice
    Hao, Jin-Kao
    BIOINFORMATICS RESEARCH AND DEVELOPMENT, PROCEEDINGS, 2008, 13 : 499 - 508
  • [40] Feature selection and ranking of key genes for tumor classification: Using microarray gene expression data
    Mukkamala, Srinivas
    Liu, Qingzhong
    Veeraghattam, Rajeev
    Sung, Andrew H.
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING - ICAISC 2006, PROCEEDINGS, 2006, 4029 : 951 - 961