New Gene Selection Method Using Gene Expression Programing Approach on Microarray Data Sets

被引:6
|
作者
Alanni, Russul [1 ]
Hou, Jingyu [1 ]
Azzawi, Hasseeb [1 ]
Xiang, Yong [1 ]
机构
[1] Deakin Univ, Sch Informat Technol, Geelong, Vic, Australia
关键词
Feature selection; Gain ratio (GR); Gene expression programming (GEP); Support vector machine (SVM); PARTICLE SWARM OPTIMIZATION; MOLECULAR CLASSIFICATION; CANCER; PREDICTION; CARCINOMAS;
D O I
10.1007/978-3-319-98693-7_2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection in machine learning and data mining facilitates the optimization of accuracy attained from the classifier with smallest number of features. The use of feature selection in microarray data mining is quite promising. However, usually it is hard to identify and select the feature genes from microarray data sets because multi-class categories and high dimensionality features exist in microarray data with a small-sized sample. Therefore, using good selection approaches to eliminate incomprehensibility and optimize prediction accuracy is becoming necessary, because it will help obtain genes that are relevant to sample classification when investigating large number of genes. In his paper, we propose a new feature selection method for microarray data sets. The method consists of the Gain Ratio (GR) and Improved Gene Expression Programming (IGEP) algorithms which are for gene filtering and feature selection respectively. Support Vector Machine (SVM) alongside with leave-one-out cross-validation (LOOCV) method was used to evaluate the proposed method on eight microarray datasets captured in the literature. The experimental results showed the effectiveness of the proposed method in selecting small number of features while generating higher classification accuracies compared with other existing feature selection approaches.
引用
收藏
页码:17 / 31
页数:15
相关论文
共 50 条
  • [31] Gene Selection for Microarray Expression Data with Imbalanced Sample Distributions
    Kamal, Abu H. M.
    Zhu, Xingquan
    Narayanan, Ramaswamy
    2009 INTERNATIONAL JOINT CONFERENCE ON BIOINFORMATICS, SYSTEMS BIOLOGY AND INTELLIGENT COMPUTING, PROCEEDINGS, 2009, : 3 - +
  • [32] Unsupervised selection of informative genes in microarray gene expression data
    Liaghat, Samaneh
    Mansoori, Eghbal G.
    INTERNATIONAL JOURNAL OF APPLIED PATTERN RECOGNITION, 2016, 3 (04) : 351 - 367
  • [33] Parsimonious Selection of Useful Genes in Microarray Gene Expression Data
    Gonzalez-Navarro, Felix F.
    Belanche-Munoz, Lluis A.
    SOFTWARE TOOLS AND ALGORITHMS FOR BIOLOGICAL SYSTEMS, 2011, 696 : 45 - 55
  • [34] Quality of feature selection based on microarray gene expression data
    Maciejewski, Henryk
    COMPUTATIONAL SCIENCE - ICCS 2008, PT 3, 2008, 5103 : 140 - 147
  • [35] A Hybrid Feature Selection Method Using Gene Expression Data
    Chuang, Li-Yeh
    Wu, Kuo-Chuan
    Yang, Cheng-Hong
    2009 9TH IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING, 2009, : 100 - +
  • [36] Hybrid Feature Selection Method using Gene Expression Data
    Chuang, Li-Yeh
    Wu, Kuo-Chuan
    Yang, Cheng-Hong
    2008 IEEE CONFERENCE ON SOFT COMPUTING IN INDUSTRIAL APPLICATIONS SMCIA/08, 2009, : 199 - +
  • [37] A Novel BPSO Approach for Gene Selection and Classification of Microarray Data
    Yang, Cheng-San
    Chuang, Li-Yeh
    li, Jung-Chike
    Yang, Cheng-Hong
    2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008, : 2147 - +
  • [38] A Metaheuristic Approach for Simultaneous Gene Selection and Clustering of Microarray Data
    Deepthi, P. S.
    Thampi, Sabu M.
    INTELLIGENT SYSTEMS TECHNOLOGIES AND APPLICATIONS, VOL 2, 2016, 385 : 449 - 461
  • [39] A genetic embedded approach for gene selection and classification of microarray data
    Hernandez, Jose Crispin Hernandez
    Duval, Atrice
    Hao, Jin-Kao
    EVOLUTIONARY COMPUTATION, MACHINE LEARNING AND DATA MINING IN BIOINFORMATICS, PROCEEDINGS, 2007, 4447 : 90 - +
  • [40] Clustering analysis of microarray gene expression data with new clustering ensemble method
    Luo, Fei
    Liu, Juan
    PROGRESS IN INTELLIGENCE COMPUTATION AND APPLICATIONS, PROCEEDINGS, 2007, : 500 - 504