New Gene Selection Method Using Gene Expression Programing Approach on Microarray Data Sets

被引:6
|
作者
Alanni, Russul [1 ]
Hou, Jingyu [1 ]
Azzawi, Hasseeb [1 ]
Xiang, Yong [1 ]
机构
[1] Deakin Univ, Sch Informat Technol, Geelong, Vic, Australia
关键词
Feature selection; Gain ratio (GR); Gene expression programming (GEP); Support vector machine (SVM); PARTICLE SWARM OPTIMIZATION; MOLECULAR CLASSIFICATION; CANCER; PREDICTION; CARCINOMAS;
D O I
10.1007/978-3-319-98693-7_2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection in machine learning and data mining facilitates the optimization of accuracy attained from the classifier with smallest number of features. The use of feature selection in microarray data mining is quite promising. However, usually it is hard to identify and select the feature genes from microarray data sets because multi-class categories and high dimensionality features exist in microarray data with a small-sized sample. Therefore, using good selection approaches to eliminate incomprehensibility and optimize prediction accuracy is becoming necessary, because it will help obtain genes that are relevant to sample classification when investigating large number of genes. In his paper, we propose a new feature selection method for microarray data sets. The method consists of the Gain Ratio (GR) and Improved Gene Expression Programming (IGEP) algorithms which are for gene filtering and feature selection respectively. Support Vector Machine (SVM) alongside with leave-one-out cross-validation (LOOCV) method was used to evaluate the proposed method on eight microarray datasets captured in the literature. The experimental results showed the effectiveness of the proposed method in selecting small number of features while generating higher classification accuracies compared with other existing feature selection approaches.
引用
收藏
页码:17 / 31
页数:15
相关论文
共 50 条
  • [41] Gene selection and gene identification in Microarray data analysis
    Chen, J. J.
    Zou, W.
    Chang, C-W
    Morris, S. M.
    ENVIRONMENTAL AND MOLECULAR MUTAGENESIS, 2008, 49 (07) : 558 - 558
  • [42] A gene selection method for microarray data based on risk genes
    Wong, Tzu-Tsung
    Chen, Ding-Qun
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (11) : 14065 - 14071
  • [43] Spatial clustering based gene selection for gene expression analysis in microarray data classification
    Dhas, P. Edwin
    Lalitha, S.
    Govindaraj, Annalakshmi
    Jyoshna, B.
    AUTOMATIKA, 2024, 65 (01) : 152 - 158
  • [44] A novel aggregate gene selection method for microarray data classification
    Thanh Nguyen
    Khosravi, Abbas
    Creighton, Douglas
    Nahavandi, Saeid
    PATTERN RECOGNITION LETTERS, 2015, 60-61 : 16 - 23
  • [45] Improving feature subset selection using a genetic algorithm for microarray gene expression data
    Tan, Feng
    Fu, Xuezheng
    Zhang, Yanqing
    Bourgeois, Anu G.
    2006 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-6, 2006, : 2514 - 2519
  • [46] Analysis of Microarray Gene Expression Data Using Various Feature Selection and Classification Techniques
    Singh, W. Jai
    Kavitha, R. K.
    BIOSCIENCE BIOTECHNOLOGY RESEARCH COMMUNICATIONS, 2020, 13 (11): : 105 - 108
  • [47] Hybrid feature selection using micro genetic algorithm on microarray gene expression data
    Pragadeesh, C.
    Jeyaraj, Rohana
    Siranjeevi, K.
    Abishek, R.
    Jeyakumar, G.
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 36 (03) : 2241 - 2246
  • [48] Impact of Feature Selection on Support Vector Machine Using Microarray Gene Expression Data
    Wahid, Choudhury Muhammad Mufassil
    Ali, A. B. M. Shawkat
    Tickle, Kevin
    2009 SECOND INTERNATIONAL CONFERENCE ON MACHINE VISION, PROCEEDINGS, ( ICMV 2009), 2009, : 189 - 193
  • [49] Gene selection and ranking with microarray data
    Hero, AO
    SEVENTH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOL 1, PROCEEDINGS, 2003, : 457 - 464
  • [50] A Novel Information Theoretic Approach to Gene Selection for Cancer Classification Using Microarray Data
    Naseem, Imran
    Togneri, Roberto
    Bennamoun, Mohammed
    CURRENT BIOINFORMATICS, 2015, 10 (04) : 431 - 440