A two-stage gene selection method for biomarker discovery from microarray data for cancer classification

被引:48
|
作者
Shukla, Alok Kumar [1 ]
Singh, Pradeep [1 ]
Vardhan, Manu [1 ]
机构
[1] NIT, Dept Comp Sci & Engn, Raipur 492010, Madhya Pradesh, India
关键词
Accuracy; Bioinformatics; Extreme learning machine; Gene selection; Support vector machine; OPTIMAL ENSEMBLE CLASSIFIERS; PARTICLE SWARM OPTIMIZATION; OUTCOME PREDICTION; NEURAL-NETWORK; ALGORITHM; INFORMATION; SEARCH; DISCRIMINATION; PROBABILITIES; CROSSOVER;
D O I
10.1016/j.chemolab.2018.10.009
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The microarrays permit experts to monitor the gene profiling for thousands of genes across an array of cellular responses, phenotype, and circumstances. Selecting a tiny subset of discriminate genes (biomarkers) from high dimensional data is one of the most significant tasks in bioinformatics. In this article, we develop a new hybrid framework by combining CMIM and AGA called CMIMAGA that can help to determine the significant biomarkers from the gene expression data. In the proposed approach, CMIM applied as a filter which is easy to understand and filter out most of the meaningless genes, and wrapper method as adaptive genetic algorithm (AGA) is employed to select the highly discriminating genes for distinguishing of instances from the reduced datasets. The AGA method uses the classifiers as a fitness function to select the relevant genes and to classify the tumour and cancer correctly. The performance of proposed approach is evaluated over six widely used microarray datasets using three classifiers, namely Extreme Learning Machine (ELM), Support Vector Machine (SVM), and k-nearest neighbor (k-NN). The experimental results reveal that our approach with ELM achieves the goal of better classification accuracy with a minimum number of genes and outperform to other filter and wrapper approaches.
引用
收藏
页码:47 / 58
页数:12
相关论文
共 50 条
  • [2] An Efficient Two-Stage Gene Selection Method for Microarray Data
    Du, Dajun
    Li, Kang
    Deng, Jing
    INTELLIGENT COMPUTING FOR SUSTAINABLE ENERGY AND ENVIRONMENT, 2013, 355 : 424 - 432
  • [3] Two-stage gene selection for support vector machine classification of microarray data
    Xia, Xiao-Lei
    Li, Kang
    Irwin, George W.
    INTERNATIONAL JOURNAL OF MODELLING IDENTIFICATION AND CONTROL, 2009, 8 (02) : 164 - 171
  • [4] A Hybrid Approach for Biomarker Discovery from Microarray Gene Expression Data for Cancer Classification
    Peng, Yanxiong
    Li, Wenyuan
    Liu, Ying
    CANCER INFORMATICS, 2006, 2 : 301 - 311
  • [5] Two-stage classification methods for microarray data
    Wong, Tzu-Tsung
    Hsu, Ching-Han
    EXPERT SYSTEMS WITH APPLICATIONS, 2008, 34 (01) : 375 - 383
  • [6] A two-stage sparse logistic regression for optimal gene selection in high-dimensional microarray data classification
    Zakariya Yahya Algamal
    Muhammad Hisyam Lee
    Advances in Data Analysis and Classification, 2019, 13 : 753 - 771
  • [7] A two-stage sparse logistic regression for optimal gene selection in high-dimensional microarray data classification
    Algamal, Zakariya Yahya
    Lee, Muhammad Hisyam
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2019, 13 (03) : 753 - 771
  • [8] A Two-Stage Feature Selection Method for Gene Expression Data
    Chuang, Li-Yeh
    Ke, Chao-Hsuan
    Chang, Hsueh-Wei
    Yang, Cheng-Hong
    OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY, 2009, 13 (02) : 127 - 137
  • [9] Gene selection for cancer classification in microarray data
    Zhang, Lijuan
    Li, Zhoujun
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2009, 46 (05): : 794 - 802
  • [10] Informative gene discovery for cancer classification from microarray expression data
    Ng, M
    Chan, LW
    2005 IEEE WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2005, : 393 - 398