A novel feature extraction approach based on ensemble feature selection and modified discriminant independent component analysis for microarray data classification

被引:44
作者
Mollaee, Maryam [1 ]
Moattar, Mohammad Hossein [2 ]
机构
[1] Islamic Azad Univ, Mashhad Branch, Young Researchers & Elite Club, Mashhad, Iran
[2] Islamic Azad Univ, Mashhad Branch, Dept Software Engn, Mashhad, Iran
关键词
Discriminant independent; component analysis; Feature selection; Microarray classification; Particle swarm optimization; Bayesian logistic regression; PARTICLE SWARM OPTIMIZATION; GENE SELECTION; CANCER CLASSIFICATION; PREDICTION; ALGORITHM; FILTER; TUMOR; PSO;
D O I
10.1016/j.bbe.2016.05.001
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Microarray data play critical role in cancer classification. However, with respect to the samples scarcity compared to intrinsic high dimensionality, most approaches fail to classify small subset of genes. Feature selection techniques can reduce the dimension of the problem, which can reduce computational cost of the microarray data classification. However, previous studies have shown that feature extraction methods can also be useful in improving the performance of data classification. In this paper, we propose an ensemble schema for cancer diagnosis and classification that has three stages. At first, a hybrid filter based feature selection method using modified Bayesian logistic regression (BLogReg), Ttest and Fisher ratio is applied for selecting genes. In the second stage, selected genes are mapped via the proposed PSO-dICA method which is a modification of dICA. Finally, mapped features are classified using SVM classifier. To demonstrate the effectiveness of the proposed method, some traditional microarray data including Colon, Lung cancer, DLBCL, SRBCT, Leukemia-ALL and Prostate Tumor datasets are used. Experimental results show the efficiency and effectiveness of the proposed method. (C) 2016 Nalecz Institute of Biocybemetics and Biomedical Engineering of the Polish Academy of Sciences. Published by Elsevier Sp. z o.o. All rights reserved.
引用
收藏
页码:521 / 529
页数:9
相关论文
共 56 条
  • [1] Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays
    Alon, U
    Barkai, N
    Notterman, DA
    Gish, K
    Ybarra, S
    Mack, D
    Levine, AJ
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (12) : 6745 - 6750
  • [2] Genetic Bee Colony (GBC) algorithm: A new gene selection method for microarray cancer classification
    Alshamlan, Hala M.
    Badr, Ghada H.
    Alohali, Yousef A.
    [J]. COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2015, 56 : 49 - 60
  • [3] [Anonymous], J ELECT
  • [4] MLL translocations specify a distinct gene expression profile that distinguishes a unique leukemia
    Armstrong, SA
    Staunton, JE
    Silverman, LB
    Pieters, R
    de Boer, ML
    Minden, MD
    Sallan, SE
    Lander, ES
    Golub, TR
    Korsmeyer, SJ
    [J]. NATURE GENETICS, 2002, 30 (01) : 41 - 47
  • [5] Classification of human lung carcinomas by mRNA expression profiling reveals distinct adenocarcinoma subclasses
    Bhattacharjee, A
    Richards, WG
    Staunton, J
    Li, C
    Monti, S
    Vasa, P
    Ladd, C
    Beheshti, J
    Bueno, R
    Gillette, M
    Loda, M
    Weber, G
    Mark, EJ
    Lander, ES
    Wong, W
    Johnson, BE
    Golub, TR
    Sugarbaker, DJ
    Meyerson, M
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (24) : 13790 - 13795
  • [6] An ensemble of filters and classifiers for microarray data classification
    Bolon-Canedo, V.
    Sanchez-Marono, N.
    Alonso-Betanzos, A.
    [J]. PATTERN RECOGNITION, 2012, 45 (01) : 531 - 539
  • [7] Burges JC, 1999, TUTORIAL SUPPORT VEC
  • [8] ITERATIVE FEATURE PERTURBATION AS A GENE SELECTOR FOR MICROARRAY DATA
    Canul-Reich, Juana
    Hall, Lawrence O.
    Goldgof, Dmitry B.
    Korecki, John N.
    Eschrich, Steven
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2012, 26 (05)
  • [9] Gene selection in cancer classification using sparse logistic regression with Bayesian regularization
    Cawley, Gavin C.
    Talbot, Nicola L. C.
    [J]. BIOINFORMATICS, 2006, 22 (19) : 2348 - 2355
  • [10] Applying particle swarm optimization-based decision tree classifier for cancer classification on gene expression data
    Chen, Kun-Huang
    Wang, Kung-Jeng
    Wang, Kung-Min
    Angelia, Melani-Adrian
    [J]. APPLIED SOFT COMPUTING, 2014, 24 : 773 - 780