Effective dimension reduction methods for tumor classification using gene expression data

被引:122
|
作者
Antoniadis, A [1 ]
Lambert-Lacroix, S [1 ]
Leblanc, F [1 ]
机构
[1] Univ Grenoble 1, Lab IMAG LMC, F-38041 Grenoble 9, France
关键词
D O I
10.1093/bioinformatics/btg062
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: One particular application of microarray data, is to uncover the molecular variation among cancers. One feature of microarray studies is the fact that the number n of samples collected is relatively small compared to the number p of genes per sample which are usually in the thousands. In statistical terms this very large number of predictors compared to a small number of samples or observations makes the classification problem difficult. An efficient way to solve this problem is by using dimension reduction statistical techniques in conjunction with nonparametric discriminant procedures. Results: We view the classification problem as a regression problem with few observations and many predictor variables. We use an adaptive dimension reduction method for generalized semi-parametric regression models that allows us to solve the 'curse of dimensionality problem' arising in the context of expression data. The predictive performance of the resulting classification rule is illustrated on two well know data sets in the microarray literature: the leukemia data that is known to contain classes that are easy 'separable' and the colon data set.
引用
收藏
页码:563 / 570
页数:8
相关论文
共 50 条
  • [21] BagBoosting for tumor classification with gene expression data
    Dettling, M
    BIOINFORMATICS, 2004, 20 (18) : 3583 - 3593
  • [22] Boosting for tumor classification with gene expression data
    Dettling, M
    Bühlmann, P
    BIOINFORMATICS, 2003, 19 (09) : 1061 - 1069
  • [23] A survey of methods for classification of gene expression data using evolutionary algorithms
    Wahde, M
    Szallasi, Z
    EXPERT REVIEW OF MOLECULAR DIAGNOSTICS, 2006, 6 (01) : 101 - 110
  • [24] Comparison of discrimination methods for the classification of tumors using gene expression data
    Dudoit, S
    Fridlyand, J
    Speed, TP
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2002, 97 (457) : 77 - 87
  • [25] Dimension Reduction and Classifier-Based Feature Selection for Oversampled Gene Expression Data and Cancer Classification
    Petinrin, Olutomilayo Olayemi
    Saeed, Faisal
    Salim, Naomie
    Toseef, Muhammad
    Liu, Zhe
    Muyide, Ibukun Omotayo
    PROCESSES, 2023, 11 (07)
  • [26] EEG Feature Extraction and Classification Using Data Dimension Reduction
    Park, So-Youn
    Lee, Ju-Jang
    2008 6TH IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS, VOLS 1-3, 2008, : 328 - 331
  • [27] Using fuzzy kernel discriminant analysis for tumor classification with gene expression data
    Zhou, Xiaoyan
    Zheng, Wenming
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2007, 35 (SUPPL. 1): : 173 - 176
  • [28] Deep Learning Based Tumor Type Classification Using Gene Expression Data
    Lyu, Boyu
    Haque, Anamul
    ACM-BCB'18: PROCEEDINGS OF THE 2018 ACM INTERNATIONAL CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS, 2018, : 89 - 96
  • [29] A supervised orthogonal discriminant projection for tumor classification using gene expression data
    Zhang, Chuanlei
    Zhang, Shanwen
    COMPUTERS IN BIOLOGY AND MEDICINE, 2013, 43 (05) : 568 - 575
  • [30] Tumor classification by partial least squares using microarray gene expression data
    Nguyen, DV
    Rocke, DM
    BIOINFORMATICS, 2002, 18 (01) : 39 - 50