Effective dimension reduction methods for tumor classification using gene expression data

被引:122
|
作者
Antoniadis, A [1 ]
Lambert-Lacroix, S [1 ]
Leblanc, F [1 ]
机构
[1] Univ Grenoble 1, Lab IMAG LMC, F-38041 Grenoble 9, France
关键词
D O I
10.1093/bioinformatics/btg062
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: One particular application of microarray data, is to uncover the molecular variation among cancers. One feature of microarray studies is the fact that the number n of samples collected is relatively small compared to the number p of genes per sample which are usually in the thousands. In statistical terms this very large number of predictors compared to a small number of samples or observations makes the classification problem difficult. An efficient way to solve this problem is by using dimension reduction statistical techniques in conjunction with nonparametric discriminant procedures. Results: We view the classification problem as a regression problem with few observations and many predictor variables. We use an adaptive dimension reduction method for generalized semi-parametric regression models that allows us to solve the 'curse of dimensionality problem' arising in the context of expression data. The predictive performance of the resulting classification rule is illustrated on two well know data sets in the microarray literature: the leukemia data that is known to contain classes that are easy 'separable' and the colon data set.
引用
收藏
页码:563 / 570
页数:8
相关论文
共 50 条
  • [1] Effective dimension reduction using sequential projection pursuit on gene expression data for cancer classification
    Webb-Robertson, BJM
    Havre, SL
    METMBS '04: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON MATHEMATICS AND ENGINEERING TECHNIQUES IN MEDICINE AND BIOLOGICAL SCIENCES, 2004, : 345 - 351
  • [2] Dimension reduction for classification with gene expression microarray data
    Dai, Jian J.
    Lieu, Linh
    Rocke, David
    STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2006, 5
  • [3] A framework for significance analysis of gene expression data using dimension reduction methods
    Lars Gidskehaug
    Endre Anderssen
    Arnar Flatberg
    Bjørn K Alsberg
    BMC Bioinformatics, 8
  • [4] A framework for significance analysis of gene expression data using dimension reduction methods
    Gidskehaug, Lars
    Anderssen, Endre
    Flatberg, Arnar
    Alsberg, Bjorn K.
    BMC BIOINFORMATICS, 2007, 8 (1)
  • [5] Importance of data structure in comparing two dimension reduction methods for classification of microarray gene expression data
    Truntzer, Caroline
    Mercier, Catherine
    Esteve, Jacques
    Gautier, Christian
    Roy, Pascal
    BMC BIOINFORMATICS, 2007, 8 (1)
  • [6] Importance of data structure in comparing two dimension reduction methods for classification of microarray gene expression data
    Caroline Truntzer
    Catherine Mercier
    Jacques Estève
    Christian Gautier
    Pascal Roy
    BMC Bioinformatics, 8
  • [7] Dimension reduction of gene expression data
    Lee J.
    Ciccarello S.
    Acharjee M.
    Das K.
    Journal of Statistical Theory and Practice, 2018, 12 (2) : 450 - 461
  • [8] Tumor classification using phylogenetic methods on expression data
    Desper, R
    Khan, J
    Schäffer, AA
    JOURNAL OF THEORETICAL BIOLOGY, 2004, 228 (04) : 477 - 496
  • [9] Dimension reduction with redundant gene elimination for tumor classification
    Zeng, Xue-Qiang
    Li, Guo-Zheng
    Yang, Jack Y.
    Yang, Mary Qu
    Wu, Geng-Feng
    BMC BIOINFORMATICS, 2008, 9 (Suppl 6)
  • [10] Dimension reduction with redundant gene elimination for tumor classification
    Xue-Qiang Zeng
    Guo-Zheng Li
    Jack Y Yang
    Mary Qu Yang
    Geng-Feng Wu
    BMC Bioinformatics, 9