Effective dimension reduction methods for tumor classification using gene expression data

Cited by: 122
Authors
Antoniadis, A [1]
Lambert-Lacroix, S [1]
Leblanc, F [1]
Affiliations
[1] Univ Grenoble 1, Lab IMAG LMC, F-38041 Grenoble 9, France
DOI
10.1093/bioinformatics/btg062
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Motivation: One particular application of microarray data is to uncover the molecular variation among cancers. A common feature of microarray studies is that the number n of samples collected is small relative to the number p of genes per sample, which is usually in the thousands. In statistical terms, this very large number of predictors compared to a small number of samples or observations makes the classification problem difficult. An efficient way to solve this problem is to use statistical dimension reduction techniques in conjunction with nonparametric discriminant procedures.
Results: We view the classification problem as a regression problem with few observations and many predictor variables. We use an adaptive dimension reduction method for generalized semi-parametric regression models that allows us to solve the 'curse of dimensionality' problem arising in the context of expression data. The predictive performance of the resulting classification rule is illustrated on two well-known data sets in the microarray literature: the leukemia data, which is known to contain classes that are easily separable, and the colon data set.
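The general recipe in the abstract (reduce the dimension first, then apply a nonparametric discriminant rule) can be illustrated with a minimal sketch. This is not the authors' adaptive method for semi-parametric regression models: as a stand-in, the reduction here is a single direction proportional to the centered X^T y (equivalently, the difference of the two class means), and the discriminant rule is a k-nearest-neighbour vote on the one-dimensional scores. The data are synthetic, and every name and parameter (`make_data`, `fit_direction`, `knn_predict`, `shift`, `k`) is a hypothetical choice made for illustration.

```python
import random

def make_data(n, p, n_informative, shift, rng):
    """Synthetic 'expression' matrix: n samples, p genes, two balanced classes."""
    X, y = [], []
    for i in range(n):
        label = i % 2
        row = [rng.gauss(0.0, 1.0) for _ in range(p)]
        # Only the first n_informative genes differ between the classes.
        for j in range(n_informative):
            row[j] += shift * label
        X.append(row)
        y.append(label)
    return X, y

def fit_direction(X, y):
    """Reduce p genes to one unit-norm direction, w proportional to centered X^T y."""
    n, p = len(X), len(X[0])
    mean = [sum(row[j] for row in X) / n for j in range(p)]
    ybar = sum(y) / n
    w = [sum((X[i][j] - mean[j]) * (y[i] - ybar) for i in range(n))
         for j in range(p)]
    norm = sum(v * v for v in w) ** 0.5
    return mean, [v / norm for v in w]

def project(x, mean, w):
    """Score of one sample along the fitted direction."""
    return sum((xj - mj) * wj for xj, mj, wj in zip(x, mean, w))

def knn_predict(z, train_scores, train_labels, k=3):
    """Nonparametric rule: majority vote among the k nearest training scores."""
    nearest = sorted(zip(train_scores, train_labels),
                     key=lambda t: abs(t[0] - z))[:k]
    votes = sum(label for _, label in nearest)
    return 1 if 2 * votes > k else 0

rng = random.Random(0)
# Assumed sizes: far fewer samples (n = 40) than genes (p = 500).
X_train, y_train = make_data(n=40, p=500, n_informative=20, shift=2.0, rng=rng)
X_test, y_test = make_data(n=20, p=500, n_informative=20, shift=2.0, rng=rng)

mean, w = fit_direction(X_train, y_train)
train_scores = [project(x, mean, w) for x in X_train]
pred = [knn_predict(project(x, mean, w), train_scores, y_train) for x in X_test]
accuracy = sum(int(a == b) for a, b in zip(pred, y_test)) / len(y_test)
print(f"test accuracy: {accuracy:.2f}")
```

The direction is fitted on the training samples only and then reused for the test samples, so the reported accuracy is not inflated by fitting the reduction on the data it is evaluated on.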
Pages: 563-570
Number of pages: 8
Related papers
50 in total
  • [41] Computational methods for gene expression-based tumor classification
    Xiong, MM
    Jin, L
    Li, WJ
    Boerwinkle, E
    BIOTECHNIQUES, 2000, 29 (06) : 1264 - +
  • [42] Computational methods for gene expression based tumor classification.
    Li, W
    Xiong, M
    AMERICAN JOURNAL OF HUMAN GENETICS, 2000, 67 (04) : 78 - 78
  • [43] Dimension reduction strategies for analyzing global gene expression data with a response
    Chiaromonte, F
    Martinelli, J
    MATHEMATICAL BIOSCIENCES, 2002, 176 (01) : 123 - 144
  • [44] An Effective Dimension Reduction Approach to Chinese Document Classification Using Genetic Algorithm
    Guo, Zhishan
    Lu, Li
    Xi, Shijia
    Sun, Fuchun
    ADVANCES IN NEURAL NETWORKS - ISNN 2009, PT 2, PROCEEDINGS, 2009, 5552 : 480 - 489
  • [45] Cancer classification using gene expression data
    Lu, Y
    Han, JW
    INFORMATION SYSTEMS, 2003, 28 (04) : 243 - 268
  • [46] Cancer Classification Using Gene Expression Data
    Sonsare, Pravinkumar
    Mujumdar, Aarya
    Joshi, Pranjali
    Morayya, Nipun
    Hablani, Sachal
    Khergade, Vedant
    SMART TRENDS IN COMPUTING AND COMMUNICATIONS, VOL 1, SMARTCOM 2024, 2024, 945 : 1 - 11
  • [47] Classification of normal and tumor tissues using geometric representation of gene expression microarray data
    Kim, Saejoon
    Shin, Donghyuk
    MODELING DECISIONS FOR ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2007, 4617 : 393 - +
  • [48] A Novel Hybrid Dimension Reduction Technique for Undersized High Dimensional Gene Expression Data Sets Using Information Complexity Criterion for Cancer Classification
    Pamukcu, Esra
    Bozdogan, Hamparsum
    Caljk, Sinan
    COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2015, 2015
  • [49] A Comparative Study of Two Multiple Classification Methods Based on Partial Least Squares Using Tumor Microarray Gene Expression Data
    Jin Zhichao
    Gao Qingbin
    He Jia
    COMPREHENSIVE EVALUATION OF ECONOMY AND SOCIETY WITH STATISTICAL SCIENCE, 2009, : 1212 - 1222
  • [50] Classification of Microarray Gene Expression Data using Associative Classification
    Alagukumar, S.
    Lawrance, R.
    2016 INTERNATIONAL CONFERENCE ON COMPUTING TECHNOLOGIES AND INTELLIGENT DATA ENGINEERING (ICCTIDE'16), 2016,