Computational selection of distinct class- and subclass-specific gene expression signatures

被引:29
作者
Bushel, PR
Hamadeh, HK
Bennett, L
Green, J
Ableson, A
Misener, S
Afshari, CA
Paules, RS
机构
[1] NIEHS, Res Triangle Pk, NC 27709 USA
[2] Mol Min Corp, Kingston, ON K7L 2Y4, Canada
关键词
clustering; classification; microarray; ANOVA; LDA; gene expression; pattern recognition; bioinformatics;
D O I
10.1016/S1532-0464(02)00525-7
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this investigation we used statistical methods to select genes with expression profiles that partition classes and subclasses of biological samples. Gene expression data corresponding to liver samples from rats treated for 24 h with an enzyme inducer (phenobarbital) or a peroxisome proliferator (clofibrate, gemfibrozil or Wyeth 14,643) were subjected to a modified Z-score test to identify gene outliers and a binomial distribution to reduce the probability of detecting genes as differentially expressed by chance. Hierarchical clustering of 238 statistically valid differentially expressed genes partitioned class-specific gene expression signatures into groups that clustered samples exposed to the enzyme inducer or to peroxisome proliferators. Using analysis of variance (ANOVA) and linear discriminant analysis methods we identified single genes as well as coupled gene expression profiles that separated the phenobarbital from the peroxisome proliferator treated samples and discerned the fibrate (gemfibrozil and clofibrate) subclass of peroxisome proliferators. A comparison of genes ranked by ANOVA with genes assessed as significant by mixed linear models analysis [J. Comput. Biol. 8 (2001) 625] or ranked by information gain revealed good congruence with the top 10 genes from each statistical method in the contrast between phenobarbital and peroxisome proliferators expression profiles. We propose building upon a classification regimen comprised of analysis of replicate data, outlier diagnostics and gene selection procedures to utilize cDNA microarray data to categorize subclasses of samples exposed to pharmacologic agents. (C) 2002 Elsevier Science (USA). All rights reserved.
引用
收藏
页码:160 / 170
页数:11
相关论文
共 34 条
  • [1] Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays
    Alon, U
    Barkai, N
    Notterman, DA
    Gish, K
    Ybarra, S
    Mack, D
    Levine, AJ
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (12) : 6745 - 6750
  • [2] [Anonymous], 2001, PATTERN RECOGNITION
  • [3] Bo TH, 2002, GENOME BIOL, V3
  • [4] Toxicogenomics-based discrimination of toxic mechanism in HepG2 human hepatoma cells
    Burczynski, ME
    McMillian, M
    Ciervo, J
    Li, L
    Parker, JB
    Dunn, RT
    Hicken, S
    Farr, S
    Johnson, MD
    [J]. TOXICOLOGICAL SCIENCES, 2000, 58 (02) : 399 - 415
  • [5] MAPS: a microarray project system for gene expression experiment information and data validation
    Bushel, PR
    Hamadeh, H
    Bennett, L
    Sieber, S
    Martin, K
    Nuwaysir, EF
    Johnson, K
    Reynolds, K
    Paules, RS
    Afshari, CA
    [J]. BIOINFORMATICS, 2001, 17 (06) : 564 - 565
  • [6] Microarray expression profiling identifies genes with altered expression in HDL-deficient mice
    Callow, MJ
    Dudoit, S
    Gong, EL
    Speed, TP
    Rubin, EM
    [J]. GENOME RESEARCH, 2000, 10 (12) : 2022 - 2029
  • [7] Casella G., 2021, STAT INFERENCE
  • [8] Chen Y, 1997, J Biomed Opt, V2, P364, DOI 10.1117/12.281504
  • [9] A systematic statistical linear modeling approach to oligonucleotide array experiments
    Chu, TM
    Weir, B
    Wolfinger, R
    [J]. MATHEMATICAL BIOSCIENCES, 2002, 176 (01) : 35 - 51
  • [10] Cluster analysis and display of genome-wide expression patterns
    Eisen, MB
    Spellman, PT
    Brown, PO
    Botstein, D
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (25) : 14863 - 14868