Multiple hypothesis testing in microarray experiments

被引:654
作者
Dudoit, S [1 ]
Shaffer, JP
Boldrick, JC
机构
[1] Univ Calif Berkeley, Sch Publ Hlth, Div Biostat, Berkeley, CA 94720 USA
[2] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA
[3] Stanford Univ, Dept Microbiol & Immunol, Stanford, CA 94305 USA
关键词
multiple hypothesis testing; adjusted p-value; family-wise Type I error rate; false discovery rate; permutation; DNA microarray;
D O I
10.1214/ss/1056397487
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
DNA microarrays are part of a new and promising class of biotechnologies that allow the monitoring of expression levels in cells for thousands of genes simultaneously. An important and common question in DNA microarray experiments is the identification of differentially expressed genes, that is, genes whose expression levels are associated with a response or covariate of interest. The biological question of differential expression can be restated as a problem in multiple hypothesis testing: the simultaneous test for each gene of the null hypothesis of no association between the expression levels and the responses or covariates. As a typical microarray experiment measures expression levels for thousands of genes simultaneously, large multiplicity problems are generated. This article discusses different approaches to multiple hypothesis testing in the context of DNA microarray experiments and compares the procedures on microarray and simulated data sets.
引用
收藏
页码:71 / 103
页数:33
相关论文
共 68 条
  • [1] Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling
    Alizadeh, AA
    Eisen, MB
    Davis, RE
    Ma, C
    Lossos, IS
    Rosenwald, A
    Boldrick, JG
    Sabet, H
    Tran, T
    Yu, X
    Powell, JI
    Yang, LM
    Marti, GE
    Moore, T
    Hudson, J
    Lu, LS
    Lewis, DB
    Tibshirani, R
    Sherlock, G
    Chan, WC
    Greiner, TC
    Weisenburger, DD
    Armitage, JO
    Warnke, R
    Levy, R
    Wilson, W
    Grever, MR
    Byrd, JC
    Botstein, D
    Brown, PO
    Staudt, LM
    [J]. NATURE, 2000, 403 (6769) : 503 - 511
  • [2] Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays
    Alon, U
    Barkai, N
    Notterman, DA
    Gish, K
    Ybarra, S
    Mack, D
    Levine, AJ
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (12) : 6745 - 6750
  • [3] [Anonymous], 1993, Resampling-based multiple testing: Examples and methods for P-value adjustment
  • [4] [Anonymous], BIOSTATISTICAL METHO
  • [5] Benjamini Y, 2001, ANN STAT, V29, P1165
  • [6] CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING
    BENJAMINI, Y
    HOCHBERG, Y
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) : 289 - 300
  • [7] BALANCED SIMULTANEOUS CONFIDENCE SETS
    BERAN, R
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1988, 83 (403) : 679 - 686
  • [8] Stereotyped and specific gene expression programs in human innate immune responses to bacteria
    Boldrick, JC
    Alizadeh, AA
    Diehn, M
    Dudoit, S
    Liu, CL
    Belcher, CE
    Botstein, D
    Staudt, LM
    Brown, PO
    Relman, DA
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (02) : 972 - 977
  • [9] SPLITTING TAILS UNEQUALLY - NEW PERSPECTIVE ON ONE-TAILED VERSUS 2-TAILED TESTS
    BRAVER, SL
    [J]. EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1975, 35 (02) : 283 - 301
  • [10] Exploring the new world of the genome with DNA microarrays
    Brown, PO
    Botstein, D
    [J]. NATURE GENETICS, 1999, 21 (Suppl 1) : 33 - 37