Resampling-based multiple testing for microarray data analysis

被引:231
作者
Ge, YC
Dudoit, S
Speed, TP
机构
[1] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA
[2] Univ Calif Berkeley, Div Biostat, Berkeley, CA 94720 USA
[3] Walter & Eliza Hall Inst Med Res, Div Genet & Bioinformat, Parkville, Vic, Australia
关键词
multiple testing; family-wise error rate; false discovery rate; adjusted p-value; fast algorithm; minP; microarray;
D O I
10.1007/BF02595811
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The burgeoning field of genomics has revived interest in multiple testing procedures by raising new methodological and computational challenges. For example, microarray experiments generate large multiplicity problems in which thousands of hypotheses are tested simultaneously. Westfall and Young (1993) propose resampling-based p-value adjustment procedures which are highly relevant to microarray experiments. This article discusses different criteria for error control in resampling-based multiple testing, including (a) the family wise error rate of Westfall and Young (1993) and (b) the false discovery rate developed by Benjamini and Hochberg (1995), both from a frequentist viewpoint; and (c) the positive false discovery rate of Storey (2002a), which has a Bayesian motivation. We also introduce our recently developed fast algorithm for implementing the minP adjustment to control family-wise error rate. Adjusted p-values for different approaches are applied to gene expression data from two recently published microarray studies. The properties of these procedures for multiple testing are compared.
引用
收藏
页码:1 / 77
页数:77
相关论文
共 62 条
  • [1] Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling
    Alizadeh, AA
    Eisen, MB
    Davis, RE
    Ma, C
    Lossos, IS
    Rosenwald, A
    Boldrick, JG
    Sabet, H
    Tran, T
    Yu, X
    Powell, JI
    Yang, LM
    Marti, GE
    Moore, T
    Hudson, J
    Lu, LS
    Lewis, DB
    Tibshirani, R
    Sherlock, G
    Chan, WC
    Greiner, TC
    Weisenburger, DD
    Armitage, JO
    Warnke, R
    Levy, R
    Wilson, W
    Grever, MR
    Byrd, JC
    Botstein, D
    Brown, PO
    Staudt, LM
    [J]. NATURE, 2000, 403 (6769) : 503 - 511
  • [2] Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays
    Alon, U
    Barkai, N
    Notterman, DA
    Gish, K
    Ybarra, S
    Mack, D
    Levine, AJ
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (12) : 6745 - 6750
  • [3] [Anonymous], 1993, Resampling-based multiple testing: Examples and methods for P-value adjustment
  • [4] Benjamini Y, 2001, ANN STAT, V29, P1165
  • [5] Benjamini Y, 2002, ANN STAT, V30, P1576
  • [6] On the adaptive control of the false discovery fate in multiple testing with independent statistics
    Benjamini, Y
    Hochberg, Y
    [J]. JOURNAL OF EDUCATIONAL AND BEHAVIORAL STATISTICS, 2000, 25 (01) : 60 - 83
  • [7] CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING
    BENJAMINI, Y
    HOCHBERG, Y
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) : 289 - 300
  • [8] BALANCED SIMULTANEOUS CONFIDENCE SETS
    BERAN, R
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1988, 83 (403) : 679 - 686
  • [9] Stereotyped and specific gene expression programs in human innate immune responses to bacteria
    Boldrick, JC
    Alizadeh, AA
    Diehn, M
    Dudoit, S
    Liu, CL
    Belcher, CE
    Botstein, D
    Staudt, LM
    Brown, PO
    Relman, DA
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (02) : 972 - 977
  • [10] Buckley M.J., 2000, SPOT USERS GUIDE