Power and sample size for DNA microarray studies

被引:120
作者
Lee, MLT
Whitmore, GA
机构
[1] Brigham & Womens Hosp, Dept Med, Boston, MA 02115 USA
[2] Harvard Univ, Sch Med, Boston, MA 02115 USA
[3] Harvard Univ, Sch Publ Hlth, Dept Biostat, Boston, MA 02115 USA
[4] McGill Univ, Montreal, PQ H3A 2T5, Canada
关键词
Bayesian inferences; false discovery rate; family type I error; microarray studies; multiple testing; power and sample size;
D O I
10.1002/sim.1335
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
A microarray study aims at having a high probability of declaring genes to be differentially expressed if they are truly expressed, while keeping the probability of making false declarations of expression acceptably low. Thus, in formal terms, well-designed microarray studies will have high power while controlling type I error risk. Achieving this objective is the purpose of this paper. Here, we discuss conceptual issues and present computational methods for statistical power and sample size in microarray studies, taking account of the multiple testing that is generic to these studies. The discussion encompasses choices of experimental design and replication for a study. Practical examples are used to demonstrate the methods. The examples show forcefully that replication of a microarray experiment can yield large increases in statistical power. The paper refers to cDNA arrays in the discussion and illustrations but the proposed methodology is equally applicable to expression data from oligonucleotide arrays. Copyright (C) 2002 John Wiley Sons, Ltd.
引用
收藏
页码:3543 / 3570
页数:28
相关论文
共 11 条
[1]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[2]  
DELONGCHAMP RR, 2001, HFT20 NAT CTR TOX RE
[3]  
DUDOIT S, 2000, 578 STANF U SCH MED
[4]   Empirical Bayes analysis of a microarray experiment [J].
Efron, B ;
Tibshirani, R ;
Storey, JD ;
Tusher, V .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (456) :1151-1160
[5]  
Kerr M K, 2001, Biostatistics, V2, P183, DOI 10.1093/biostatistics/2.2.183
[6]   Analysis of variance for gene expression microarray data [J].
Kerr, MK ;
Martin, M ;
Churchill, GA .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2000, 7 (06) :819-837
[7]  
Lee Mei-Ling Ting, 2002, J Biopharm Stat, V12, P1, DOI 10.1081/BIP-120005737
[8]   Importance of replication in microarray gene expression studies: Statistical methods and evidence from repetitive cDNA hybridizations [J].
Lee, MLT ;
Kuo, FC ;
Whitmore, GA ;
Sklar, J .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (18) :9834-9839
[9]  
LEE MLT, 2002, IN PRESS J DATA SCI
[10]  
SCHUCHHARDT J, 2000, NUCLEIC ACIDS RES, V28, pE47, DOI DOI 10.1093/NAR/28.10.E47