Sample size determination in microarray experiments for class comparison and prognostic classification

被引:91
作者
Dobbin, K [1 ]
Simon, R [1 ]
机构
[1] NCI, Biometr Res Branch, Bethesda, MD 20892 USA
关键词
experimental design; gene expression; microarrays; sample size;
D O I
10.1093/biostatistics/kxh015
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Determining sample sizes for microarray experiments is important but the complexity of these experiments, and the large amounts of data they produce, can make the sample size issue seem daunting. and tempt researchers to use rules of thumb in place of formal calculations based on the goals of the experiment. Here we present formulae for determining sample sizes to achieve a variety of experimental goals, including class comparison and the development of prognostic markers. Results are derived which describe the impact of pooling, technical replicates and dye-swap arrays on sample size requirements. These results are shown to depend on the relative sizes of different sources of variability. A variety of common types of experimental situations and designs used with single-label and dual-label microarrays are considered. We discuss procedures for controlling the false discovery rate. Our calculations are based on relatively simple yet realistic statistical models for the data, and provide straightforward sample size calculation formulae.
引用
收藏
页码:27 / 38
页数:12
相关论文
共 21 条
[1]   A Bayesian framework for the analysis of microarray expression data: regularized t-test and statistical inferences of gene changes [J].
Baldi, P ;
Long, AD .
BIOINFORMATICS, 2001, 17 (06) :509-519
[2]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[3]  
Billingsley P., 1986, PROBABILITY MEASURE
[4]   A comparison of normalization methods for high density oligonucleotide array data based on variance and bias [J].
Bolstad, BM ;
Irizarry, RA ;
Åstrand, M ;
Speed, TP .
BIOINFORMATICS, 2003, 19 (02) :185-193
[5]   A systematic statistical linear modeling approach to oligonucleotide array experiments [J].
Chu, TM ;
Weir, B ;
Wolfinger, R .
MATHEMATICAL BIOSCIENCES, 2002, 176 (01) :35-51
[6]  
Cochran W.G., 1992, Experimental designs
[7]   Questions and answers on design of dual-label microarrays for identifying differentially expressed genes [J].
Dobbin, K ;
Shih, JH ;
Simon, R .
JOURNAL OF THE NATIONAL CANCER INSTITUTE, 2003, 95 (18) :1362-1369
[8]   Statistical design of reverse dye microarrays [J].
Dobbin, K ;
Shih, JH ;
Simon, R .
BIOINFORMATICS, 2003, 19 (07) :803-810
[9]   Comparison of microarray designs for class comparison and class discovery [J].
Dobbin, K ;
Simon, R .
BIOINFORMATICS, 2002, 18 (11) :1438-1445
[10]   Empirical Bayes analysis of a microarray experiment [J].
Efron, B ;
Tibshirani, R ;
Storey, JD ;
Tusher, V .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (456) :1151-1160