Effects of pooling mRNA in microarray class comparisons

被引:46
作者
Shih, JH [1 ]
Michalowska, AM
Dobbin, K
Ye, YM
Qiu, TH
Green, JE
机构
[1] NCI, Biometr Res Branch, Div Canc Treatment & Diag, Bethesda, MD 20892 USA
[2] NCI, Lab Cell Regulat & Carcinogenesis, Bethesda, MD 20892 USA
关键词
D O I
10.1093/bioinformatics/bth391
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: In microarray experiments investigators sometimes wish to pool RNA samples before labeling and hybridization due to insufficient RNA from each individual sample or to reduce the number of arrays for the purpose of saving cost. The basic assumption of pooling is that the expression of an mRNA molecule in the pool is close to the average expression from individual samples. Recently, a method for studying the effect of pooling mRNA on statistical power in detecting differentially expressed genes between classes has been proposed, but the different sources of variation arising in microarray experiments were not distinguished. Another paper recently did take different sources of variation into account, but did not address power and sample size for class comparison. In this paper, we study the implication of pooling in detecting differential gene expression taking into account different sources of variation and check the basic assumption of pooling using data from both the cDNA and Affymetrix GeneChip microarray experiments. Results: We present formulas for the required number of subjects and arrays to achieve a desired power at a specified significance level. We show that due to the loss of degrees of freedom for a pooled design, a large increase in the number of subjects may be required to achieve a power comparable to that of a non-pooled design. The added expense of additional samples for the pooled design may outweigh the benefit of saving on microarray cost. The microarray data from both platforms show that the major assumption of pooling may not hold.
引用
收藏
页码:3318 / 3325
页数:8
相关论文
共 10 条
[1]   Initiating oncogenic event determines gene-expression patterns of human breast cancer models [J].
Desai, KV ;
Xiao, N ;
Wang, W ;
Gangi, L ;
Greene, J ;
Powell, JI ;
Dickson, R ;
Furth, P ;
Hunter, K ;
Kucherlapati, R ;
Simon, R ;
Liu, ET ;
Green, JE .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (10) :6967-6972
[2]   Questions and answers on design of dual-label microarrays for identifying differentially expressed genes [J].
Dobbin, K ;
Shih, JH ;
Simon, R .
JOURNAL OF THE NATIONAL CANCER INSTITUTE, 2003, 95 (18) :1362-1369
[3]   Statistical design of reverse dye microarrays [J].
Dobbin, K ;
Shih, JH ;
Simon, R .
BIOINFORMATICS, 2003, 19 (07) :803-810
[4]   Comparison of microarray designs for class comparison and class discovery [J].
Dobbin, K ;
Simon, R .
BIOINFORMATICS, 2002, 18 (11) :1438-1445
[5]   POWER AND SAMPLE-SIZE CALCULATIONS - A REVIEW AND COMPUTER-PROGRAM [J].
DUPONT, WD ;
PLUMMER, WD .
CONTROLLED CLINICAL TRIALS, 1990, 11 (02) :116-128
[6]   Summaries of affymetrix GeneChip probe level data [J].
Irizarry, RA ;
Bolstad, BM ;
Collin, F ;
Cope, LM ;
Hobbs, B ;
Speed, TP .
NUCLEIC ACIDS RESEARCH, 2003, 31 (04) :e15
[7]   The efficiency of pooling mRNA in microarray experiments [J].
Kendziorski, CM ;
Zhang, Y ;
Lan, H ;
Attie, AD .
BIOSTATISTICS, 2003, 4 (03) :465-477
[8]   Statistical implications of pooling RNA samples for microarray experiments [J].
Peng, XJ ;
Wood, CL ;
Blalock, EM ;
Chen, KC ;
Landfield, PW ;
Stromberg, AJ .
BMC BIOINFORMATICS, 2003, 4 (1)
[9]  
WU Z, 2004, IN PRESS J AM STAT A
[10]   Normalization for cDNA microarray data: a robust composite method addressing single and multiple slide systematic variation [J].
Yang, YH ;
Dudoit, S ;
Luu, P ;
Lin, DM ;
Peng, V ;
Ngai, J ;
Speed, TP .
NUCLEIC ACIDS RESEARCH, 2002, 30 (04) :e15