A two-sample Bayesian t-test for microarray data

被引:65
作者
Fox, RJ [1 ]
Dimmic, MW
机构
[1] Codexis Inc, Redwood City, CA 94063 USA
[2] Cornell Univ, Dept Biol Stat & Computat Biol, Ithaca, NY 14853 USA
[3] Divergence Inc, St Louis, MO 63141 USA
关键词
D O I
10.1186/1471-2105-7-126
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Determining whether a gene is differentially expressed in two different samples remains an important statistical problem. Prior work in this area has featured the use of t-tests with pooled estimates of the sample variance based on similarly expressed genes. These methods do not display consistent behavior across the entire range of pooling and can be biased when the prior hyperparameters are specified heuristically. Results: A two-sample Bayesian t-test is proposed for use in determining whether a gene is differentially expressed in two different samples. The test method is an extension of earlier work that made use of point estimates for the variance. The method proposed here explicitly calculates in analytic form the marginal distribution for the difference in the mean expression of two samples, obviating the need for point estimates of the variance without recourse to posterior simulation. The prior distribution involves a single hyperparameter that can be calculated in a statistically rigorous manner, making clear the connection between the prior degrees of freedom and prior variance. Conclusion: The test is easy to understand and implement and application to both real and simulated data shows that the method has equal or greater power compared to the previous method and demonstrates consistent Type I error rates. The test is generally applicable outside the microarray field to any situation where prior information about the variance is available and is not limited to cases where estimates of the variance are based on many similar observations.
引用
收藏
页数:11
相关论文
共 31 条
[1]  
[Anonymous], 2002, Probability and Statistics
[2]   Global gene expression profiling in Escherichia coli K12 -: The effects of integration host factor [J].
Arfin, SM ;
Long, AD ;
Ito, ET ;
Tolleri, L ;
Riehle, MM ;
Paegle, ES ;
Hatfield, GW .
JOURNAL OF BIOLOGICAL CHEMISTRY, 2000, 275 (38) :29672-29684
[3]   A Bayesian framework for the analysis of microarray expression data: regularized t-test and statistical inferences of gene changes [J].
Baldi, P ;
Long, AD .
BIOINFORMATICS, 2001, 17 (06) :509-519
[4]  
Chen Y, 1997, J Biomed Opt, V2, P364, DOI 10.1117/12.281504
[5]   Statistical tests for differential expression in cDNA microarray experiments [J].
Cui, XQ ;
Churchill, GA .
GENOME BIOLOGY, 2003, 4 (04)
[6]   VarMixt: efficient variance modelling for the differential analysis of replicated gene expression data [J].
Delmar, P ;
Robin, S ;
Daudin, JJ .
BIOINFORMATICS, 2005, 21 (04) :502-508
[7]  
Durbin B P, 2002, Bioinformatics, V18 Suppl 1, pS105
[8]  
FOX RJ, 2006, BAYESIAN 2 SAMPLE T
[9]   Statistical analysis of microarray data: a Bayesian approach [J].
Gottardo, R ;
Pannucci, JA ;
Kuske, CR ;
Brettin, T .
BIOSTATISTICS, 2003, 4 (04) :597-620
[10]   Induced gene expression in human brain after the split from chimpanzee [J].
Gu, JY ;
Gu, X .
TRENDS IN GENETICS, 2003, 19 (02) :63-65