Bayesian meta-analysis models for microarray data: a comparative study

被引:40
作者
Conlon, Erin M. [1 ]
Song, Joon J.
Liu, Anna
机构
[1] Univ Massachusetts, Dept Math & Stat, Amherst, MA 01003 USA
[2] Univ Arkansas, Dept Math, Fayetteville, AR 72701 USA
关键词
D O I
10.1186/1471-2105-8-80
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: With the growing abundance of microarray data, statistical methods are increasingly needed to integrate results across studies. Two common approaches for meta-analysis of microarrays include either combining gene expression measures across studies or combining summaries such as p-values, probabilities or ranks. Here, we compare two Bayesian meta-analysis models that are analogous to these methods. Results: Two Bayesian meta-analysis models for microarray data have recently been introduced. The first model combines standardized gene expression measures across studies into an overall mean, accounting for inter-study variability, while the second combines probabilities of differential expression without combining expression values. Both models produce the gene-specific posterior probability of differential expression, which is the basis for inference. Since the standardized expression integration model includes inter-study variability, it may improve accuracy of results versus the probability integration model. However, due to the small number of studies typical in microarray meta-analyses, the variability between studies is challenging to estimate. The probability integration model eliminates the need to model variability between studies, and thus its implementation is more straightforward. We found in simulations of two and five studies that combining probabilities outperformed combining standardized gene expression measures for three comparison values: the percent of true discovered genes in meta-analysis versus individual studies; the percent of true genes omitted in meta-analysis versus separate studies, and the number of true discovered genes for fixed levels of Bayesian false discovery. We identified similar results when pooling two independent studies of Bacillus subtilis. We assumed that each study was produced from the same microarray platform with only two conditions: a treatment and control, and that the data sets were pre-scaled. Conclusion: The Bayesian meta-analysis model that combines probabilities across studies does not aggregate gene expression measures, thus an inter-study variability parameter is not included in the model. This results in a simpler modeling approach than aggregating expression measures, which accounts for variability across studies. The probability integration model identified more true discovered genes and fewer true omitted genes than combining expression measures, for our data sets.
引用
收藏
页数:21
相关论文
共 63 条
[1]   A Bayesian framework for the analysis of microarray expression data: regularized t-test and statistical inferences of gene changes [J].
Baldi, P ;
Long, AD .
BIOINFORMATICS, 2001, 17 (06) :509-519
[2]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[3]   Bayesian hierarchical model for identifying changes in gene expression from microarray experiments [J].
Broët, P ;
Richardson, S ;
Radvanyi, F .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2002, 9 (04) :671-683
[4]   Combining multiple microarray studies and modeling interstudy variation [J].
Choi, Jung Kyoon ;
Yu, Ungsik ;
Kim, Sangsoo ;
Yoo, Ook Joon .
BIOINFORMATICS, 2003, 19 :i84-i90
[5]   Determining and analyzing differentially expressed genes from cDNA microarray experiments with complementary designs [J].
Conlon, EM ;
Eichenberger, P ;
Liu, JS .
JOURNAL OF MULTIVARIATE ANALYSIS, 2004, 90 (01) :1-18
[6]   Bayesian models for pooling microarray studies with multiple sources of replications [J].
Conlon, Erin M. ;
Song, Joon J. ;
Liu, Jun S. .
BMC BIOINFORMATICS, 2006, 7 (1)
[7]   A Bayesian mixture model for differential gene expression [J].
Do, KA ;
Müller, P ;
Tang, F .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2005, 54 :627-644
[8]  
DOMINICI F, 2000, METAANALYSIS MED HLT, P105
[9]  
Dudoit S, 2002, STAT SINICA, V12, P111
[10]   BAYES METHODS FOR COMBINING THE RESULTS OF CANCER STUDIES IN HUMANS AND OTHER SPECIES [J].
DUMOUCHEL, WH ;
HARRIS, JE .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1983, 78 (382) :293-308