Fully bayesian mixture model for differential gene expression: Simulations and model checks

被引:22
作者
Lewin, Alex [1 ]
Bochkina, Natalia [2 ]
Richardson, Sylvia [1 ]
机构
[1] Univ London Imperial Coll Sci Technol & Med, London SW7 2AZ, England
[2] Univ Edinburgh, Edinburgh EH8 9YL, Midlothian, Scotland
基金
英国惠康基金; 英国生物技术与生命科学研究理事会;
关键词
microarray; mixture model; predictive checks; Bayesian analysis; MCMC;
D O I
10.2202/1544-6115.1314
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
We present a Bayesian hierarchical model for detecting differentially expressed genes using a mixture prior on the parameters representing differential effects. We formulate an easily interpretable 3-component mixture to classify genes as over-expressed, under-expressed and non-differentially expressed, and model gene variances as exchangeable to allow for variability between genes. We show how the proportion of differentially expressed genes, and the mixture parameters, can be estimated in a fully Bayesian way, extending previous approaches where this proportion was fixed and empirically estimated. Good estimates of the false discovery rates are also obtained. Different parametric families for the mixture components can lead to quite different classifications of genes for a given data set. Using Affymetrix data from a knock out and wildtype mice experiment, we show how predictive model checks can be used to guide the choice between possible mixture priors. These checks show that extending the mixture model to allow extra variability around zero instead of the usual point mass null fits the data better. A software package for R is available.
引用
收藏
页数:28
相关论文
共 31 条
[1]   A Bayesian framework for the analysis of microarray expression data: regularized t-test and statistical inferences of gene changes [J].
Baldi, P ;
Long, AD .
BIOINFORMATICS, 2001, 17 (06) :509-519
[2]   P values for composite null models [J].
Bayarri, MJ ;
Berger, JO .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2000, 95 (452) :1127-1142
[3]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[4]   A Laplace mixture model for identification of differential expression in microarray experiments [J].
Bhowmick, Debjani ;
Davison, A. C. ;
Goldstein, Darlene R. ;
Ruffieux, Yann .
BIOSTATISTICS, 2006, 7 (04) :630-641
[5]   Degrees of differential gene expression: detecting biologically significant expression differences and estimating their magnitudes [J].
Bickel, DR .
BIOINFORMATICS, 2004, 20 (05) :682-U255
[6]  
BOCHKINA N, 2007, BIOMETRICS ADV ACCES, DOI DOI 10.1111/J.1541-0420.2006.00807.X
[7]   A mixture model-based strategy for selecting sets of genes in multiclass response microarray experiments [J].
Broët, P ;
Lewin, A ;
Richardson, S ;
Dalmasso, C ;
Magdelenat, H .
BIOINFORMATICS, 2004, 20 (16) :2562-2571
[8]   Bayesian hierarchical model for identifying changes in gene expression from microarray experiments [J].
Broët, P ;
Richardson, S ;
Radvanyi, F .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2002, 9 (04) :671-683
[9]  
CRAIG A, 2007, UNPUB PLOS BIOL
[10]  
Cressie NA, 1991, STAT SPATIAL DATA