A Bayesian mixture model for differential gene expression

被引:107
|
作者
Do, KA [1 ]
Müller, P [1 ]
Tang, F [1 ]
机构
[1] Univ Texas, MD Anderson Canc Ctr, Dept Biostat, Houston, TX 77030 USA
关键词
density estimation; Dirichlet process; gene expression; microarrays; mixture models; nonparametric Bayes method;
D O I
10.1111/j.1467-9876.2005.05593.x
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We propose model-based inference for differential gene expression, using a nonparametric Bayesian probability model for the distribution of gene intensities under various conditions. The probability model is a mixture of normal distributions. The resulting inference is similar to a popular empirical Bayes approach that is used for the same inference problem. The use of fully model-based inference mitigates some of the necessary limitations of the empirical Bayes method. We argue that inference is no more difficult than posterior simulation in traditional nonparametric mixture-of-normal models. The approach proposed is motivated by a microarray experiment that was carried out to identify genes that are differentially expressed between normal tissue and colon cancer tissue samples. Additionally, we carried out a small simulation study to verify the methods proposed. In the motivating case-studies we show how the nonparametric Bayes approach facilitates the evaluation of posterior expected false discovery rates. We also show how inference can proceed even in the absence of a null sample of known non-differentially expressed scores. This highlights the difference from alternative empirical Bayes approaches that are based on plug-in estimates.
引用
收藏
页码:627 / 644
页数:18
相关论文
共 50 条