A Bayesian mixture model for differential gene expression

Cited by: 108
Authors
Do, KA [1]
Müller, P [1]
Tang, F [1]
Affiliations
[1] Univ Texas, MD Anderson Canc Ctr, Dept Biostat, Houston, TX 77030 USA
Keywords
density estimation; Dirichlet process; gene expression; microarrays; mixture models; nonparametric Bayes method
DOI
10.1111/j.1467-9876.2005.05593.x
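Chinese Library Classification (CLC) number
O21 [Probability theory and mathematical statistics]; C8 [Statistics]
Subject classification code
020208; 070103; 0714
Abstract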
We propose model-based inference for differential gene expression, using a nonparametric Bayesian probability model for the distribution of gene intensities under various conditions. The probability model is a mixture of normal distributions. The resulting inference is similar to a popular empirical Bayes approach that is used for the same inference problem. The use of fully model-based inference mitigates some of the necessary limitations of the empirical Bayes method. We argue that inference is no more difficult than posterior simulation in traditional nonparametric mixture-of-normal models. The approach proposed is motivated by a microarray experiment that was carried out to identify genes that are differentially expressed between normal tissue and colon cancer tissue samples. Additionally, we carried out a small simulation study to verify the methods proposed. In the motivating case-studies we show how the nonparametric Bayes approach facilitates the evaluation of posterior expected false discovery rates. We also show how inference can proceed even in the absence of a null sample of known non-differentially expressed scores. This highlights the difference from alternative empirical Bayes approaches that are based on plug-in estimates.
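The abstract mentions evaluating posterior expected false discovery rates. As a minimal sketch only, and not the authors' implementation, the Python below uses one common formulation: given per-gene posterior probabilities of differential expression, the posterior expected FDR of a flagged set is the average of (1 - probability) over the flagged genes. The function name `posterior_expected_fdr`, the fixed threshold rule, and the example probabilities are all assumptions made for illustration; in practice the probabilities would come from posterior simulation under the mixture model.

```python
def posterior_expected_fdr(post_prob_de, threshold):
    """Posterior expected FDR for genes flagged at a probability threshold.

    post_prob_de: posterior probabilities that each gene is differentially
                  expressed (hypothetical inputs, e.g. MCMC averages).
    threshold:    a gene is flagged when its probability exceeds this value.
    """
    flagged = [p for p in post_prob_de if p > threshold]
    if not flagged:
        # No discoveries: report 0 by convention.
        return 0.0
    # Each flagged gene is a false discovery with posterior probability 1 - p.
    return sum(1.0 - p for p in flagged) / len(flagged)


# Illustrative probabilities only; not data from the paper.
probs = [0.99, 0.95, 0.80, 0.40, 0.10]
fdr = posterior_expected_fdr(probs, 0.5)
print(f"flagged-set posterior expected FDR: {fdr:.4f}")
```

Raising the threshold flags fewer genes and typically lowers the expected FDR, so the threshold can be tuned to meet a target rate.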
Pages: 627-644
Page count: 18
Related papers
50 in total
  • [31] A semi-parametric Bayesian model for unsupervised differential co-expression analysis
    Freudenberg, Johannes M.
    Sivaganesan, Siva
    Wagner, Michael
    Medvedovic, Mario
    [J]. BMC BIOINFORMATICS, 2010, 11
  • [32] Bayesian mixture model averaging for identifying the different gene expressions of chickpea (Cicer arietinum) plant tissue
    Astuti, Ani Budi
    Iriawan, Nur
    Irhamah
    Kuswanto, Heri
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2017, 46 (21) : 10564 - 10581
  • [33] Bayesian feature and model selection for Gaussian mixture models
    Constantinopoulos, C
    Titsias, MK
    Likas, A
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2006, 28 (06) : 1013 - U1
  • [34] Semi-parametric differential expression analysis via partial mixture estimation
    Rossell, David
    Guerra, Rudy
    Scott, Clayton
    [J]. STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2008, 7 (01)
  • [35] Differential gene expression in patients with amyotrophic lateral sclerosis
    Shtilbans, Alexander
    Choi, Soon-Gang
    Fowkes, Mary E.
    Khitrov, Greg
    Shahbazi, Mona
    Ting, Jess
    Zhang, Weijia
    Sun, Yezhou
    Sealfon, Stuart C.
    Lange, Dale J.
    [J]. AMYOTROPHIC LATERAL SCLEROSIS, 2011, 12 (04) : 250 - 256
  • [36] Differential gene expression in leiomyosarcoma
    Skubitz, KM
    Skubitz, APN
    [J]. CANCER, 2003, 98 (05) : 1029 - 1038
  • [37] A recursively partitioned mixture model for clustering time-course gene expression data
    Koestler, Devin C.
    Marsit, Carmen J.
    Christensen, Brock C.
    Kelsey, Karl T.
    Houseman, E. Andres
    [J]. TRANSLATIONAL CANCER RESEARCH, 2014, 3 (03) : 217 - +
  • [38] Bayesian Fourier clustering of gene expression data
    Kim, Jaehee
    Kyung, Minjung
    [J]. COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2017, 46 (08) : 6475 - 6494
  • [39] A Bayesian multivariate mixture model for high throughput spatial transcriptomics
    Allen, Carter
    Chang, Yuzhou
    Neelon, Brian
    Chang, Won
    Kim, Hang J.
    Li, Zihai
    Ma, Qin
    Chung, Dongjun
    [J]. BIOMETRICS, 2023, 79 (03) : 1775 - 1787
  • [40] A Bayesian semiparametric accelerate failure time mixture cure model
    Wang, Yijun
    Wang, Weiwei
    Tang, Yincai
    [J]. INTERNATIONAL JOURNAL OF BIOSTATISTICS, 2022, 18 (02) : 473 - 485