A Bayesian Model for Cross-Study Differential Gene Expression

Cited by: 25
Authors
Scharpf, Robert B.
Tjelmeland, Hakon [1]
Parmigiani, Giovanni [2, 3]
Nobel, Andrew B. [4]
Affiliations
[1] Norwegian Univ Sci & Technol, Dept Math Sci, NO-7491 Trondheim, Norway
[2] Johns Hopkins Univ, Johns Hopkins Bloomberg Sch Publ Hlth, Dept Biostat, Baltimore, MD 21205 USA
[3] Johns Hopkins Univ, Sidney Kimmel Comprehens Canc Ctr, Baltimore, MD 21205 USA
[4] Univ N Carolina, Dept Stat, Chapel Hill, NC 27599 USA
Funding
U.S. National Science Foundation
Keywords
Bayesian hierarchical model; Bayesian meta-analysis; Differential expression; Gene expression; Multiple studies; MICROARRAY DATA; MOLECULAR CLASSIFICATION; MIXTURE MODEL; METAANALYSIS; PROFILES; ADENOCARCINOMA; NORMALIZATION; COMPUTATION; VALIDATION; CARCINOMAS;
DOI
10.1198/jasa.2009.ap07611
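Chinese Library Classification number
O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics]
Discipline classification codes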
020208; 070103; 0714
Abstract
In this article we define a hierarchical Bayesian model for microarray expression data collected from several studies and use it to identify genes that show differential expression between two conditions. Key features include shrinkage across both genes and studies, and flexible modeling that allows for interactions between platforms and the estimated effect, as well as concordant and discordant differential expression across studies. We evaluate the performance of our model in a comprehensive fashion, using both artificial data and a "split-study" validation approach that provides an agnostic assessment of the model's behavior under both the null hypothesis and a realistic alternative. The simulation results from the artificial data demonstrate the advantages of the Bayesian model. Furthermore, the simulations provide guidelines for when the Bayesian model is most likely to be useful. Most notably, in small studies the Bayesian model generally outperforms other methods when evaluated based on several performance measures across a range of simulation parameters, with the differences diminishing for larger sample sizes in the individual studies. The split-study validation illustrates appropriate shrinkage of the Bayesian model in the absence of platform, sample, and annotation differences that otherwise complicate experimental data analyses. Finally, we fit our model to four breast cancer studies using different technologies (cDNA and Affymetrix) to estimate differential expression in estrogen receptor-positive tumors versus estrogen receptor-negative tumors. Software and data for reproducing our analysis are available publicly.
Pages: 1295-1310
Number of pages: 16