Bayesian shrinkage models for integration and analysis of multiplatform high-dimensional genomics data

被引:1
|
作者
Xue, Hao [1 ]
Chakraborty, Sounak [2 ,4 ]
Dey, Tanujit [3 ]
机构
[1] Cornell Univ, Dept Computat Biol, Ithaca, NY USA
[2] Univ Missouri, Dept Stat, Columbia, MO USA
[3] Harvard Med Sch, Brigham & Womens Hosp, Ctr Surg & Publ Hlth, Dept Surg, Boston, MA USA
[4] Univ Missouri, Dept Stat, C209F Middlebush Hall, Columbia, MO 65211 USA
关键词
data integration; Expectation Maximization; glioblastoma; hierarchical Bayesian model; multiomics; VARIABLE SELECTION; DNA METHYLATION; PENALIZED LIKELIHOOD; GLIOBLASTOMA; EXPRESSION; INTERLEUKIN-8;
D O I
10.1002/sam.11682
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the increasing availability of biomedical data from multiple platforms of the same patients in clinical research, such as epigenomics, gene expression, and clinical features, there is a growing need for statistical methods that can jointly analyze data from different platforms to provide complementary information for clinical studies. In this paper, we propose a two-stage hierarchical Bayesian model that integrates high-dimensional biomedical data from diverse platforms to select biomarkers associated with clinical outcomes of interest. In the first stage, we use Expectation Maximization-based approach to learn the regulating mechanism between epigenomics (e.g., gene methylation) and gene expression while considering functional gene annotations. In the second stage, we group genes based on the regulating mechanism learned in the first stage. Then, we apply a group-wise penalty to select genes significantly associated with clinical outcomes while incorporating clinical features. Simulation studies suggest that our model-based data integration method shows lower false positives in selecting predictive variables compared with existing method. Moreover, real data analysis based on a glioblastoma (GBM) dataset reveals our method's potential to detect genes associated with GBM survival with higher accuracy than the existing method. Moreover, most of the selected biomarkers are crucial in GBM prognosis as confirmed by existing literature.
引用
收藏
页数:17
相关论文
共 50 条
  • [31] High-dimensional Bayesian inference in nonparametric additive models
    Shang, Zuofeng
    Li, Ping
    ELECTRONIC JOURNAL OF STATISTICS, 2014, 8 : 2804 - 2847
  • [32] Homogeneity and Sparsity Analysis for High-Dimensional Panel Data Models
    Wang, Wu
    Zhu, Zhongyi
    JOURNAL OF BUSINESS & ECONOMIC STATISTICS, 2024, 42 (01) : 26 - 35
  • [33] COMPLEXITY ANALYSIS OF BAYESIAN LEARNING OF HIGH-DIMENSIONAL DAG MODELS AND THEIR EQUIVALENCE CLASSES
    Zhou, Quan
    Chang, Hyunwoong
    ANNALS OF STATISTICS, 2023, 51 (03): : 1058 - 1085
  • [34] MetaGate: Interactive analysis of high-dimensional cytometry data with metadata integration
    Ask, Eivind Heggernes
    Tschan-Plessl, Astrid
    Hoel, Hanna Julie
    Kolstad, Arne
    Holte, Harald
    Malmberg, Karl-Johan
    PATTERNS, 2024, 5 (07):
  • [35] ordinalbayes: Fitting Ordinal Bayesian Regression Models to High-Dimensional Data Using R
    Archer, Kellie J.
    Seffernick, Anna Eames
    Sun, Shuai
    Zhang, Yiran
    STATS, 2022, 5 (02): : 371 - 384
  • [36] Bayesian variable selection in clustering high-dimensional data
    Tadesse, MG
    Sha, N
    Vannucci, M
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2005, 100 (470) : 602 - 617
  • [37] Efficient quadratures for high-dimensional Bayesian data assimilation
    Cheng, Ming
    Wang, Peng
    Tartakovsky, Daniel M.
    JOURNAL OF COMPUTATIONAL PHYSICS, 2024, 506
  • [38] Shrinkage-based Diagonal Discriminant Analysis and Its Applications in High-Dimensional Data
    Pang, Herbert
    Tong, Tiejun
    Zhao, Hongyu
    BIOMETRICS, 2009, 65 (04) : 1021 - 1029
  • [39] Bayesian variable selection for high-dimensional rank data
    Cui, Can
    Singh, Susheela P.
    Staicu, Ana-Maria
    Reich, Brian J.
    ENVIRONMETRICS, 2021, 32 (07)
  • [40] Adaptive Bayesian density regression for high-dimensional data
    Shen, Weining
    Ghosal, Subhashis
    BERNOULLI, 2016, 22 (01) : 396 - 420