Bayesian shrinkage models for integration and analysis of multiplatform high-dimensional genomics data

Cited by: 1
Authors
Xue, Hao [1]
Chakraborty, Sounak [2, 4]
Dey, Tanujit [3]
Affiliations
[1] Cornell Univ, Dept Computat Biol, Ithaca, NY USA
[2] Univ Missouri, Dept Stat, Columbia, MO USA
[3] Harvard Med Sch, Brigham & Womens Hosp, Ctr Surg & Publ Hlth, Dept Surg, Boston, MA USA
[4] Univ Missouri, Dept Stat, C209F Middlebush Hall, Columbia, MO 65211 USA
Keywords
data integration; Expectation Maximization; glioblastoma; hierarchical Bayesian model; multiomics; variable selection; DNA methylation; penalized likelihood; expression; interleukin-8
DOI
10.1002/sam.11682
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Clinical research increasingly collects biomedical data from multiple platforms on the same patients, such as epigenomics, gene expression, and clinical features. This creates a growing need for statistical methods that jointly analyze data from different platforms so that each contributes complementary information. In this paper, we propose a two-stage hierarchical Bayesian model that integrates high-dimensional biomedical data from diverse platforms to select biomarkers associated with clinical outcomes of interest. In the first stage, we use an Expectation Maximization-based approach to learn the regulating mechanism between epigenomics (e.g., gene methylation) and gene expression while accounting for functional gene annotations. In the second stage, we group genes according to the regulating mechanism learned in the first stage, then apply a group-wise penalty to select genes significantly associated with clinical outcomes while incorporating clinical features. Simulation studies suggest that our model-based data integration method yields fewer false positives when selecting predictive variables than the existing method. Real data analysis on a glioblastoma (GBM) dataset indicates that our method can detect genes associated with GBM survival with higher accuracy than the existing method. In addition, existing literature confirms that most of the selected biomarkers are crucial in GBM prognosis.
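To make the idea of a group-wise penalty concrete, the following is a minimal sketch of block soft-thresholding, the proximal step of the classical group lasso. It is an illustrative frequentist stand-in and not the hierarchical Bayesian shrinkage prior proposed in the paper. The function name `group_soft_threshold`, the coefficient values, the group labels, and the penalty level `lam` are all hypothetical.

```python
import math


def group_soft_threshold(beta, groups, lam):
    """Shrink coefficients group by group (group lasso proximal step).

    A group whose Euclidean norm is at most `lam` is set entirely to zero;
    otherwise the whole group is scaled toward zero by (1 - lam / norm).
    """
    out = [0.0] * len(beta)
    for g in set(groups):
        idx = [i for i, gi in enumerate(groups) if gi == g]
        norm = math.sqrt(sum(beta[i] ** 2 for i in idx))
        if norm > lam:
            scale = 1.0 - lam / norm
            for i in idx:
                out[i] = scale * beta[i]
    return out


# Hypothetical example: six gene coefficients in three groups.
beta = [3.0, 4.0, 0.1, -0.2, 0.0, 2.0]
groups = [0, 0, 1, 1, 2, 2]
shrunk = group_soft_threshold(beta, groups, lam=1.0)
print(shrunk)
```

In this toy input the second group has a small joint norm and is removed as a whole, while the other two groups are kept and shrunk. That is the behaviour a group-wise penalty is meant to produce: genes sharing a group are selected or dropped together.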
Pages: 17
Related papers
50 in total
  • [41] Saving behaviour and health: A high-dimensional Bayesian analysis of British panel data
    Brown, Sarah
    Ghosh, Pulak
    Gray, Daniel
    Pareek, Bhuvanesh
    Roberts, Jennifer
    EUROPEAN JOURNAL OF FINANCE, 2021, 27 (16) : 1581 - 1603
  • [42] Bayesian variable selection with sparse and correlation priors for high-dimensional data analysis
    Aijun Yang
    Xuejun Jiang
    Lianjie Shu
    Jinguan Lin
    Computational Statistics, 2017, 32 : 127 - 143
  • [43] Bayesian variable selection with sparse and correlation priors for high-dimensional data analysis
    Yang, Aijun
    Jiang, Xuejun
    Shu, Lianjie
    Lin, Jinguan
    COMPUTATIONAL STATISTICS, 2017, 32 (01) : 127 - 143
  • [44] Scalable spatio-temporal Bayesian analysis of high-dimensional electroencephalography data
    Mohammed, Shariq
    Dey, Dipak K.
    CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2021, 49 (01) : 107 - 128
  • [45] Boosting threshold classifiers for high-dimensional data in functional genomics
    Lausser, Ludwig
    Buchholz, Malte
    Kestler, Hans A.
    ARTIFICIAL NEURAL NETWORKS IN PATTERN RECOGNITION, PROCEEDINGS, 2008, 5064 : 147 - +
  • [46] Bayesian Variable Selection in Structured High-Dimensional Covariate Spaces With Applications in Genomics
    Li, Fan
    Zhang, Nancy R.
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2010, 105 (491) : 1202 - 1214
  • [47] In Nonparametric and High-Dimensional Models, Bayesian Ignorability is an Informative Prior
    Linero, Antonio R.
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2024, 119 (548) : 2785 - 2798
  • [48] Variational Bayesian Inference in High-Dimensional Linear Mixed Models
    Yi, Jieyi
    Tang, Niansheng
    MATHEMATICS, 2022, 10 (03)
  • [49] High-Dimensional Posterior Consistency in Bayesian Vector Autoregressive Models
    Ghosh, Satyajit
    Khare, Kshitij
    Michailidis, George
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2019, 114 (526) : 735 - 748
  • [50] A comparison study of Bayesian high-dimensional linear regression models
    Shin, Ju-Won
    Lee, Kyoungjae
    KOREAN JOURNAL OF APPLIED STATISTICS, 2021, 34 (03) : 491 - 505