Genomic data analysis using a two stage expectation propagation algorithm for analysis of sparse Bayesian high-dimensional instrumental variables regression

被引:0
|
作者
Amini, Morteza [1 ]
机构
[1] Univ Tehran, Coll Sci, Sch Math Stat & Comp Sci, Dept Stat, POB 14155-6455, Tehran, Iran
基金
美国国家科学基金会;
关键词
Causal inference; Expectation propagation; Spike-and-slab prior; Sparse instrumental variables model; GENE-EXPRESSION; SELECTION; INFERENCE; PRIORS; LASSO;
D O I
10.1080/03610918.2022.2075896
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Simultaneous analysis of gene expression data and genetic variants is highly of interest, especially when the number of gene expressions and genetic variants are both greater than the sample size. Association of both causal genes and effective SNPs makes the use of sparse modeling of such genetic data sets, highly important. The high-dimensional sparse instrumental variables models are one of such useful association models, which models the simultaneous relation of the gene expressions and genetic variants with complex traits. From a Bayesian viewpoint, the sparsity can be favored using sparsity-enforcing priors such as spike-and-slab priors. A two-stage modification of the expectation propagation (EP) algorithm is proposed and examined for approximate inference in high-dimensional sparse instrumental variables models with spike-and-slab priors. This method is an adoption of the classical two-stage least squares method, to be used with the Bayes context. A simulation study is performed to examine the performance of the methods. The proposed method is applied to analysis of the mouse obesity data.
引用
收藏
页码:2351 / 2365
页数:15
相关论文
共 41 条
  • [11] A two-stage sequential conditional selection approach to sparse high-dimensional multivariate regression models
    Chen, Zehua
    Jiang, Yiwei
    ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 2020, 72 (01) : 65 - 90
  • [12] Statistical analysis for a penalized EM algorithm in high-dimensional mixture linear regression model
    Wang, Ning
    Zhang, Xin
    Mai, Qing
    JOURNAL OF MACHINE LEARNING RESEARCH, 2024, 25
  • [13] Evaluating the performance of sparse principal component analysis methods in high-dimensional data scenarios
    Bonner, Ashley J.
    Beyene, Joseph
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2017, 46 (05) : 3794 - 3811
  • [14] iBAG: integrative Bayesian analysis of high-dimensional multiplatform genomics data
    Wang, Wenting
    Baladandayuthapani, Veerabhadran
    Morris, Jeffrey S.
    Broom, Bradley M.
    Manyam, Ganiraju
    Do, Kim-Anh
    BIOINFORMATICS, 2013, 29 (02) : 149 - 159
  • [15] Dissecting High-Dimensional Phenotypes with Bayesian Sparse Factor Analysis of Genetic Covariance Matrices
    Runcie, Daniel E.
    Mukherjee, Sayan
    GENETICS, 2013, 194 (03) : 753 - +
  • [16] Individual Data Protected Integrative Regression Analysis of High-Dimensional Heterogeneous Data
    Cai, Tianxi
    Liu, Molei
    Xia, Yin
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2022, 117 (540) : 2105 - 2119
  • [17] A Sparse Learning Machine for High-Dimensional Data with Application to Microarray Gene Analysis
    Cheng, Qiang
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2010, 7 (04) : 636 - 646
  • [18] Exploring high-dimensional biological data with sparse contrastive principal component analysis
    Boileau, Philippe
    Hejazi, Nima S.
    Dudoit, Sandrine
    BIOINFORMATICS, 2020, 36 (11) : 3422 - 3430
  • [19] Robust and sparse correlation matrix estimation for the analysis of high-dimensional genomics data
    Serra, Angela
    Coretto, Pietro
    Fratello, Michele
    Tagliaferri, Roberto
    BIOINFORMATICS, 2018, 34 (04) : 625 - 634
  • [20] Partial least squares: a versatile tool for the analysis of high-dimensional genomic data
    Boulesteix, Anne-Laure
    Strimmer, Korbinian
    BRIEFINGS IN BIOINFORMATICS, 2007, 8 (01) : 32 - 44