Genomic data analysis using a two stage expectation propagation algorithm for analysis of sparse Bayesian high-dimensional instrumental variables regression

Cited by: 0
Authors
Amini, Morteza [1]
Affiliations
[1] Univ Tehran, Coll Sci, Sch Math Stat & Comp Sci, Dept Stat, POB 14155-6455, Tehran, Iran
Funding
U.S. National Science Foundation;
Keywords
Causal inference; Expectation propagation; Spike-and-slab prior; Sparse instrumental variables model; GENE-EXPRESSION; SELECTION; INFERENCE; PRIORS; LASSO;
DOI
10.1080/03610918.2022.2075896
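Chinese Library Classification number
O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics];
Discipline classification code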
020208 ; 070103 ; 0714 ;
Abstract
Simultaneous analysis of gene expression data and genetic variants is of great interest, especially when the numbers of gene expressions and genetic variants both exceed the sample size. The association of both causal genes and effective SNPs makes sparse modeling of such genetic data sets highly important. High-dimensional sparse instrumental variables models are one such useful class of association models; they model the simultaneous relation of gene expressions and genetic variants with complex traits. From a Bayesian viewpoint, sparsity can be favored using sparsity-enforcing priors such as spike-and-slab priors. A two-stage modification of the expectation propagation (EP) algorithm is proposed and examined for approximate inference in high-dimensional sparse instrumental variables models with spike-and-slab priors. This method is an adaptation of the classical two-stage least squares method to the Bayesian context. A simulation study is performed to examine the performance of the methods. The proposed method is applied to the analysis of the mouse obesity data.
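For orientation, the classical two-stage least squares idea that the abstract says the method adapts can be sketched as follows. This is a minimal low-dimensional illustration on simulated data, not the paper's EP algorithm and without any spike-and-slab prior. The function name `two_stage_least_squares`, the variable names, and all simulation settings are assumptions made for this example only.

```python
import numpy as np

def two_stage_least_squares(Z, X, y):
    """Classical 2SLS (illustrative helper, not from the paper).

    Z: instruments, e.g. genetic variants, shape (n, q)
    X: endogenous exposures, e.g. gene expressions, shape (n, p)
    y: outcome, e.g. a complex trait, shape (n,)
    """
    # Stage 1: regress every exposure on the instruments, keep fitted values.
    gamma_hat, *_ = np.linalg.lstsq(Z, X, rcond=None)
    X_hat = Z @ gamma_hat
    # Stage 2: regress the outcome on the fitted exposures.
    beta_hat, *_ = np.linalg.lstsq(X_hat, y, rcond=None)
    return beta_hat

# Simulated data (all values are assumptions chosen for illustration).
rng = np.random.default_rng(0)
n = 5000
Z = rng.normal(size=(n, 5))
Gamma = np.array([[1.0, 0.0],
                  [0.0, 1.0],
                  [0.5, 0.5],
                  [0.0, 0.0],
                  [0.0, 0.0]])
u = rng.normal(size=n)                      # unobserved confounder
X = Z @ Gamma + u[:, None] + rng.normal(size=(n, 2))
beta_true = np.array([1.0, -0.5])
y = X @ beta_true + 2.0 * u + rng.normal(size=n)

beta_2sls = two_stage_least_squares(Z, X, y)
beta_ols, *_ = np.linalg.lstsq(X, y, rcond=None)   # ignores confounding
print("2SLS:", beta_2sls)
print("OLS: ", beta_ols)
```

In this simulation the confounder `u` affects both the exposures and the outcome, so ordinary least squares is biased while the two-stage estimate is not. In the setting the abstract describes, both stages are high-dimensional, which is where sparsity-enforcing priors would take the place of the plain least squares fits used here.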
Pages: 2351-2365
Page count: 15
Related papers
41 in total
  • [1] HYPOTHESIS TESTING IN HIGH-DIMENSIONAL INSTRUMENTAL VARIABLES REGRESSION WITH AN APPLICATION TO GENOMICS DATA
    Lu, Jiarui
    Li, Hongzhe
    STATISTICA SINICA, 2022, 32 : 613 - 633
  • [2] Bayesian high-dimensional regression for change point analysis
    Datta, Abhirup
    Zou, Hui
    Banerjee, Sudipto
    STATISTICS AND ITS INTERFACE, 2019, 12 (02) : 253 - 264
  • [3] Sparse meta-analysis with high-dimensional data
    He, Qianchuan
    Zhang, Hao Helen
    Avery, Christy L.
    Lin, D. Y.
    BIOSTATISTICS, 2016, 17 (02) : 205 - 220
  • [4] Regression analysis on high-dimensional, block diagonal structure data with focus on latent variables
    Seki, Shinei
    Nagata, Yasushi
    MATHEMATICAL METHODS AND COMPUTATIONAL TECHNIQUES IN SCIENCE AND ENGINEERING II, 2018, 1982
  • [5] A two-stage sparse logistic regression for optimal gene selection in high-dimensional microarray data classification
    Algamal, Zakariya Yahya
    Lee, Muhammad Hisyam
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2019, 13 (03) : 753 - 771
  • [6] A two-stage approach of gene network analysis for high-dimensional heterogeneous data
    Lee, Sangin
    Liang, Faming
    Cai, Ling
    Xiao, Guanghua
    BIOSTATISTICS, 2018, 19 (02) : 216 - 232
  • [7] A two-stage sparse logistic regression for optimal gene selection in high-dimensional microarray data classification
    Zakariya Yahya Algamal
    Muhammad Hisyam Lee
    Advances in Data Analysis and Classification, 2019, 13 : 753 - 771
  • [8] ordinalbayes: Fitting Ordinal Bayesian Regression Models to High-Dimensional Data Using R
    Archer, Kellie J.
    Seffernick, Anna Eames
    Sun, Shuai
    Zhang, Yiran
    STATS, 2022, 5 (02): : 371 - 384
  • [9] Factor Analysis Regression for Predictive Modeling with High-Dimensional Data
    Carter, Randy
    Michael, Netsanet
    JOURNAL OF QUANTITATIVE ECONOMICS, 2022, 20 (SUPPL 1) : 115 - 132
  • [10] Molecular Classification of Endometriosis and Disease Stage Using High-Dimensional Genomic Data
    Tamaresis, John S.
    Irwin, Juan C.
    Goldfien, Gabriel A.
    Rabban, Joseph T.
    Burney, Richard O.
    Nezhat, Camran
    DePaolo, Louis V.
    Giudice, Linda C.
    ENDOCRINOLOGY, 2014, 155 (12) : 4986 - 4999