Genomic data analysis using a two stage expectation propagation algorithm for analysis of sparse Bayesian high-dimensional instrumental variables regression

Author
Amini, Morteza [1]
Affiliation
[1] Univ Tehran, Coll Sci, Sch Math Stat & Comp Sci, Dept Stat, POB 14155-6455, Tehran, Iran
Funding
U.S. National Science Foundation
Keywords
Causal inference; Expectation propagation; Spike-and-slab prior; Sparse instrumental variables model; GENE-EXPRESSION; SELECTION; INFERENCE; PRIORS; LASSO;
DOI
10.1080/03610918.2022.2075896
Chinese Library Classification
O21 [Probability theory and mathematical statistics]; C8 [Statistics];
Discipline classification codes
020208 ; 070103 ; 0714 ;
Abstract
Simultaneous analysis of gene expression data and genetic variants is of great interest, especially when the number of gene expressions and the number of genetic variants both exceed the sample size. Identifying both the causal genes and the effective SNPs makes sparse modeling of such genetic data sets highly important. High-dimensional sparse instrumental variables models are one such class of association models; they model the simultaneous relation of gene expressions and genetic variants with complex traits. From a Bayesian viewpoint, sparsity can be favored using sparsity-enforcing priors such as spike-and-slab priors. A two-stage modification of the expectation propagation (EP) algorithm is proposed and examined for approximate inference in high-dimensional sparse instrumental variables models with spike-and-slab priors. This method is an adaptation of the classical two-stage least squares method to the Bayesian context. A simulation study is performed to examine the performance of the methods. The proposed method is applied to the analysis of the mouse obesity data.
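For orientation, the classical two-stage least squares (2SLS) method that the abstract names as the starting point can be sketched as follows. This is only the textbook low-dimensional estimator, not the paper's two-stage EP algorithm with spike-and-slab priors. The function names, the simulated data, and all coefficient values are illustrative assumptions, with Z standing in for genetic variants (instruments), X for gene expressions (exposures), and y for the trait.

```python
import numpy as np


def two_stage_least_squares(Z, X, y):
    """Textbook 2SLS (illustrative sketch, not the paper's method).

    Z: instruments, shape (n, q); X: exposures, shape (n, p); y: outcome, shape (n,).
    Assumes n > q >= p, so plain least squares is well defined.
    """
    # Stage 1: regress every exposure on the instruments and keep the fitted values.
    gamma_hat, *_ = np.linalg.lstsq(Z, X, rcond=None)
    X_hat = Z @ gamma_hat
    # Stage 2: regress the outcome on the fitted exposures.
    beta_hat, *_ = np.linalg.lstsq(X_hat, y, rcond=None)
    return gamma_hat, beta_hat


def simulate(n=5000, seed=0):
    """Hypothetical toy data with an unobserved confounder u affecting both X and y."""
    rng = np.random.default_rng(seed)
    gamma = np.array([[1.0, 0.0], [0.0, 1.0], [0.5, 0.0], [0.0, 0.5]])  # assumed values
    beta = np.array([1.0, -0.5])                                        # assumed values
    Z = rng.normal(size=(n, 4))
    u = rng.normal(size=n)
    X = Z @ gamma + u[:, None] + rng.normal(size=(n, 2))
    y = X @ beta + 2.0 * u + rng.normal(size=n)
    return Z, X, y, beta


if __name__ == "__main__":
    Z, X, y, beta = simulate()
    _, beta_2sls = two_stage_least_squares(Z, X, y)
    beta_ols, *_ = np.linalg.lstsq(X, y, rcond=None)
    print("true beta:", beta)
    print("2SLS     :", beta_2sls)
    print("naive OLS:", beta_ols)
```

In this toy setting the naive regression of y on X is biased by the confounder, while the two-stage estimate is not. When the number of instruments and exposures exceeds the sample size, as in the setting the abstract describes, neither least squares stage is well defined, which is the gap the sparse priors and the EP approximation are meant to address.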
Pages: 2351-2365
Number of pages: 15