Fast and accurate Bayesian polygenic risk modeling with variational inference

被引:12
作者
Zabad, Shadi [1 ]
Gravel, Simon [2 ]
Li, Yue [1 ]
机构
[1] McGill Univ, Sch Comp Sci, Montreal, PQ, Canada
[2] McGill Univ, Dept Human Genet, Montreal, PQ, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
GENOME-WIDE ASSOCIATION; HUMAN COMPLEX TRAITS; UK BIOBANK; VARIABLE SELECTION; MIXED-MODEL; PREDICTION; SCORES; RARE; REGRESSION; VARIANTS;
D O I
10.1016/j.ajhg.2023.03.009
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
The advent of large-scale genome-wide association studies (GWASs) has motivated the development of statistical methods for phenotype prediction with single-nucleotide polymorphism (SNP) array data. These polygenic risk score (PRS) methods use a multiple linear regres-sion framework to infer joint effect sizes of all genetic variants on the trait. Among the subset of PRS methods that operate on GWAS summary statistics, sparse Bayesian methods have shown competitive predictive ability. However, most existing Bayesian approaches employ Markov chain Monte Carlo (MCMC) algorithms, which are computationally inefficient , do not scale favorably to higher di-mensions, for posterior inference. Here, we introduce variational inference of polygenic risk scores (VIPRS), a Bayesian summary statis-tics-based PRS method that utilizes variational inference techniques to approximate the posterior distribution for the effect sizes. Our experiments with 36 simulation configurations and 12 real phenotypes from the UK Biobank dataset demonstrated that VIPRS is consis-tently competitive with the state-of-the-art in prediction accuracy while being more than twice as fast as popular MCMC-based ap-proaches. This performance advantage is robust across a variety of genetic architectures, SNP heritabilities , independent GWAS co-horts. In addition to its competitive accuracy on the "White British"samples, VIPRS showed improved transferability when applied to other ethnic groups, with up to 1.7-fold increase in R2 among individuals of Nigerian ancestry for low-density lipoprotein (LDL) cholesterol. To illustrate its scalability, we applied VIPRS to a dataset of 9.6 million genetic markers, which conferred further improvements in prediction accuracy for highly polygenic traits, such as height.
引用
收藏
页码:741 / 761
页数:22
相关论文
共 50 条
[21]   Sparse Bayesian Nonlinear System Identification Using Variational Inference [J].
Jacobs, William R. ;
Baldacchino, Tara ;
Dodd, Tony ;
Anderson, Sean R. .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2018, 63 (12) :4172-4187
[22]   Fast and accurate inference for the smoothing parameter in semiparametric models [J].
Paige, Robert L. ;
Trindade, A. Alexandre .
AUSTRALIAN & NEW ZEALAND JOURNAL OF STATISTICS, 2013, 55 (01) :25-41
[23]   Scalable Variational Inference for Bayesian Variable Selection in Regression, and Its Accuracy in Genetic Association Studies [J].
Carbonetto, Peter ;
Stephens, Matthew .
BAYESIAN ANALYSIS, 2012, 7 (01) :73-107
[24]   Sparse Bayesian Neural Networks: Bridging Model and Parameter Uncertainty through Scalable Variational Inference [J].
Hubin, Aliaksandr ;
Storvik, Geir .
MATHEMATICS, 2024, 12 (06)
[25]   Variational Bayesian Inference in High-Dimensional Linear Mixed Models [J].
Yi, Jieyi ;
Tang, Niansheng .
MATHEMATICS, 2022, 10 (03)
[26]   A comparison of variational approximations for fast inference in mixed logit models [J].
Depraetere, Nicolas ;
Vandebroek, Martina .
COMPUTATIONAL STATISTICS, 2017, 32 (01) :93-125
[27]   Bayesian inference for risk minimization via exponentially tilted empirical likelihood [J].
Tang, Rong ;
Yang, Yun .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2022, 84 (04) :1257-1286
[28]   Robust Bayesian hierarchical modeling and inference using scale mixtures of normal distributions [J].
Ouyang, Linhan ;
Zhu, Shichao ;
Ye, Keying ;
Park, Chanseok ;
Wang, Min .
IISE TRANSACTIONS, 2022, 54 (07) :659-671
[29]   Fast Bayesian inference for gene regulatory networks using ScanBMA [J].
Young, William Chad ;
Raftery, Adrian E. ;
Yeung, Ka Yee .
BMC SYSTEMS BIOLOGY, 2014, 8
[30]   Bayesian semiparametric modeling and inference with mixtures of symmetric distributions [J].
Kottas, Athanasios ;
Fellingham, Gilbert W. .
STATISTICS AND COMPUTING, 2012, 22 (01) :93-106