Incorporating functional priors improves polygenic prediction accuracy in UK Biobank and 23andMe data sets

被引:0
作者
Carla Márquez-Luna
Steven Gazal
Po-Ru Loh
Samuel S. Kim
Nicholas Furlotte
Adam Auton
Alkes L. Price
机构
[1] Harvard T.H. Chan School of Public Health,Department of Biostatistics
[2] Broad Institute of Harvard and MIT,Program in Medical and Population Genetics
[3] Icahn School of Medicine at Mount Sinai,Charles R. Bronfman Institute for Personalized Medicine
[4] Harvard T.H. Chan School of Public Health,Department of Epidemiology
[5] Brigham and Women’s Hospital and Harvard Medical School,Division of Genetics, Department of Medicine
[6] Massachusetts Institute of Technology,Department of Electrical Engineering and Computer Science
[7] 23andMe Inc.,undefined
来源
Nature Communications | / 12卷
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Polygenic risk prediction is a widely investigated topic because of its promising clinical applications. Genetic variants in functional regions of the genome are enriched for complex trait heritability. Here, we introduce a method for polygenic prediction, LDpred-funct, that leverages trait-specific functional priors to increase prediction accuracy. We fit priors using the recently developed baseline-LD model, including coding, conserved, regulatory, and LD-related annotations. We analytically estimate posterior mean causal effect sizes and then use cross-validation to regularize these estimates, improving prediction accuracy for sparse architectures. We applied LDpred-funct to predict 21 highly heritable traits in the UK Biobank (avg N = 373 K as training data). LDpred-funct attained a +4.6% relative improvement in average prediction accuracy (avg prediction R2 = 0.144; highest R2 = 0.413 for height) compared to SBayesR (the best method that does not incorporate functional information). For height, meta-analyzing training data from UK Biobank and 23andMe cohorts (N = 1107 K) increased prediction R2 to 0.431. Our results show that incorporating functional priors improves polygenic prediction accuracy, consistent with the functional architecture of complex traits.
引用
收藏
相关论文
共 90 条
  • [1] Maurano MT(2012)Systematic localization of common disease-associated variation in regulatory DNA Science 337 1222794-573
  • [2] Trynka G(2013)Chromatin marks identify critical cell types for fine mapping complex trait variants Nat. Genet. 45 124-330
  • [3] Pickrell JK(2014)Joint analysis of functional genomic data and genome-wide association studies of 18 human traits Am. J. Hum Genet. 94 559-1235
  • [4] Finucane HK(2015)Integrative analysis of 111 reference human epigenomes Nature 518 317-406
  • [5] Farh KK-H(2015)Partitioning heritability by functional annotation using genome-wide association summary statistics Nat. Genet. 47 1228-1224
  • [6] Chatterjee N(2015)Genetic and epigenetic fine mapping of causal autoimmune disease variants Nature 518 337-14
  • [7] Shi J(2016)Developing and evaluating polygenic risk prediction models for stratified disease prevention Nat. Rev. Genet. 17 392-22
  • [8] García-Closas M(2018)Genome-wide polygenic scores for common diseases identify individuals with risk equivalent to monogenic mutations Nat. Genet. 50 1219-1557
  • [9] Khera AV(2019)Improved polygenic prediction by bayesian multiple regression on summary statistics Nat. Commun. 10 1561-592
  • [10] Lloyd-Jones LR(2019)Polygenic prediction via Bayesian regression and continuous shrinkage priors Nat. Commun. 10 1-447