PRSice-2: Polygenic Risk Score software for biobank-scale data

被引:868
作者
Choi, Shing Wan [1 ,2 ]
O'Reilly, Paul F. [1 ,2 ]
机构
[1] Kings Coll London, Inst Psychiat Psychol & Neurosci, MRC Social Genet & Dev Psychiat Ctr, De Crespigny Pk,Denmark Hill, London SE5 8AF, England
[2] Icahn Sch Med Mt Sinai, Dept Genet & Genom Sci, 1 Gustave L Levy Pl, New York, NY 10029 USA
来源
GIGASCIENCE | 2019年 / 8卷 / 07期
基金
英国医学研究理事会;
关键词
polygenic risk score; GWAS; imputation; PREDICTION;
D O I
10.1093/gigascience/giz082
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
t Background: Polygenic risk score (PRS) analyses have become an integral part of biomedical research, exploited to gain insights into shared aetiology among traits, to control for genomic profile in experimental studies, and to strengthen causal inference, among a range of applications. Substantial efforts are now devoted to biobank projects to collect large genetic and phenotypic data, providing unprecedented opportunity for genetic discovery and applications. To process the large-scale data provided by such biobank resources, highly efficient and scalable methods and software are required. Results: Here we introduce PRSice-2, an efficient and scalable software program for automating and simplifying PRS analyses on large-scale data. PRSice-2 handles both genotyped and imputed data, provides empirical association P-values free from inflation due to overfitting, supports different inheritance models, and can evaluate multiple continuous and binary target traits simultaneously. We demonstrate that PRSice-2 is dramatically faster and more memory-efficient than PRSice-1 and alternative PRS software, LDpred and lassosum, while having comparable predictive power. Conclusion: PRSice-2's combination of efficiency and power will be increasingly important as data sizes grow and as the applications of PRS become more sophisticated, e.g., when incorporated into high-dimensional or gene set-based analyses. PRSice-2 is written in C++, with an R script for plotting, and is freely available for download from http://PRSice.info.
引用
收藏
页数:6
相关论文
共 29 条
  • [1] Allegrini A, 2018, BIORXIV, DOI [10.1101/418210, DOI 10.1101/418210]
  • [2] [Anonymous], 2018, BIORXIV, DOI DOI 10.1101/398396
  • [3] Second-generation PLINK: rising to the challenge of larger and richer datasets
    Chang, Christopher C.
    Chow, Carson C.
    Tellier, Laurent C. A. M.
    Vattikuti, Shashaank
    Purcell, Shaun M.
    Lee, James J.
    [J]. GIGASCIENCE, 2015, 4
  • [4] CHOI S, 2018, GUIDE PERFORMING POL, DOI DOI 10.1101/416545
  • [5] Secondary use of clinical data: The Vanderbilt approach
    Danciu, Ioana
    Cowan, James D.
    Basford, Melissa
    Wang, Xiaoming
    Saip, Alexander
    Osgood, Susan
    Shirey-Rice, Jana
    Kirby, Jacqueline
    Harris, Paul A.
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2014, 52 : 28 - 35
  • [6] Association of Polygenic Risk for Attention-Deficit/Hyperactivity Disorder With Co-occurring Traits and Disorders
    Du Rietz, Ebba
    Coleman, Jonathan
    Glanville, Kylie
    Choi, Shing Wan
    O'Reilly, Paul F.
    Kuntsi, Jonna
    [J]. BIOLOGICAL PSYCHIATRY-COGNITIVE NEUROSCIENCE AND NEUROIMAGING, 2018, 3 (07) : 635 - 643
  • [7] PRSice: Polygenic Risk Score software
    Euesden, Jack
    Lewis, Cathryn M.
    O'Reilly, Paul F.
    [J]. BIOINFORMATICS, 2015, 31 (09) : 1466 - 1468
  • [8] Polygenic prediction via Bayesian regression and continuous shrinkage priors
    Ge, Tian
    Chen, Chia-Yen
    Ni, Yang
    Feng, Yen-Chen Anne
    Smoller, Jordan W.
    [J]. NATURE COMMUNICATIONS, 2019, 10 (1)
  • [9] Shared genetic aetiology between cognitive functions and physical and mental health in UK Biobank (N=112151) and 24 GWAS consortia
    Hagenaars, S. P.
    Harris, S. E.
    Davies, G.
    Hill, W. D.
    Liewald, D. C. M.
    Ritchie, S. J.
    Marioni, R. E.
    Fawns-Ritchie, C.
    Cullen, B.
    Malik, R.
    Worrall, B. B.
    Sudlow, C. L. M.
    Wardlaw, J. M.
    Gallacher, J.
    Pell, J.
    McIntosh, A. M.
    Smith, D. J.
    Gale, C. R.
    Deary, I. J.
    [J]. MOLECULAR PSYCHIATRY, 2016, 21 (11) : 1624 - 1632
  • [10] Polygenic Risk Scores That Predict Common Diseases Using Millions of Single Nucleotide Polymorphisms: Is More, Better?
    Janssens, A. Cecile J. W.
    Joyner, Michael J.
    [J]. CLINICAL CHEMISTRY, 2019, 65 (05) : 609 - 611