Improving genetic risk prediction by leveraging pleiotropy

被引:58
作者
Li, Cong [1 ]
Yang, Can [2 ,3 ]
Gelernter, Joel [3 ,4 ,5 ,6 ]
Zhao, Hongyu [1 ,2 ]
机构
[1] Yale Univ, Program Computat Biol & Bioinformat, New Haven, CT 06520 USA
[2] Yale Univ, Yale Sch Publ Hlth, Dept Biostat, New Haven, CT 06520 USA
[3] Yale Univ, Sch Med, Dept Psychiat, New Haven, CT 06520 USA
[4] Yale Univ, Sch Med, VACT Healthcare Ctr, West Haven, CT 06516 USA
[5] Yale Univ, Sch Med, Dept Genet, West Haven, CT 06516 USA
[6] Yale Univ, Sch Med, Dept Neurobiol, West Haven, CT 06516 USA
关键词
GENOME-WIDE ASSOCIATION; MISSING HERITABILITY; COMPLEX DISEASES; TRAITS; VARIANCE;
D O I
10.1007/s00439-013-1401-5
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
An important task of human genetics studies is to predict accurately disease risks in individuals based on genetic markers, which allows for identifying individuals at high disease risks, and facilitating their disease treatment and prevention. Although hundreds of genome-wide association studies (GWAS) have been conducted on many complex human traits in recent years, there has been only limited success in translating these GWAS data into clinically useful risk prediction models. The predictive capability of GWAS data is largely bottlenecked by the available training sample size due to the presence of numerous variants carrying only small to modest effects. Recent studies have shown that different human traits may share common genetic bases. Therefore, an attractive strategy to increase the training sample size and hence improve the prediction accuracy is to integrate data from genetically correlated phenotypes. Yet, the utility of genetic correlation in risk prediction has not been explored in the literature. In this paper, we analyzed GWAS data for bipolar and related disorders and schizophrenia with a bivariate ridge regression method, and found that jointly predicting the two phenotypes could substantially increase prediction accuracy as measured by the area under the receiver operating characteristic curve. We also found similar prediction accuracy improvements when we jointly analyzed GWAS data for Crohn's disease and ulcerative colitis. The empirical observations were substantiated through our comprehensive simulation studies, suggesting that a gain in prediction accuracy can be obtained by combining phenotypes with relatively high genetic correlations. Through both real data and simulation studies, we demonstrated pleiotropy can be leveraged as a valuable asset that opens up a new opportunity to improve genetic risk prediction in the future.
引用
收藏
页码:639 / 650
页数:12
相关论文
共 37 条
  • [1] Improved Detection of Common Variants Associated with Schizophrenia by Leveraging Pleiotropy with Cardiovascular-Disease Risk Factors
    Andreassen, Ole A.
    Djurovic, Srdjan
    Thompson, Wesley K.
    Schork, Andrew J.
    Kendler, Kenneth S.
    O'Donovan, Michael C.
    Rujescu, Dan
    Werge, Thomas
    van de Bunt, Martijn
    Morris, Andrew P.
    McCarthy, Mark I.
    Roddey, J. Cooper
    McEvoy, Linda K.
    Desikan, Rahul S.
    Dale, Anders M.
    [J]. AMERICAN JOURNAL OF HUMAN GENETICS, 2013, 92 (02) : 197 - 209
  • [2] ADAPTIVE MULTIVARIATE RIDGE-REGRESSION
    BROWN, PJ
    ZIDEK, JV
    [J]. ANNALS OF STATISTICS, 1980, 8 (01) : 64 - 74
  • [3] Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls
    Burton, Paul R.
    Clayton, David G.
    Cardon, Lon R.
    Craddock, Nick
    Deloukas, Panos
    Duncanson, Audrey
    Kwiatkowski, Dominic P.
    McCarthy, Mark I.
    Ouwehand, Willem H.
    Samani, Nilesh J.
    Todd, John A.
    Donnelly, Peter
    Barrett, Jeffrey C.
    Davison, Dan
    Easton, Doug
    Evans, David
    Leung, Hin-Tak
    Marchini, Jonathan L.
    Morris, Andrew P.
    Spencer, Chris C. A.
    Tobin, Martin D.
    Attwood, Antony P.
    Boorman, James P.
    Cant, Barbara
    Everson, Ursula
    Hussey, Judith M.
    Jolley, Jennifer D.
    Knight, Alexandra S.
    Koch, Kerstin
    Meech, Elizabeth
    Nutland, Sarah
    Prowse, Christopher V.
    Stevens, Helen E.
    Taylor, Niall C.
    Walters, Graham R.
    Walker, Neil M.
    Watkins, Nicholas A.
    Winzer, Thilo
    Jones, Richard W.
    McArdle, Wendy L.
    Ring, Susan M.
    Strachan, David P.
    Pembrey, Marcus
    Breen, Gerome
    St Clair, David
    Caesar, Sian
    Gordon-Smith, Katherine
    Jones, Lisa
    Fraser, Christine
    Green, Elain K.
    [J]. NATURE, 2007, 447 (7145) : 661 - 678
  • [4] GWAS: heritability missing in action?
    Clarke, Angus J.
    Cooper, David N.
    [J]. EUROPEAN JOURNAL OF HUMAN GENETICS, 2010, 18 (08) : 859 - 861
  • [5] Implications of the Human Genome Project for medical science
    Collins, FS
    McKusick, VA
    [J]. JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2001, 285 (05): : 540 - 544
  • [6] Predicting genetic predisposition in humans: the promise of whole-genome markers
    de los Campos, Gustavo
    Gianola, Daniel
    Allison, David B.
    [J]. NATURE REVIEWS GENETICS, 2010, 11 (12) : 880 - 886
  • [7] Rare Variants Create Synthetic Genome-Wide Associations
    Dickson, Samuel P.
    Wang, Kai
    Krantz, Ian
    Hakonarson, Hakon
    Goldstein, David B.
    [J]. PLOS BIOLOGY, 2010, 8 (01)
  • [8] VIEWPOINT Missing heritability and strategies for finding the underlying causes of complex disease
    Eichler, Evan E.
    Flint, Jonathan
    Gibson, Greg
    Kong, Augustine
    Leal, Suzanne M.
    Moore, Jason H.
    Nadeau, Joseph H.
    [J]. NATURE REVIEWS GENETICS, 2010, 11 (06) : 446 - 450
  • [10] Fan RE, 2008, J MACH LEARN RES, V9, P1871