Using information of relatives in genomic prediction to apply effective stratified medicine

被引:31
作者
Lee, S. Hong [1 ]
Weerasinghe, W. M. Shalanee P. [1 ]
Wray, Naomi R. [2 ]
Goddard, Michael E. [3 ,4 ]
van der Werf, Julius H. J. [1 ]
机构
[1] Univ New England, Sch Environm & Rural Sci, Armidale, NSW 2351, Australia
[2] Univ Queensland, Queensland Brain Inst, Ctr Neurogenet & Stat Genom, Brisbane, Qld 4072, Australia
[3] Univ Melbourne, Fac Land & Food Resources, Melbourne, Vic, Australia
[4] Dept Primary Ind, Biosci Res Div, Bundoora, Vic, Australia
基金
英国医学研究理事会; 澳大利亚研究理事会;
关键词
FAMILY-HISTORY; RISK PREDICTION; MISSING HERITABILITY; ACCURACY; HEIGHT; HEALTH; POPULATION; ALGORITHM; SELECTION; LIVESTOCK;
D O I
10.1038/srep42091
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Genomic prediction shows promise for personalised medicine in which diagnosis and treatment are tailored to individuals based on their genetic profiles for complex diseases. We present a theoretical framework to demonstrate that prediction accuracy can be improved by targeting more informative individuals in the data set used to generate the predictors ("discovery sample") to include those with genetically close relationships with the subjects put forward for risk prediction. Increase of prediction accuracy from closer relationships is achieved under an additive model and does not rely on any family or interaction effects. Using theory, simulations and real data analyses, we show that the predictive accuracy or the area under the receiver operating characteristic curve (AUC) increased exponentially with decreasing effective size (N-e), i. e. when individuals are closely related. For example, with the sample size of discovery set N = 3000, heritability h(2) = 0.5 and population prevalence K = 0.1, AUC value approached to 0.9 and the top percentile of the estimated genetic profile scores had 23 times higher proportion of cases than the general population. This suggests that there is considerable room to increase prediction accuracy by using a design that does not exclude closer relationships.
引用
收藏
页数:13
相关论文
共 61 条
[1]   Accurate and Robust Genomic Prediction of Celiac Disease Using Statistical Learning [J].
Abraham, Gad ;
Tye-Din, Jason A. ;
Bhalala, Oneil G. ;
Kowalczyk, Adam ;
Zobel, Justin ;
Inouye, Michael .
PLOS GENETICS, 2014, 10 (02)
[2]   Predicting human height by Victorian and genomic methods [J].
Aulchenko, Yurii S. ;
Struchalin, Maksim V. ;
Belonogova, Nadezhda M. ;
Axenovich, Tatiana I. ;
Weedon, Michael N. ;
Hofman, Albert ;
Uitterlinden, Andre G. ;
Kayser, Manfred ;
Oostra, Ben A. ;
van Duijn, Cornelia M. ;
Janssens, A. Cecile J. W. ;
Borodin, Pavel M. .
EUROPEAN JOURNAL OF HUMAN GENETICS, 2009, 17 (08) :1070-1075
[3]   Characterizing Race/Ethnicity and Genetic Ancestry for 100,000 Subjects in the Genetic Epidemiology Research on Adult Health and Aging (GERA) Cohort [J].
Banda, Yambazi ;
Kvale, Mark N. ;
Hoffmann, Thomas J. ;
Hesselson, Stephanie E. ;
Ranatunga, Dilrini ;
Tang, Hua ;
Sabatti, Chiara ;
Croen, Lisa A. ;
Dispensa, Brad P. ;
Henderson, Mary ;
Iribarren, Carlos ;
Jorgenson, Eric ;
Kushi, Lawrence H. ;
Ludwig, Dana ;
Olberg, Diane ;
Quesenberry, Charles P., Jr. ;
Rowell, Sarah ;
Sadler, Marianne ;
Sakoda, Lori C. ;
Sciortino, Stanley ;
Shen, Ling ;
Smethurst, David ;
Somkin, Carol P. ;
Van Den Eeden, Stephen K. ;
Walter, Lawrence ;
Whitmer, Rachel A. ;
Kwok, Pui-Yan ;
Schaefer, Catherine ;
Risch, Neil .
GENETICS, 2015, 200 (04) :1285-+
[4]   Improved ancestry inference using weights from external reference panels [J].
Chen, Chia-Yen ;
Pollack, Samuela ;
Hunter, David J. ;
Hirschhorn, Joel N. ;
Kraft, Peter ;
Price, Alkes L. .
BIOINFORMATICS, 2013, 29 (11) :1399-1406
[5]   EigenGWAS: finding loci under selection through genome-wide association studies of eigenvectors in structured populations [J].
Chen, G-B ;
Lee, S. H. ;
Zhu, Z-X ;
Benyamin, B. ;
Robinson, M. R. .
HEREDITY, 2016, 117 (01) :51-61
[6]   The importance of information on relatives for the prediction of genomic breeding values and the implications for the makeup of reference data sets in livestock breeding schemes [J].
Clark, Samuel A. ;
Hickey, John M. ;
Daetwyler, Hans D. ;
van der Werf, Julius H. J. .
GENETICS SELECTION EVOLUTION, 2012, 44 :4
[7]   GWAS: heritability missing in action? [J].
Clarke, Angus J. ;
Cooper, David N. .
EUROPEAN JOURNAL OF HUMAN GENETICS, 2010, 18 (08) :859-861
[8]   Accuracy of Predicting the Genetic Risk of Disease Using a Genome-Wide Approach [J].
Daetwyler, Hans D. ;
Villanueva, Beatriz ;
Woolliams, John A. .
PLOS ONE, 2008, 3 (10)
[9]   Prediction of Complex Human Traits Using the Genomic Best Linear Unbiased Predictor [J].
de los Campos, Gustavo ;
Vazquez, Ana I. ;
Fernando, Rohan ;
Klimentidis, Yann C. ;
Sorensen, Daniel .
PLOS GENETICS, 2013, 9 (07)
[10]   Comparison of Family History and SNPs for Predicting Risk of Complex Disease [J].
Do, Chuong B. ;
Hinds, David A. ;
Francke, Uta ;
Eriksson, Nicholas .
PLOS GENETICS, 2012, 8 (10)