A Better Coefficient of Determination for Genetic Profile Analysis

被引:215
作者
Lee, Sang Hong [1 ,2 ]
Goddard, Michael E. [3 ]
Wray, Naomi R. [1 ,2 ]
Visscher, Peter M. [1 ,2 ,4 ]
机构
[1] Univ Queensland, Queensland Brain Inst, Brisbane, Qld 4072, Australia
[2] Queensland Inst Med Res, Brisbane, Qld 4006, Australia
[3] Univ Melbourne, Dept Agr & Food Syst, Melbourne, Vic, Australia
[4] Univ Queensland, Diamantina Inst, Princess Alexandra Hosp, Brisbane, Qld, Australia
基金
澳大利亚研究理事会; 英国医学研究理事会;
关键词
coefficient of determination; risk predictors; genetic profiles; goodness-of-fit; genome-wide association studies; GENOME-WIDE ASSOCIATION; BREAST-CANCER; LOGISTIC-REGRESSION; SUSCEPTIBILITY LOCI; RISK-FACTORS; PREDICTION; DISEASE; VARIANTS; ACCURACY; SCORE;
D O I
10.1002/gepi.21614
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Genome-wide association studies have facilitated the construction of risk predictors for disease from multiple Single Nucleotide Polymorphism markers. The ability of such "genetic profiles" to predict outcome is usually quantified in an independent data set. Coefficients of determination (R-2) have been a useful measure to quantify the goodness-of-fit of the genetic profile. Various pseudo-R-2 measures for binary responses have been proposed. However, there is no standard or consensus measure because the concept of residual variance is not easily defined on the observed probability scale. Unlike other nongenetic predictors such as environmental exposure, there is prior information on genetic predictors because for most traits there are estimates of the proportion of variation in risk in the population due to all genetic factors, the heritability. It is this useful ability to benchmark that makes the choice of a measure of goodness-of-fit in genetic profiling different from that of nongenetic predictors. In this study, we use a liability threshold model to establish the relationship between the observed probability scale and underlying liability scale in measuring R-2 for binary responses. We show that currently used R-2 measures are difficult to interpret, biased by ascertainment, and not comparable to heritability. We suggest a novel and globally standard measure of R-2 that is interpretable on the liability scale. Furthermore, even when using ascertained case-control studies that are typical in human disease studies, we can obtain an R-2 measure on the liability scale that can be compared directly to heritability. Genet. Epidemiol. 36:214-224, 2012. (C) 2012 Wiley Periodicals, Inc.
引用
收藏
页码:214 / 224
页数:11
相关论文
共 52 条
[1]   Polygenic inheritance of breast cancer: Implications for design of association studies [J].
Antoniou, AC ;
Easton, DF .
GENETIC EPIDEMIOLOGY, 2003, 25 (03) :190-202
[2]   Tamoxifen resistance in early breast cancer: statistical modelling of tissue markers to improve risk prediction [J].
Baneshi, M. R. ;
Warner, P. ;
Anderson, N. ;
Edwards, J. ;
Cooke, T. G. ;
Bartlett, J. M. S. .
BRITISH JOURNAL OF CANCER, 2010, 102 (10) :1503-1510
[3]   Genome-wide association defines more than 30 distinct susceptibility loci for Crohn's disease [J].
Barrett, Jeffrey C. ;
Hansoul, Sarah ;
Nicolae, Dan L. ;
Cho, Judy H. ;
Duerr, Richard H. ;
Rioux, John D. ;
Brant, Steven R. ;
Silverberg, Mark S. ;
Taylor, Kent D. ;
Barmada, M. Michael ;
Bitton, Alain ;
Dassopoulos, Themistocles ;
Datta, Lisa Wu ;
Green, Todd ;
Griffiths, Anne M. ;
Kistner, Emily O. ;
Murtha, Michael T. ;
Regueiro, Miguel D. ;
Rotter, Jerome I. ;
Schumm, L. Philip ;
Steinhart, A. Hillary ;
Targan, Stephan R. ;
Xavier, Ramnik J. ;
Libioulle, Cecile ;
Sandor, Cynthia ;
Lathrop, Mark ;
Belaiche, Jacques ;
Dewit, Olivier ;
Gut, Ivo ;
Heath, Simon ;
Laukens, Debby ;
Mni, Myriam ;
Rutgeerts, Paul ;
Van Gossum, Andre ;
Zelenika, Diana ;
Franchimont, Denis ;
Hugot, Jean-Pierre ;
de Vos, Martine ;
Vermeire, Severine ;
Louis, Edouard ;
Cardon, Lon R. ;
Anderson, Carl A. ;
Drummond, Hazel ;
Nimmo, Elaine ;
Ahmad, Tariq ;
Prescott, Natalie J. ;
Onnie, Clive M. ;
Fisher, Sheila A. ;
Marchini, Jonathan ;
Ghori, Jilur .
NATURE GENETICS, 2008, 40 (08) :955-962
[4]   Genome-wide association study and meta-analysis find that over 40 loci affect risk of type 1 diabetes [J].
Barrett, Jeffrey C. ;
Clayton, David G. ;
Concannon, Patrick ;
Akolkar, Beena ;
Cooper, Jason D. ;
Erlich, Henry A. ;
Julier, Cecile ;
Morahan, Grant ;
Nerup, Jorn ;
Nierras, Concepcion ;
Plagnol, Vincent ;
Pociot, Flemming ;
Schuilenburg, Helen ;
Smyth, Deborah J. ;
Stevens, Helen ;
Todd, John A. ;
Walker, Neil M. ;
Rich, Stephen S. .
NATURE GENETICS, 2009, 41 (06) :703-707
[5]   Evidence for Polygenic Susceptibility to Multiple Sclerosis-The Shape of Things to Come [J].
Bush, William S. ;
Sawcer, Stephen J. ;
de Jager, Philip L. ;
Oksenberg, Jorge R. ;
McCauley, Jacob L. ;
Pericak-Vance, Margaret A. ;
Haines, Jonathan L. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2010, 86 (04) :621-625
[6]   A Genetic Risk Score Combining Ten Psoriasis Risk Loci Improves Disease Prediction [J].
Chen, Haoyan ;
Poon, Annie ;
Yeung, Celestine ;
Helms, Cynthia ;
Pons, Jennifer ;
Bowcock, Anne M. ;
Kwok, Pui-Yan ;
Liao, Wilson .
PLOS ONE, 2011, 6 (04)
[7]   A COMMENT ON THE COEFFICIENT OF DETERMINATION FOR BINARY RESPONSES [J].
COX, DR ;
WERMUTH, N .
AMERICAN STATISTICIAN, 1992, 46 (01) :1-4
[8]  
Cox DR., 1989, Analysis of Binary Data, V2nd ed.
[9]   Risk factors associated with the development of ischemic colitis [J].
Cubiella Fernandez, Joaquin ;
Nunez Calvo, Luisa ;
Gonzalez Vazquez, Elvira ;
Garcia Garcia, Maria Jesus ;
Alves Perez, Maria Teresa ;
Martinez Silva, Isabel ;
Fernandez Seara, Javier .
WORLD JOURNAL OF GASTROENTEROLOGY, 2010, 16 (36) :4564-4569
[10]   Accuracy of Predicting the Genetic Risk of Disease Using a Genome-Wide Approach [J].
Daetwyler, Hans D. ;
Villanueva, Beatriz ;
Woolliams, John A. .
PLOS ONE, 2008, 3 (10)