Predictive accuracy of risk factors and markers: a simulation study of the effect of novel markers on different performance measures for logistic regression models

被引:39
作者
Austin, Peter C. [1 ,2 ,3 ]
Steyerberg, Ewout W. [4 ]
机构
[1] Inst Clin Evaluat Sci, Toronto, ON M4N 3M5, Canada
[2] Univ Toronto, Dept Hlth Management Policy & Evaluat, Toronto, ON, Canada
[3] Univ Toronto, Dalla Lana Sch Publ Hlth, Toronto, ON, Canada
[4] Erasmus MC, Dept Publ Hlth, Rotterdam, Netherlands
基金
加拿大健康研究院;
关键词
logistic regression; predictive model; predictive accuracy; Brier score; discrimination; c-statistic; ROC curve; ROC CURVE; DISCRIMINATION; MORTALITY; ABILITY; AREA;
D O I
10.1002/sim.5598
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The change in c-statistic is frequently used to summarize the change in predictive accuracy when a novel risk factor is added to an existing logistic regression model. We explored the relationship between the absolute change in the c-statistic, Brier score, generalized R2, and the discrimination slope when a risk factor was added to an existing model in an extensive set of Monte Carlo simulations. The increase in model accuracy due to the inclusion of a novel marker was proportional to both the prevalence of the marker and to the odds ratio relating the marker to the outcome but inversely proportional to the accuracy of the logistic regression model with the marker omitted. We observed greater improvements in model accuracy when the novel risk factor or marker was uncorrelated with the existing predictor variable compared with when the risk factor has a positive correlation with the existing predictor variable. We illustrated these findings by using a study on mortality prediction in patients hospitalized with heart failure. In conclusion, the increase in predictive accuracy by adding a marker should be considered in the context of the accuracy of the initial model. Copyright (C) 2012 John Wiley & Sons, Ltd.
引用
收藏
页码:661 / 672
页数:12
相关论文
共 29 条
[1]  
[Anonymous], 2005, R LANG ENV STAT COMP
[2]   Putting Risk Prediction in Perspective: Relative Utility Curves [J].
Baker, Stuart G. .
JNCI-JOURNAL OF THE NATIONAL CANCER INSTITUTE, 2009, 101 (22) :1538-1542
[3]   Estimation of time-dependent area under the ROC curve for long-term risk prediction [J].
Chambless, Lloyd E. ;
Diao, Guoqing .
STATISTICS IN MEDICINE, 2006, 25 (20) :3474-3486
[4]   Several methods to assess improvement in risk prediction models: Extension to survival analysis [J].
Chambless, Lloyd E. ;
Cummiskey, Christopher P. ;
Cui, Gang .
STATISTICS IN MEDICINE, 2011, 30 (01) :22-38
[5]   Use and misuse of the receiver operating characteristic curve in risk prediction [J].
Cook, Nancy R. .
CIRCULATION, 2007, 115 (07) :928-935
[6]   DEMAND FOR AUTOMOBILES [J].
CRAGG, JG ;
UHLER, RS .
CANADIAN JOURNAL OF ECONOMICS, 1970, 3 (03) :386-406
[7]  
Harrell FE, 1996, STAT MED, V15, P361, DOI 10.1002/(SICI)1097-0258(19960229)15:4<361::AID-SIM168>3.0.CO
[8]  
2-4
[9]  
Harrell FE., 2001, Regression Modeling Strategies: with Applications to Linear Models, Logistic Regression, and Survival Analysis, V608, DOI DOI 10.2147/
[10]   The impact of genotype frequencies on the clinical validity of genomic profiling for predicting common chronic diseases [J].
Janssens, A. Cecile J. W. ;
Moonesinghe, Ramal ;
Yang, Quahne ;
Steyerberg, Ewout W. ;
van Duijn, Cornelia M. ;
Khoury, Muin J. .
GENETICS IN MEDICINE, 2007, 9 (08) :528-535