Assessing Discriminative Performance at External Validation of Clinical Prediction Models

被引:29
作者
Nieboer, Daan [1 ]
van der Ploeg, Tjeerd [2 ]
Steyerberg, Ewout W. [1 ]
机构
[1] Erasmus MC Univ Med Ctr, Dept Publ Hlth, Rotterdam, Netherlands
[2] Inholland Univ, Med Ctr Alkmaar, Dept Sci, Alkmaar, Netherlands
来源
PLOS ONE | 2016年 / 11卷 / 02期
关键词
TRAUMATIC BRAIN-INJURY; IMPACT; FRAMEWORK;
D O I
10.1371/journal.pone.0148820
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Introduction External validation studies are essential to study the generalizability of prediction models. Recently a permutation test, focusing on discrimination as quantified by the c-statistic, was proposed to judge whether a prediction model is transportable to a new setting. We aimed to evaluate this test and compare it to previously proposed procedures to judge any changes in c-statistic from development to external validation setting. Methods We compared the use of the permutation test to the use of benchmark values of the c-statistic following from a previously proposed framework to judge transportability of a prediction model. In a simulation study we developed a prediction model with logistic regression on a development set and validated them in the validation set. We concentrated on two scenarios: 1) the case-mix was more heterogeneous and predictor effects were weaker in the validation set compared to the development set, and 2) the case-mix was less heterogeneous in the validation set and predictor effects were identical in the validation and development set. Furthermore we illustrated the methods in a case study using 15 datasets of patients suffering from traumatic brain injury. Results The permutation test indicated that the validation and development set were homogenous in scenario 1 (in almost all simulated samples) and heterogeneous in scenario 2 (in 17%-39% of simulated samples). Previously proposed benchmark values of the c-statistic and the standard deviation of the linear predictors correctly pointed at the more heterogeneous case-mix in scenario 1 and the less heterogeneous case-mix in scenario 2. Conclusion The recently proposed permutation test may provide misleading results when externally validating prediction models in the presence of case-mix differences between the development and validation population. To correctly interpret the c-statistic found at external validation it is crucial to disentangle case-mix differences from incorrect regression coefficients.
引用
收藏
页数:10
相关论文
共 21 条
  • [1] Graphical assessment of internal and external calibration of logistic regression models by using loess smoothers
    Austin, Peter C.
    Steyerberg, Ewout W.
    [J]. STATISTICS IN MEDICINE, 2014, 33 (03) : 517 - 535
  • [2] Interpreting the concordance statistic of a logistic regression model: relation to the variance and odds ratio of a continuous explanatory variable
    Austin, Peter C.
    Steyerberg, Ewout W.
    [J]. BMC MEDICAL RESEARCH METHODOLOGY, 2012, 12
  • [3] External validation is necessary in, prediction research: A clinical example
    Bleeker, SE
    Moll, HA
    Steyerberg, EW
    Donders, ART
    Derksen-Lubsen, G
    Grobbee, DE
    Moons, KGM
    [J]. JOURNAL OF CLINICAL EPIDEMIOLOGY, 2003, 56 (09) : 826 - 832
  • [4] COX DR, 1958, BIOMETRIKA, V45, P562, DOI 10.1093/biomet/45.3-4.562
  • [5] A new framework to enhance the interpretation of external validation studies of clinical prediction models
    Debray, Thomas P. A.
    Vergouwe, Yvonne
    Koffijberg, Hendrik
    Nieboer, Daan
    Steyerberg, Ewout W.
    Moons, Karel G. M.
    [J]. JOURNAL OF CLINICAL EPIDEMIOLOGY, 2015, 68 (03) : 280 - 289
  • [6] EVALUATING THE YIELD OF MEDICAL TESTS
    HARRELL, FE
    CALIFF, RM
    PRYOR, DB
    LEE, KL
    ROSATI, RA
    [J]. JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 1982, 247 (18): : 2543 - 2546
  • [7] Predicting outcome after traumatic brain injury: Development and validation of a prognostic score based on admission characteristics
    Hukkelhoven, CWPM
    Steyerberg, EW
    Habbema, JDF
    Farace, E
    Marmarou, A
    Murray, GD
    Marshall, LF
    Maas, AIR
    [J]. JOURNAL OF NEUROTRAUMA, 2005, 22 (10) : 1025 - 1039
  • [8] Assessing the generalizability of prognostic information
    Justice, AC
    Covinsky, KE
    Berlin, JA
    [J]. ANNALS OF INTERNAL MEDICINE, 1999, 130 (06) : 515 - 524
  • [9] Advancing care for traumatic brain injury: findings from the IMPACT studies and perspectives on future research
    Maas, Andrew I. R.
    Murray, Gordon D.
    Roozenbeek, Bob
    Lingsma, Hester F.
    Butcher, Isabella
    McHugh, Gillian S.
    Weir, James
    Lu, Juan
    Steyerberg, Ewout W.
    [J]. LANCET NEUROLOGY, 2013, 12 (12) : 1200 - 1210
  • [10] Risk prediction models: II. External validation, model updating, and impact assessment
    Moons, Karel G. M.
    Kengne, Andre Pascal
    Grobbee, Diederick E.
    Royston, Patrick
    Vergouwe, Yvonne
    Altman, Douglas G.
    Woodward, Mark
    [J]. HEART, 2012, 98 (09) : 691 - 698