Assessing Performance and Clinical Usefulness in Prediction Models With Survival Outcomes: Practical Guidance for Cox Proportional Hazards Models

被引:62
作者
McLernon, David J. [1 ,9 ]
Giardiello, Daniele [2 ,3 ,4 ]
Van Calster, Ben [3 ,5 ]
Wynants, Laure [6 ]
van Geloven, Nan [3 ]
van Smeden, Maarten [7 ]
Therneau, Terry [8 ]
Steyerberg, Ewout W. [3 ]
STRATOS Initiative
机构
[1] Univ Aberdeen, Inst Appl Hlth Sci, Aberdeen, Scotland
[2] Netherlands Canc Inst, Amsterdam, Netherlands
[3] Leiden Univ Med Ctr, Dept Biomed Data Sci, Leiden, Netherlands
[4] Inst Univ Lubeck, Inst Biomed, Eurac Res, Bolzano, Italy
[5] Katholieke Univ Leuven, Dept Dev & Regenerat, Leuven, Belgium
[6] Maastricht Univ, Sch Publ Hlth & Primary Care, Maastricht, Netherlands
[7] Univ Utrecht, Univ Med Ctr Utrecht, Julius Ctr Hlth Sci & Primary Care, Utrecht, Netherlands
[8] Mayo Clin, Dept Quantitat Hlth Sci, Rochester, MN USA
[9] Univ Aberdeen, Inst Appl Hlth Sci, Polwarth Bldg, Aberdeen AB25 2ZD, Scotland
关键词
EXTERNAL VALIDATION; PROGNOSTIC MODELS; COMPETING RISKS; CANCER; CALIBRATION; REGRESSION; IMPACT;
D O I
10.7326/M22-0844
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Risk prediction models need thorough validation to assess their performance. Validation of models for survival outcomes poses challenges due to the censoring of observations and the varying time horizon at which predictions can be made. This article describes measures to evaluate predictions and the potential improvement in decision making from survival models based on Cox proportional hazards regression.As a motivating case study, the authors consider the prediction of the composite outcome of recurrence or death (the "event ") in patients with breast cancer after surgery. They developed a simple Cox regression model with 3 predictors, as in the Nottingham Prognostic Index, in 2982 women (1275 events over 5 years of follow-up) and externally validated this model in 686 women (285 events over 5 years). Improvement in performance was assessed after the addition of progesterone receptor as a prognostic biomarker.The model predictions can be evaluated across the full range of observed follow-up times or for the event occurring by the end of a fixed time horizon of interest. The authors first discuss recommended statistical measures that evaluate model performance in terms of discrimination, calibration, or overall performance. Further, they evaluate the potential clinical utility of the model to support clinical decision making according to a net benefit measure. They provide SAS and R code to illustrate internal and external validation.The authors recommend the proposed set of performance measures for transparent reporting of the validity of predictions from survival models.
引用
收藏
页码:105 / +
页数:11
相关论文
共 61 条
[1]  
Altman DG, 2000, STAT MED, V19, P453, DOI 10.1002/(SICI)1097-0258(20000229)19:4<453::AID-SIM350>3.3.CO
[2]  
2-X
[3]   Graphical calibration curves and the integrated calibration index (ICI) for survival models [J].
Austin, Peter C. ;
Harrell, Frank E., Jr. ;
van Klaveren, David .
STATISTICS IN MEDICINE, 2020, 39 (21) :2714-2742
[4]   A comparison of landmark methods and time-dependent ROC methods to evaluate the time-varying performance of prognostic markers for survival outcomes [J].
Aasthaa Bansal ;
Patrick J. Heagerty .
Diagnostic and Prognostic Research, 3 (1)
[5]   Pseudo-observations for competing risks with covariate dependent censoring [J].
Binder, Nadine ;
Gerds, Thomas A. ;
Andersen, Per Kragh .
LIFETIME DATA ANALYSIS, 2014, 20 (02) :303-315
[6]   Review and comparison of ROC curve estimators for a time-dependent outcome with marker-dependent censoring [J].
Blanche, Paul ;
Dartigues, Jean-Francois ;
Jacqmin-Gadda, Helene .
BIOMETRICAL JOURNAL, 2013, 55 (05) :687-704
[7]   How are missing data in covariates handled in observational time-to-event studies in oncology? A systematic review [J].
Carroll, Orlagh U. ;
Morris, Tim P. ;
Keogh, Ruth H. .
BMC MEDICAL RESEARCH METHODOLOGY, 2020, 20 (01)
[8]  
Collins GS, 2015, ANN INTERN MED, V162, P55, DOI [10.7326/M14-0697, 10.1136/bmj.g7594, 10.1016/j.jclinepi.2014.11.010, 10.1038/bjc.2014.639, 10.1002/bjs.9736, 10.1016/j.eururo.2014.11.025, 10.1186/s12916-014-0241-z, 10.7326/M14-0698]
[9]   Assessing calibration of prognostic risk scores [J].
Crowson, Cynthia S. ;
Atkinson, Elizabeth J. ;
Therneau, Terry M. .
STATISTICAL METHODS IN MEDICAL RESEARCH, 2016, 25 (04) :1692-1706
[10]  
Foekens JA, 2000, CANCER RES, V60, P636