Graphical calibration curves and the integrated calibration index (ICI) for competing risk models

被引:20
作者
Austin, Peter C. [1 ,2 ,3 ]
Putter, Hein [4 ]
Giardiello, Daniele [4 ,5 ,6 ]
van Klaveren, David [7 ,8 ]
机构
[1] ICES, G106,2075 Bayview Ave, Toronto, ON M4N 3M5, Canada
[2] Univ Toronto, Inst Hlth Policy Management & Evaluat, Toronto, ON, Canada
[3] Sunnybrook Res Inst, Toronto, ON, Canada
[4] Leiden Univ, Med Ctr, Dept Biomed Data Sci, Leiden, Netherlands
[5] Antoni van Leeuwenhoek Hosp, Netherlands Canc Inst, Div Mol Pathol, Amsterdam, Netherlands
[6] Univ Lubeck, Inst Biomed, Eurac Res, Bolzano, Italy
[7] Erasmus MC, Dept Publ Hlth, Rotterdam, Netherlands
[8] Tufts Med Ctr, Inst Clin Res & Hlth Policy Studies, Predict Analyt & Comparat Effectiveness Ctr, Boston, MA USA
基金
加拿大健康研究院;
关键词
Calibration; Competing risks; Survival analysis; Time-to-event model; Model validation; Random forests; REGRESSION-MODELS; PROGNOSTIC MODELS; PREDICTION MODELS; SUBDISTRIBUTION; MACHINE;
D O I
10.1186/s41512-021-00114-6
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background Assessing calibration-the agreement between estimated risk and observed proportions-is an important component of deriving and validating clinical prediction models. Methods for assessing the calibration of prognostic models for use with competing risk data have received little attention.Methods We propose a method for graphically assessing the calibration of competing risk regression models. Our proposed method can be used to assess the calibration of any model for estimating incidence in the presence of competing risk (e.g., a Fine-Gray subdistribution hazard model; a combination of cause-specific hazard functions; or a random survival forest). Our method is based on using the Fine-Gray subdistribution hazard model to regress the cumulative incidence function of the cause-specific outcome of interest on the predicted outcome risk of the model whose calibration we want to assess. We provide modifications of the integrated calibration index (ICI), of E50 and of E90, which are numerical calibration metrics, for use with competing risk data. We conducted a series of Monte Carlo simulations to evaluate the performance of these calibration measures when the underlying model has been correctly specified and when the model was mis-specified and when the incidence of the cause-specific outcome differed between the derivation and validation samples. We illustrated the usefulness of calibration curves and the numerical calibration metrics by comparing the calibration of a Fine-Gray subdistribution hazards regression model with that of random survival forests for predicting cardiovascular mortality in patients hospitalized with heart failure.Results The simulations indicated that the method for constructing graphical calibration curves and the associated calibration metrics performed as desired. We also demonstrated that the numerical calibration metrics can be used as optimization criteria when tuning machine learning methods for competing risk outcomes.Conclusions The calibration curves and numeric calibration metrics permit a comprehensive comparison of the calibration of different competing risk models.
引用
收藏
页数:22
相关论文
共 28 条
  • [1] Predictive performance of machine and statistical learning methods: Impact of data-generating processes on external validity in the "large N, small p" setting
    Austin, Peter C.
    Harrell, Frank E., Jr.
    Steyerberg, Ewout W.
    [J]. STATISTICAL METHODS IN MEDICAL RESEARCH, 2021, 30 (06) : 1465 - 1483
  • [2] Fine-Gray subdistribution hazard models to simultaneously estimate the absolute risk of different event types: Cumulative total failure probability may exceed 1
    Austin, Peter C.
    Steyerberg, Ewout W.
    Putter, Hein
    [J]. STATISTICS IN MEDICINE, 2021, 40 (19) : 4200 - 4212
  • [3] Graphical calibration curves and the integrated calibration index (ICI) for survival models
    Austin, Peter C.
    Harrell, Frank E., Jr.
    van Klaveren, David
    [J]. STATISTICS IN MEDICINE, 2020, 39 (21) : 2714 - 2742
  • [4] The Integrated Calibration Index (ICI) and related metrics for quantifying the calibration of logistic regression models
    Austin, Peter C.
    Steyerberg, Ewout W.
    [J]. STATISTICS IN MEDICINE, 2019, 38 (21) : 4051 - 4065
  • [5] Propensity-score matching with competing risks in survival analysis
    Austin, Peter C.
    Fine, Jason P.
    [J]. STATISTICS IN MEDICINE, 2019, 38 (05) : 751 - 777
  • [6] The number of primary events per variable affects estimation of the subdistribution hazard competing risks model
    Austin, Peter C.
    Allignol, Arthur
    Fine, Jason P.
    [J]. JOURNAL OF CLINICAL EPIDEMIOLOGY, 2017, 83 : 75 - 84
  • [7] Introduction to the Analysis of Survival Data in the Presence of Competing Risks
    Austin, Peter C.
    Lee, Douglas S.
    Fine, Jason P.
    [J]. CIRCULATION, 2016, 133 (06) : 601 - 609
  • [8] Graphical assessment of internal and external calibration of logistic regression models by using loess smoothers
    Austin, Peter C.
    Steyerberg, Ewout W.
    [J]. STATISTICS IN MEDICINE, 2014, 33 (03) : 517 - 535
  • [9] Beyersmann J., 2012, COMPETING RISKS MULT, DOI DOI 10.1007/978-1-4614-2035-4
  • [10] Clinical Research Machine Learning Compared With Conventional Statistical Models for Predicting Myocardial Infarction Readmission and Mortality: A Systematic Review
    Cho, Sung Min
    Austin, Peter C.
    Ross, Heather J.
    Abdel-Qadir, Husam
    Chicco, Davide
    Tomlinson, George
    Taheri, Cameron
    Foroutan, Farid
    Lawler, Patrick R.
    Billia, Filio
    Gramolini, Anthony
    Epelman, Slava
    Wang, Bo
    Lee, Douglas S.
    [J]. CANADIAN JOURNAL OF CARDIOLOGY, 2021, 37 (08) : 1207 - 1214