Graphical calibration curves and the integrated calibration index (ICI) for competing risk models

被引：20

作者：

Austin, Peter C. ^{[1
,2
,3
]}

Putter, Hein ^{[4
]}

Giardiello, Daniele ^{[4
,5
,6
]}

van Klaveren, David ^{[7
,8
]}

机构：

[1] ICES, G106,2075 Bayview Ave, Toronto, ON M4N 3M5, Canada

[2] Univ Toronto, Inst Hlth Policy Management & Evaluat, Toronto, ON, Canada

[3] Sunnybrook Res Inst, Toronto, ON, Canada

[4] Leiden Univ, Med Ctr, Dept Biomed Data Sci, Leiden, Netherlands

[5] Antoni van Leeuwenhoek Hosp, Netherlands Canc Inst, Div Mol Pathol, Amsterdam, Netherlands

[6] Univ Lubeck, Inst Biomed, Eurac Res, Bolzano, Italy

[7] Erasmus MC, Dept Publ Hlth, Rotterdam, Netherlands

[8] Tufts Med Ctr, Inst Clin Res & Hlth Policy Studies, Predict Analyt & Comparat Effectiveness Ctr, Boston, MA USA

来源：

DIAGNOSTIC AND PROGNOSTIC RESEARCH | 2022年 / 6卷 / 01期

基金：

加拿大健康研究院;

关键词：

Calibration; Competing risks; Survival analysis; Time-to-event model; Model validation; Random forests; REGRESSION-MODELS; PROGNOSTIC MODELS; PREDICTION MODELS; SUBDISTRIBUTION; MACHINE;

D O I：

10.1186/s41512-021-00114-6

中图分类号：

R5 [内科学];

学科分类号：

1002 ; 100201 ;

摘要：

Background Assessing calibration-the agreement between estimated risk and observed proportions-is an important component of deriving and validating clinical prediction models. Methods for assessing the calibration of prognostic models for use with competing risk data have received little attention.Methods We propose a method for graphically assessing the calibration of competing risk regression models. Our proposed method can be used to assess the calibration of any model for estimating incidence in the presence of competing risk (e.g., a Fine-Gray subdistribution hazard model; a combination of cause-specific hazard functions; or a random survival forest). Our method is based on using the Fine-Gray subdistribution hazard model to regress the cumulative incidence function of the cause-specific outcome of interest on the predicted outcome risk of the model whose calibration we want to assess. We provide modifications of the integrated calibration index (ICI), of E50 and of E90, which are numerical calibration metrics, for use with competing risk data. We conducted a series of Monte Carlo simulations to evaluate the performance of these calibration measures when the underlying model has been correctly specified and when the model was mis-specified and when the incidence of the cause-specific outcome differed between the derivation and validation samples. We illustrated the usefulness of calibration curves and the numerical calibration metrics by comparing the calibration of a Fine-Gray subdistribution hazards regression model with that of random survival forests for predicting cardiovascular mortality in patients hospitalized with heart failure.Results The simulations indicated that the method for constructing graphical calibration curves and the associated calibration metrics performed as desired. We also demonstrated that the numerical calibration metrics can be used as optimization criteria when tuning machine learning methods for competing risk outcomes.Conclusions The calibration curves and numeric calibration metrics permit a comprehensive comparison of the calibration of different competing risk models.

引用

页数：22

共 28 条

[1] Predictive performance of machine and statistical learning methods: Impact of data-generating processes on external validity in the "large N, small p" setting
Austin, Peter C.
Harrell, Frank E., Jr.
Steyerberg, Ewout W.
[J]. STATISTICAL METHODS IN MEDICAL RESEARCH, 2021, 30 (06) : 1465 - 1483
[2] Fine-Gray subdistribution hazard models to simultaneously estimate the absolute risk of different event types: Cumulative total failure probability may exceed 1
Austin, Peter C.
Steyerberg, Ewout W.
Putter, Hein
[J]. STATISTICS IN MEDICINE, 2021, 40 (19) : 4200 - 4212
[3] Graphical calibration curves and the integrated calibration index (ICI) for survival models
Austin, Peter C.
Harrell, Frank E., Jr.
van Klaveren, David
[J]. STATISTICS IN MEDICINE, 2020, 39 (21) : 2714 - 2742
[4] The Integrated Calibration Index (ICI) and related metrics for quantifying the calibration of logistic regression models
Austin, Peter C.
Steyerberg, Ewout W.
[J]. STATISTICS IN MEDICINE, 2019, 38 (21) : 4051 - 4065
[5] Propensity-score matching with competing risks in survival analysis
Austin, Peter C.
Fine, Jason P.
[J]. STATISTICS IN MEDICINE, 2019, 38 (05) : 751 - 777
[6] The number of primary events per variable affects estimation of the subdistribution hazard competing risks model
Austin, Peter C.
Allignol, Arthur
Fine, Jason P.
[J]. JOURNAL OF CLINICAL EPIDEMIOLOGY, 2017, 83 : 75 - 84
[7] Introduction to the Analysis of Survival Data in the Presence of Competing Risks
Austin, Peter C.
Lee, Douglas S.
Fine, Jason P.
[J]. CIRCULATION, 2016, 133 (06) : 601 - 609
[8] Graphical assessment of internal and external calibration of logistic regression models by using loess smoothers
Austin, Peter C.
Steyerberg, Ewout W.
[J]. STATISTICS IN MEDICINE, 2014, 33 (03) : 517 - 535
[9] Beyersmann J., 2012, COMPETING RISKS MULT, DOI DOI 10.1007/978-1-4614-2035-4
[10] Clinical Research Machine Learning Compared With Conventional Statistical Models for Predicting Myocardial Infarction Readmission and Mortality: A Systematic Review
Cho, Sung Min
Austin, Peter C.
Ross, Heather J.
Abdel-Qadir, Husam
Chicco, Davide
Tomlinson, George
Taheri, Cameron
Foroutan, Farid
Lawler, Patrick R.
Billia, Filio
Gramolini, Anthony
Epelman, Slava
Wang, Bo
Lee, Douglas S.
[J]. CANADIAN JOURNAL OF CARDIOLOGY, 2021, 37 (08) : 1207 - 1214

← 1 2 3 →