Tests of calibration and goodness-of-fit in the survival setting

Cited by: 218
Authors
Demler, Olga V. [1]
Paynter, Nina P. [1]
Cook, Nancy R. [1]
Affiliations
[1] Harvard Univ, Sch Med, Brigham & Womens Hosp, Div Prevent Med, East Boston, MA 02215 USA
Keywords
calibration; survival analysis; goodness-of-fit; CARDIOVASCULAR RISK; PERFORMANCE; PREDICTION; MODELS;
DOI
10.1002/sim.6428
Chinese Library Classification (CLC) number
Q [Biological Sciences];
Discipline classification codes
07 ; 0710 ; 09 ;
Abstract
To assess the calibration of a predictive model in a survival analysis setting, several authors have extended the Hosmer-Lemeshow goodness-of-fit test to survival data. Grønnesby and Borgan developed a test under the proportional hazards assumption, and Nam and D'Agostino developed a nonparametric test that is applicable in a more general survival setting for data with limited censoring. We analyze the performance of the two tests and show that the Grønnesby-Borgan test attains appropriate size in a variety of settings, whereas the Nam-D'Agostino method has a higher than nominal Type 1 error when there is more than trivial censoring. Both tests are sensitive to small cell sizes. We develop a modification of the Nam-D'Agostino test to allow for higher censoring rates. We show that this modified Nam-D'Agostino test has appropriate control of Type 1 error and comparable power to the Grønnesby-Borgan test and is applicable to settings other than proportional hazards. We also discuss the application to small cell sizes. Copyright (c) 2015 John Wiley & Sons, Ltd.
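The abstract does not state the test statistics themselves. As a rough illustration only, the sketch below computes a Hosmer-Lemeshow-style statistic in the form commonly attributed to the unmodified Nam-D'Agostino test: subjects are sorted by predicted risk and split into groups, and within each group the Kaplan-Meier estimate of failure by a fixed horizon is compared with the mean predicted risk. The function names, the grouping rule, and the G - 1 degrees of freedom are assumptions made for this example, not details taken from the record, and the modification proposed in the paper is not reproduced here.

```python
def km_failure_prob(times, events, horizon):
    """Kaplan-Meier estimate of P(event occurs by `horizon`).

    times  : follow-up time per subject
    events : 1 if the event was observed, 0 if censored
    """
    surv = 1.0
    # Walk through the distinct observed event times up to the horizon.
    event_times = sorted({t for t, e in zip(times, events) if e and t <= horizon})
    for t in event_times:
        at_risk = sum(1 for u in times if u >= t)
        failed = sum(1 for u, e in zip(times, events) if e and u == t)
        surv *= 1.0 - failed / at_risk
    return 1.0 - surv


def nam_dagostino_statistic(pred, times, events, horizon, n_groups=10):
    """Hypothetical helper: sum over groups of
    n_g * (KM_g - mean_pred_g)^2 / (mean_pred_g * (1 - mean_pred_g)).

    Assumes at least `n_groups` subjects and group mean predictions
    strictly between 0 and 1 (otherwise the denominator is zero).
    Returns (statistic, assumed degrees of freedom).
    """
    order = sorted(range(len(pred)), key=lambda i: pred[i])
    n = len(order)
    stat = 0.0
    for g in range(n_groups):
        idx = order[g * n // n_groups:(g + 1) * n // n_groups]
        n_g = len(idx)
        p_bar = sum(pred[i] for i in idx) / n_g
        km = km_failure_prob([times[i] for i in idx],
                             [events[i] for i in idx], horizon)
        stat += n_g * (km - p_bar) ** 2 / (p_bar * (1.0 - p_bar))
    return stat, n_groups - 1
```

The statistic would then be referred to a chi-square distribution with the returned degrees of freedom. Because the denominator ignores censoring, this form is plausible only when censoring is light, which is consistent with the abstract's finding that the unmodified test exceeds its nominal Type 1 error under more than trivial censoring.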
Pages: 1659-1680
Page count: 22