Simpson's paradox in the integrated discrimination improvement

被引:13
作者
Chipman, J. [1 ]
Braun, D. [2 ,3 ]
机构
[1] Vanderbilt Univ Sch Med, Dept Biostat, 2525 West End Ave,Suite 11000, Nashville, TN 37203 USA
[2] Harvard TH Chan Sch Publ Hlth, Dept Biostat, Boston, MA 02115 USA
[3] Dana Farber Canc Inst, Dept Biostat & Computat Biol, Boston, MA 02215 USA
基金
美国国家卫生研究院;
关键词
integrated discrimination improvement; simpson's paradox; risk prediction; reclassification; BRCAPRO; NET RECLASSIFICATION IMPROVEMENT; CARRIER PROBABILITY ESTIMATION; CLINICALLY RELEVANT MEASURES; PREDICTION MODELS; PERFORMANCE; MARKER; BRCA2; RISK; CALIBRATION; CAUTION;
D O I
10.1002/sim.6862
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The integrated discrimination improvement (IDI) is commonly used to compare two risk prediction models; it summarizes the extent a new model increases risk in events and decreases risk in non-events. The IDI averages risks across events and non-events and is therefore susceptible to Simpson's paradox. In some settings, adding a predictive covariate to a well calibrated model results in an overall negative (positive) IDI. However, if stratified by that same covariate, the strata-specific IDIs are positive (negative). Meanwhile, the calibration (observed to expected ratio and Hosmer-Lemeshow Goodness of Fit Test), area under the receiver operating characteristic curve, and Brier score improve overall and by stratum. We ran extensive simulations to investigate the impact of an imbalanced covariate upon metrics (IDI, area under the receiver operating characteristic curve, Brier score, and R-2), provide an analytic explanation for the paradox in the IDI, and use an investigative metric, a Weighted IDI, to better understand the paradox. In simulations, all instances of the paradox occurred under stratum-specific mis-calibration, yet there were mis-calibrated settings in which the paradox did not occur. The paradox is illustrated on Cancer Genomics Network data by calculating predictions based on two versions of BRCAPRO, a Mendelian risk prediction model for breast and ovarian cancer. In both simulations and the Cancer Genomics Network data, overall model calibration did not guarantee stratum-level calibration. We conclude that the IDI should only assess model performance among a clinically relevant subset when stratum-level calibration is strictly met and recommend calculating additional metrics to confirm the direction and conclusions of the IDI. Copyright (c) 2016 John Wiley & Sons, Ltd.
引用
收藏
页码:4468 / 4481
页数:14
相关论文
共 30 条
[1]   Reclassification of predictions for uncovering subgroup specific improvement [J].
Biswas, Swati ;
Arun, Banu ;
Parmigiani, Giovanni .
STATISTICS IN MEDICINE, 2014, 33 (11) :1914-1927
[2]   Tailoring BRCAPRO to Asian-Americans [J].
Chen, Sining ;
Blackford, Amanda L. ;
Parmigiani, Giovanni .
JOURNAL OF CLINICAL ONCOLOGY, 2009, 27 (04) :642-643
[3]   Clinically Relevant Measures of Fit? A Note of Caution [J].
Cook, Nancy R. .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 2012, 176 (06) :488-491
[4]   When is a new prediction marker useful?: A consideration of lipoprotein-associated phospholipase A2 and C-reactive protein for stroke risk [J].
Greenland, P ;
O'Malley, PG .
ARCHIVES OF INTERNAL MEDICINE, 2005, 165 (21) :2454-2456
[5]   EVALUATING THE YIELD OF MEDICAL TESTS [J].
HARRELL, FE ;
CALIFF, RM ;
PRYOR, DB ;
LEE, KL ;
ROSATI, RA .
JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 1982, 247 (18) :2543-2546
[6]   A note on the evaluation of novel biomarkers: do not rely on integrated discrimination improvement and net reclassification index [J].
Hilden, Jorgen ;
Gerds, Thomas A. .
STATISTICS IN MEDICINE, 2014, 33 (19) :3405-3414
[7]   Incorporating medical interventions into carrier probability estimation for genetic counseling [J].
Katki, Hormuzd A. .
BMC MEDICAL GENETICS, 2007, 8
[8]   Multiple diseases in carrier probability estimation: Accounting for surviving all cancers other than breast and ovary in BRCAPRO [J].
Katki, Hormuzd A. ;
Blackford, Amanda ;
Chen, Sining ;
Parmigiani, Giovanni .
STATISTICS IN MEDICINE, 2008, 27 (22) :4532-4548
[9]   Net reclassification improvement and integrated discrimination improvement require calibrated models: relevance from a marker and model perspective [J].
Leening, Maarten J. G. ;
Steyerberg, Ewout W. ;
Van Calster, Ben ;
D'Agostino, Ralph B., Sr. ;
Pencina, Michael J. .
STATISTICS IN MEDICINE, 2014, 33 (19) :3415-3418
[10]   Net Reclassification Improvement: Computation, Interpretation, and Controversies [J].
Leening, Maarten J. G. ;
Vedder, Moniek M. ;
Witteman, Jacqueline C. M. ;
Pencina, Michael J. ;
Steyerberg, Ewout W. .
ANNALS OF INTERNAL MEDICINE, 2014, 160 (02) :122-+