A practical perspective on the concordance index for the evaluation and selection of prognostic time-to-event models

Cited by: 141
Authors
Longato, Enrico [1]
Vettoretti, Martina [1]
Di Camillo, Barbara [1]
Affiliations
[1] Univ Padua, Dept Informat Engn, I-35131 Padua, Italy
Keywords
Concordance index; Predictive modelling; Survival analysis; Time-to-event models; Predictive analytics; Survival; Care; Risk
DOI
10.1016/j.jbi.2020.103496
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Developing a prognostic model for biomedical applications typically requires mapping an individual's set of covariates to a measure of the risk that he or she may experience the event to be predicted. Many scenarios, however, especially those involving adverse pathological outcomes, are better described by explicitly accounting for the timing of these events, as well as their probability. As a result, in these cases, traditional classification or ranking metrics may be inadequate to inform model evaluation or selection. To address this limitation, it is common practice to reframe the problem in the context of survival analysis, and resort, instead, to the concordance index (C-index), which summarises how well a predicted risk score describes an observed sequence of events. A practically meaningful interpretation of the C-index, however, may present several difficulties and pitfalls. Specifically, we identify two main issues: i) the C-index remains implicitly, and subtly, dependent on time, and ii) its relationship with the number of subjects whose risk was incorrectly predicted is not straightforward. Failure to consider these two aspects may introduce undesirable and unwanted biases in the evaluation process, and even result in the selection of a suboptimal model. Hence, here, we discuss ways to obtain a meaningful interpretation in spite of these difficulties. Aiming to assist experimenters regardless of their familiarity with the C-index, we start from an introductory-level presentation of its most popular estimator, highlighting the latter's temporal dependency, and suggesting how it might be correctly used to inform model selection. We also address the nonlinearity of the C-index with respect to the number of correct risk predictions, elaborating a simplified framework that may enable an easier interpretation and quantification of C-index improvements or deteriorations.
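
The abstract does not name the "most popular estimator" of the C-index. As a minimal sketch, assuming it refers to Harrell's C, the following Python function counts, among all comparable pairs of subjects, the fraction in which the subject with the shorter observed time also received the higher predicted risk. The function name and the handling of tied event times (skipped here for simplicity) are illustrative assumptions, not taken from the paper.

```python
from itertools import combinations


def harrell_c_index(times, events, risks):
    """Fraction of comparable pairs that the risk score orders correctly.

    times  -- observed follow-up times
    events -- 1 if the event was observed, 0 if the subject was censored
    risks  -- predicted risk scores (higher means earlier expected event)
    """
    concordant = 0.0
    comparable = 0
    for i, j in combinations(range(len(times)), 2):
        # Order the pair so that `a` is the subject with the shorter time.
        a, b = (i, j) if times[i] <= times[j] else (j, i)
        if times[a] == times[b]:
            continue  # simplifying assumption: pairs with tied times are skipped
        if not events[a]:
            continue  # earlier subject censored, so the true ordering is unknown
        comparable += 1
        if risks[a] > risks[b]:
            concordant += 1.0
        elif risks[a] == risks[b]:
            concordant += 0.5  # tied risk scores count as half-concordant
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable


# Toy data: four subjects, the third one censored at t = 6.
c = harrell_c_index(
    times=[2, 4, 6, 8],
    events=[1, 1, 0, 1],
    risks=[0.9, 0.3, 0.5, 0.1],
)
```

In this toy data, five pairs are comparable and four of them are ordered correctly, so `c` is 0.8. Which pairs count as comparable depends on the follow-up and censoring times, which is one way the estimate stays implicitly tied to time.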
Pages: 9