ROC and AUC with a Binary Predictor: a Potentially Misleading Metric

被引:173
作者
Muschelli, John, III [1 ]
机构
[1] Johns Hopkins Bloomberg Sch Publ Hlth, Dept Biostat, Baltimore, MD 21205 USA
关键词
ROC; AUC; Area under the curve; R; AREA;
D O I
10.1007/s00357-019-09345-1
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
In analysis of binary outcomes, the receiver operator characteristic (ROC) curve is heavily used to show the performance of a model or algorithm. The ROC curve is informative about the performance over a series of thresholds and can be summarized by the area under the curve (AUC), a single number. When a predictor is categorical, the ROC curve has one less than number of categories as potential thresholds; when the predictor is binary, there is only one threshold. As the AUC may be used in decision-making processes on determining the best model, it important to discuss how it agrees with the intuition from the ROC curve. We discuss how the interpolation of the curve between thresholds with binary predictors can largely change the AUC. Overall, we show using a linear interpolation from the ROC curve with binary predictors corresponds to the estimated AUC, which is most commonly done in software, which we believe can lead to misleading results. We compare R, Python, Stata, and SAS software implementations. We recommend using reporting the interpolation used and discuss the merit of using the step function interpolator, also referred to as the "pessimistic" approach by Fawcett (2006).
引用
收藏
页码:696 / 708
页数:13
相关论文
共 25 条
[1]  
Allaire J.J., 2018, reticulate: Interface toPython
[2]  
[Anonymous], 2013, REL 13 STAT SOFTW
[3]   AREA ABOVE ORDINAL DOMINANCE GRAPH AND AREA BELOW RECEIVER OPERATING CHARACTERISTIC GRAPH [J].
BAMBER, D .
JOURNAL OF MATHEMATICAL PSYCHOLOGY, 1975, 12 (04) :387-415
[4]   Technology and the Glaucoma Suspect [J].
Blumberg, Dana M. ;
De Moraes, Carlos Gustavo ;
Liebmann, Jeffrey M. ;
Garg, Reena ;
Chen, Cynthia ;
Theventhiran, Alex ;
Hood, Donald C. .
INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2016, 57 (09) :OCT80-OCT85
[5]   Factors associated with significant MRI findings in medical walk-in patients with acute headache [J].
Budweg, Joris ;
Sprenger, Till ;
De Vere-Tyndall, Anthony ;
Hagenkord, Anne ;
Stippich, Christoph ;
Berger, Christoph T. .
SWISS MEDICAL WEEKLY, 2016, 146
[6]   COMPARING THE AREAS UNDER 2 OR MORE CORRELATED RECEIVER OPERATING CHARACTERISTIC CURVES - A NONPARAMETRIC APPROACH [J].
DELONG, ER ;
DELONG, DM ;
CLARKEPEARSON, DI .
BIOMETRICS, 1988, 44 (03) :837-845
[7]   An introduction to ROC analysis [J].
Fawcett, Tom .
PATTERN RECOGNITION LETTERS, 2006, 27 (08) :861-874
[8]   Value of scar imaging and inotropic reserve combination for the prediction of segmental and global left ventricular functional recovery after revascularisation [J].
Glaveckaite, Sigita ;
Valeviciene, Nomeda ;
Palionis, Darius ;
Skorniakov, Viktor ;
Celutkiene, Jelena ;
Tamosiunas, Algirdas ;
Uzdavinys, Giedrius ;
Laucevicius, Aleksandras .
JOURNAL OF CARDIOVASCULAR MAGNETIC RESONANCE, 2011, 13
[9]   THE MEANING AND USE OF THE AREA UNDER A RECEIVER OPERATING CHARACTERISTIC (ROC) CURVE [J].
HANLEY, JA ;
MCNEIL, BJ .
RADIOLOGY, 1982, 143 (01) :29-36
[10]  
HSU YC, 2014, INFERENCE ROC UNPUB