ROC and AUC with a Binary Predictor: a Potentially Misleading Metric

被引:146
作者
Muschelli, John, III [1 ]
机构
[1] Johns Hopkins Bloomberg Sch Publ Hlth, Dept Biostat, Baltimore, MD 21205 USA
关键词
ROC; AUC; Area under the curve; R; AREA;
D O I
10.1007/s00357-019-09345-1
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
In analysis of binary outcomes, the receiver operator characteristic (ROC) curve is heavily used to show the performance of a model or algorithm. The ROC curve is informative about the performance over a series of thresholds and can be summarized by the area under the curve (AUC), a single number. When a predictor is categorical, the ROC curve has one less than number of categories as potential thresholds; when the predictor is binary, there is only one threshold. As the AUC may be used in decision-making processes on determining the best model, it important to discuss how it agrees with the intuition from the ROC curve. We discuss how the interpolation of the curve between thresholds with binary predictors can largely change the AUC. Overall, we show using a linear interpolation from the ROC curve with binary predictors corresponds to the estimated AUC, which is most commonly done in software, which we believe can lead to misleading results. We compare R, Python, Stata, and SAS software implementations. We recommend using reporting the interpolation used and discuss the merit of using the step function interpolator, also referred to as the "pessimistic" approach by Fawcett (2006).
引用
收藏
页码:696 / 708
页数:13
相关论文
共 26 条
  • [1] Allaire J.J., 2018, RETICULATE INTERFACE
  • [2] [Anonymous], 2017, S A S VERS S T A T 9
  • [3] [Anonymous], 2013, REL 13 STAT SOFTW
  • [4] AREA ABOVE ORDINAL DOMINANCE GRAPH AND AREA BELOW RECEIVER OPERATING CHARACTERISTIC GRAPH
    BAMBER, D
    [J]. JOURNAL OF MATHEMATICAL PSYCHOLOGY, 1975, 12 (04) : 387 - 415
  • [5] Technology and the Glaucoma Suspect
    Blumberg, Dana M.
    De Moraes, Carlos Gustavo
    Liebmann, Jeffrey M.
    Garg, Reena
    Chen, Cynthia
    Theventhiran, Alex
    Hood, Donald C.
    [J]. INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2016, 57 (09) : OCT80 - OCT85
  • [6] Factors associated with significant MRI findings in medical walk-in patients with acute headache
    Budweg, Joris
    Sprenger, Till
    De Vere-Tyndall, Anthony
    Hagenkord, Anne
    Stippich, Christoph
    Berger, Christoph T.
    [J]. SWISS MEDICAL WEEKLY, 2016, 146
  • [7] COMPARING THE AREAS UNDER 2 OR MORE CORRELATED RECEIVER OPERATING CHARACTERISTIC CURVES - A NONPARAMETRIC APPROACH
    DELONG, ER
    DELONG, DM
    CLARKEPEARSON, DI
    [J]. BIOMETRICS, 1988, 44 (03) : 837 - 845
  • [8] An introduction to ROC analysis
    Fawcett, Tom
    [J]. PATTERN RECOGNITION LETTERS, 2006, 27 (08) : 861 - 874
  • [9] Value of scar imaging and inotropic reserve combination for the prediction of segmental and global left ventricular functional recovery after revascularisation
    Glaveckaite, Sigita
    Valeviciene, Nomeda
    Palionis, Darius
    Skorniakov, Viktor
    Celutkiene, Jelena
    Tamosiunas, Algirdas
    Uzdavinys, Giedrius
    Laucevicius, Aleksandras
    [J]. JOURNAL OF CARDIOVASCULAR MAGNETIC RESONANCE, 2011, 13
  • [10] THE MEANING AND USE OF THE AREA UNDER A RECEIVER OPERATING CHARACTERISTIC (ROC) CURVE
    HANLEY, JA
    MCNEIL, BJ
    [J]. RADIOLOGY, 1982, 143 (01) : 29 - 36