AUC:: a misleading measure of the performance of predictive distribution models

被引:2520
作者
Lobo, Jorge M. [1 ]
Jimenez-Valverde, Alberto [1 ]
Real, Raimundo [2 ]
机构
[1] CSIC, Museo Nacl Ciencias Nat, Dept Biodiversidad & Biol, E-28006 Madrid, Spain
[2] Univ Malaga, Fac Ciencias, Dept Anim Biol, Lab Biogeografia Diversidad & Consevac, E-29071 Malaga, Spain
来源
GLOBAL ECOLOGY AND BIOGEOGRAPHY | 2008年 / 17卷 / 02期
关键词
AUC; distribution models; ecological statistics; goodness-of-fit; model accuracy; ROC curve;
D O I
10.1111/j.1466-8238.2007.00358.x
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
The area under the receiver operating characteristic (ROC) curve, known as the AUC, is currently considered to be the standard method to assess the accuracy of predictive distribution models. It avoids the supposed subjectivity in the threshold selection process, when continuous probability derived scores are converted to a binary presence-absence variable, by summarizing overall model performance over all possible thresholds. In this manuscript we review some of the features of this measure and bring into question its reliability as a comparative measure of accuracy between model results. We do not recommend using AUC for five reasons: (1) it ignores the predicted probability values and the goodness-of-fit of the model; (2) it summarises the test performance over regions of the ROC space in which one would rarely operate; (3) it weights omission and commission errors equally; (4) it does not give information about the spatial distribution of model errors; and, most importantly, (5) the total extent to which models are carried out highly influences the rate of well-predicted absences and the AUC scores.
引用
收藏
页码:145 / 151
页数:7
相关论文
共 50 条
  • [1] Comparing classifiers when the misallocation costs are uncertain
    Adams, NM
    Hand, DJ
    [J]. PATTERN RECOGNITION, 1999, 32 (07) : 1139 - 1147
  • [2] Evaluating predictive models of species' distributions: criteria for selecting optimal models
    Anderson, RP
    Lew, D
    Peterson, AT
    [J]. ECOLOGICAL MODELLING, 2003, 162 (03) : 211 - 232
  • [3] From descriptive to predictive distribution models: A working example with Iberian amphibians and reptiles
    Arntzen J.W.
    [J]. Frontiers in Zoology, 3 (1)
  • [4] Species distribution models and ecological theory: A critical assessment and some possible new approaches
    Austin, Mike
    [J]. ECOLOGICAL MODELLING, 2007, 200 (1-2) : 1 - 19
  • [5] A proposed design and analysis for comparing digital and analog mammography: Special receiver operating characteristic methods for cancer screening
    Baker, SG
    Pinsky, PF
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (454) : 421 - 428
  • [6] The use of the area under the roc curve in the evaluation of machine learning algorithms
    Bradley, AP
    [J]. PATTERN RECOGNITION, 1997, 30 (07) : 1145 - 1159
  • [7] Presence-absence versus presence-only modelling methods for predicting bird habitat suitability
    Brotons, L
    Thuiller, W
    Araújo, MB
    Hirzel, AH
    [J]. ECOGRAPHY, 2004, 27 (04) : 437 - 448
  • [8] Cohen J, 1960, EDUC PSYCHOL MEAS, V41, P687
  • [9] Predictive performance of the binary logit model in unbalanced samples
    Cramer, JS
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES D-THE STATISTICIAN, 1999, 48 : 85 - 94
  • [10] Novel methods improve prediction of species' distributions from occurrence data
    Elith, J
    Graham, CH
    Anderson, RP
    Dudík, M
    Ferrier, S
    Guisan, A
    Hijmans, RJ
    Huettmann, F
    Leathwick, JR
    Lehmann, A
    Li, J
    Lohmann, LG
    Loiselle, BA
    Manion, G
    Moritz, C
    Nakamura, M
    Nakazawa, Y
    Overton, JM
    Peterson, AT
    Phillips, SJ
    Richardson, K
    Scachetti-Pereira, R
    Schapire, RE
    Soberón, J
    Williams, S
    Wisz, MS
    Zimmermann, NE
    [J]. ECOGRAPHY, 2006, 29 (02) : 129 - 151