Without quality presence-absence data, discrimination metrics such as TSS can be misleading measures of model performance

被引:292
作者
Leroy, Boris [1 ]
Delsol, Robin [1 ,2 ]
Hugueny, Bernard [3 ]
Meynard, Christine N. [4 ]
Barhoumi, Cheima [1 ,2 ,5 ]
Barbet-Massin, Morgane [2 ]
Bellard, Celine [1 ,6 ]
机构
[1] Univ Antilles, Unite Biol Organismes & Ecosyst Aquat BOREA, Univ Caen Normandie,IRD, UMR 7208,Museum Natl Hist Nat,Sorbonne Univ,CNRS, Paris, France
[2] Univ Paris 11, CNRS, UMR 8079, Ecol Systemat & Evolut, Orsay, France
[3] Univ Toulouse Midi Pyrenees, Lab Evolut & Diversite Biol EDB, CNRS, UMR 5174,IRD,UPS, Toulouse 9, France
[4] Univ Montpellier, Montpellier SupAgro, IRD, CBGP,INRA,CIRAD, Montpellier, France
[5] Univ Montpellier, Inst Sci Evolut Montpellier, CNRS, UMR 5554, Montpellier, France
[6] UCL, Ctr Biodivers & Environm Res, Dept Genet Evolut & Environm, London, England
关键词
AUC; ecological niche models; model evaluation; prevalence; species distribution models; SPECIES DISTRIBUTION MODELS; CLIMATE-CHANGE; PREVALENCE; SELECTION; ACCURACY; THRESHOLDS; VALIDATION; AREA; BIAS; AUC;
D O I
10.1111/jbi.13402
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
The discriminating capacity (i.e. ability to correctly classify presences and absences) of species distribution models (SDMs) is commonly evaluated with metrics such as the area under the receiving operating characteristic curve (AUC), the Kappa statistic and the true skill statistic (TSS). AUC and Kappa have been repeatedly criticized, but TSS has fared relatively well since its introduction, mainly because it has been considered as independent of prevalence. In addition, discrimination metrics have been contested because they should be calculated on presence-absence data, but are often used on presence-only or presence-background data. Here, we investigate TSS and an alternative set of metricssimilarity indices, also known as F-measures. We first show that even in ideal conditions (i.e. perfectly random presence-absence sampling), TSS can be misleading because of its dependence on prevalence, whereas similarity/F-measures provide adequate estimations of model discrimination capacity. Second, we show that in real-world situations where sample prevalence is different from true species prevalence (i.e. biased sampling or presence-pseudoabsence), no discrimination capacity metric provides adequate estimation of model discrimination capacity, including metrics specifically designed for modelling with presence-pseudoabsence data. Our conclusions are twofold. First, they unequivocally impel SDM users to understand the potential shortcomings of discrimination metrics when quality presence-absence data are lacking, and we recommend obtaining such data. Second, in the specific case of virtual species, which are increasingly used to develop and test SDM methodologies, we strongly recommend the use of similarity/F-measures, which were not biased by prevalence, contrary to TSS.
引用
收藏
页码:1994 / 2002
页数:9
相关论文
共 44 条
[1]   Assessing the accuracy of species distribution models: prevalence, kappa and the true skill statistic (TSS) [J].
Allouche, Omri ;
Tsoar, Asaf ;
Kadmon, Ronen .
JOURNAL OF APPLIED ECOLOGY, 2006, 43 (06) :1223-1232
[2]  
[Anonymous], 1908, Bull. Soc. Vaud. Sci. Nat.
[3]   Validation of species-climate impact models under climate change [J].
Araújo, MB ;
Pearson, RG ;
Thuiller, W ;
Erhard, M .
GLOBAL CHANGE BIOLOGY, 2005, 11 (09) :1504-1513
[4]   The crucial role of the accessible area in ecological niche modeling and species distribution modeling [J].
Barve, Narayani ;
Barve, Vijay ;
Jimenez-Valverde, Alberto ;
Lira-Noriega, Andres ;
Maher, Sean P. ;
Peterson, A. Townsend ;
Soberon, Jorge ;
Villalobos, Fabricio .
ECOLOGICAL MODELLING, 2011, 222 (11) :1810-1819
[5]   Will climate change promote future invasions? [J].
Bellard, Celine ;
Thuiller, Wilfried ;
Leroy, Boris ;
Genovesi, Piero ;
Bakkenes, Michel ;
Courchamp, Franck .
GLOBAL CHANGE BIOLOGY, 2013, 19 (12) :3740-3748
[6]   Evaluating resource selection functions [J].
Boyce, MS ;
Vernier, PR ;
Nielsen, SE ;
Schmiegelow, FKA .
ECOLOGICAL MODELLING, 2002, 157 (2-3) :281-300
[7]   Predicting richness and composition in mountain insect communities at high resolution: a new test of the SESAM framework [J].
D'Amen, Manuela ;
Pradervand, Jean-Nicolas ;
Guisan, Antoine .
GLOBAL ECOLOGY AND BIOGEOGRAPHY, 2015, 24 (12) :1443-1453
[8]   ecospat: an R package to support spatial analyses and modeling of species niches and distributions [J].
Di Cola, Valeria ;
Broennimann, Olivier ;
Petitpierre, Blaise ;
Breiner, Frank T. ;
D'Amen, Manuela ;
Randin, Christophe ;
Engler, Robin ;
Pottier, Julien ;
Pio, Dorothea ;
Dubuis, Anne ;
Pellissier, Loic ;
Mateo, Ruben G. ;
Hordijk, Wim ;
Salamin, Nicolas ;
Guisan, Antoine .
ECOGRAPHY, 2017, 40 (06) :774-787
[9]   A review of methods for the assessment of prediction errors in conservation presence/absence models [J].
Fielding, AH ;
Bell, JF .
ENVIRONMENTAL CONSERVATION, 1997, 24 (01) :38-49
[10]   Twenty years of observed and predicted changes in subtidal red seaweed assemblages along a biogeographical transition zone: inferring potential causes from environmental data [J].
Gallon, Regis K. ;
Robuchon, Marine ;
Leroy, Boris ;
Le Gall, Line ;
Valero, Myriam ;
Feunteun, Eric .
JOURNAL OF BIOGEOGRAPHY, 2014, 41 (12) :2293-2306