On evaluating species distribution models with random background sites in place of absences when test presences disproportionately sample suitable habitat

被引:54
作者
Smith, Adam B. [1 ]
机构
[1] Missouri Bot Garden, Ctr Conservat & Sustainable Dev, St Louis, MO 63166 USA
关键词
AUC; background sites; biased data; model evaluation; species distribution models; CLIMATE-CHANGE; FUTURE; BIAS; RARE;
D O I
10.1111/ddi.12031
中图分类号
X176 [生物多样性保护];
学科分类号
090705 ;
摘要
Modelling the distribution of rare and invasive species often occurs in situations where reliable absences for evaluating model performance are unavailable. However, predictions at randomly located sites, or background' sites, can stand in for true absences. The maximum value of the area under the receiver operator characteristic curve, AUC, calculated with background sites is believed to be 1-a/2, where a is the typically unknown prevalence of the species on the landscape. Using a simple example of a species' range, I show how AUC can achieve values >1-a/2 when test presences do not represent each inhabited region of a species__ range in proportion to its area. Values of AUC that surpass 1-a/2 are associated with higher model predictions in areas overrepresented in the test data set, even if they are less environmentally suitable than other regions the species occupies. Pursuit of high AUC values can encourage inclusion of spurious predictors in the final model if they help to differentiate areas with disproportionate representation in the test data. Choices made during modelling to increase AUC calculated with background sites on the assumption that higher scores connote more accurate models can decrease actual accuracy when test presences disproportionately represent inhabited areas.
引用
收藏
页码:867 / 872
页数:6
相关论文
共 26 条
[1]   Predicting the future of species diversity: macroecological theory, climate change, and direct tests of alternative forecasting methods [J].
Algar, Adam C. ;
Kharouba, Heather M. ;
Young, Eric R. ;
Kerr, Jeremy T. .
ECOGRAPHY, 2009, 32 (01) :22-33
[2]   Assessing the accuracy of species distribution models: prevalence, kappa and the true skill statistic (TSS) [J].
Allouche, Omri ;
Tsoar, Asaf ;
Kadmon, Ronen .
JOURNAL OF APPLIED ECOLOGY, 2006, 43 (06) :1223-1232
[3]   Monitoring habitat dynamics for rare and endangered species using satellite images and niche-based models [J].
Bartel, Rebecca A. ;
Sexton, Joseph O. .
ECOGRAPHY, 2009, 32 (05) :888-896
[4]   Different climatic envelopes among invasive populations may lead to underestimations of current and future biological invasions [J].
Beaumont, Linda J. ;
Gallagher, Rachael V. ;
Thuiller, Wilfried ;
Downey, Paul O. ;
Leishman, Michelle R. ;
Hughes, Lesley .
DIVERSITY AND DISTRIBUTIONS, 2009, 15 (03) :409-420
[5]   Optimizing resiliency of reserve networks to climate change: multispecies conservation planning in the Pacific Northwest, USA [J].
Carroll, Carlos ;
Dunk, Jeffrey R. ;
Moilanen, Atte .
GLOBAL CHANGE BIOLOGY, 2010, 16 (03) :891-904
[6]   The art of modelling range-shifting species [J].
Elith, Jane ;
Kearney, Michael ;
Phillips, Steven .
METHODS IN ECOLOGY AND EVOLUTION, 2010, 1 (04) :330-342
[7]   A review of methods for the assessment of prediction errors in conservation presence/absence models [J].
Fielding, AH ;
Bell, JF .
ENVIRONMENTAL CONSERVATION, 1997, 24 (01) :38-49
[8]   Predicting habitat suitability for rare plants at local spatial scales using a species distribution model [J].
Gogol-Prokurat, Melanie .
ECOLOGICAL APPLICATIONS, 2011, 21 (01) :33-47
[9]   Predictive habitat distribution models in ecology [J].
Guisan, A ;
Zimmermann, NE .
ECOLOGICAL MODELLING, 2000, 135 (2-3) :147-186
[10]   Insights into the area under the receiver operating characteristic curve (AUC) as a discrimination measure in species distribution modelling [J].
Jimenez-Valverde, Alberto .
GLOBAL ECOLOGY AND BIOGEOGRAPHY, 2012, 21 (04) :498-507