Minimum required number of specimen records to develop accurate species distribution models

被引:535
作者
van Proosdij, Andre S. J. [1 ]
Sosef, Marc S. M. [1 ,4 ]
Wieringa, Jan J. [1 ]
Raes, Niels [2 ,3 ]
机构
[1] Wageningen Univ, Biosystemat Grp, Droevendaalsesteeg 1, NL-6708 PB Wageningen, Netherlands
[2] Naturalis Biodivers Ctr, Bot Sect, ASJvP, Darwinweg 2, NL-2333 CR Leiden, Netherlands
[3] Naturalis Biodivers Ctr, Bot Sect, JJW, Darwinweg 2, NL-2333 CR Leiden, Netherlands
[4] Bot Garden Meise, Nieuwelaan 38, BE-1860 Meise, Belgium
关键词
SAMPLE-SIZE; HERBARIUM COLLECTIONS; BIAS; PERFORMANCE; NICHE; DIVERSITY; RELIABILITY; PREDICTION; PRESENCES; SELECTION;
D O I
10.1111/ecog.01509
中图分类号
X176 [生物多样性保护];
学科分类号
090705 ;
摘要
Species distribution models (SDMs) are widely used to predict the occurrence of species. Because SDMs generally use presence-only data, validation of the predicted distribution and assessing model accuracy is challenging. Model performance depends on both sample size and species' prevalence, being the fraction of the study area occupied by the species. Here, we present a novel method using simulated species to identify the minimum number of records required to generate accurate SDMs for taxa of different pre-defined prevalence classes. We quantified model performance as a function of sample size and prevalence and found model performance to increase with increasing sample size under constant prevalence, and to decrease with increasing prevalence under constant sample size. The area under the curve (AUC) is commonly used as a measure of model performance. However, when applied to presence-only data it is prevalence-dependent and hence not an accurate performance index. Testing the AUC of an SDM for significant deviation from random performance provides a good alternative. We assessed the minimum number of records required to obtain good model performance for species of different prevalence classes in a virtual study area and in a real African study area. The lower limit depends on the species' prevalence with absolute minimum sample sizes as low as 3 for narrow-ranged and 13 for widespread species for our virtual study area which represents an ideal, balanced, orthogonal world. The lower limit of 3, however, is flawed by statistical artefacts related to modelling species with a prevalence below 0.1. In our African study area lower limits are higher, ranging from 14 for narrow-ranged to 25 for widespread species. We advocate identifying the minimum sample size for any species distribution modelling by applying the novel method presented here, which is applicable to any taxonomic clade or group, study area or climate scenario.
引用
收藏
页码:542 / 552
页数:11
相关论文
共 80 条
[31]   Effect of roadside bias on the accuracy of predictive maps produced by bioclimatic models [J].
Kadmon, R ;
Farber, O ;
Danin, A .
ECOLOGICAL APPLICATIONS, 2004, 14 (02) :401-413
[32]   Measuring and comparing the accuracy of species distribution models with presence-absence data [J].
Liu, Canran ;
White, Matt ;
Newell, Graeme .
ECOGRAPHY, 2011, 34 (02) :232-243
[33]   AUC:: a misleading measure of the performance of predictive distribution models [J].
Lobo, Jorge M. ;
Jimenez-Valverde, Alberto ;
Real, Raimundo .
GLOBAL ECOLOGY AND BIOGEOGRAPHY, 2008, 17 (02) :145-151
[34]   Exploring the effects of quantity and location of pseudo-absences and sampling biases on the performance of distribution models with limited point occurrence data [J].
Lobo, Jorge M. ;
Tognelli, Marcelo F. .
JOURNAL FOR NATURE CONSERVATION, 2011, 19 (01) :1-7
[35]   Predicting species distributions from herbarium collections:: does climate bias in collection sampling influence model outcomes? [J].
Loiselle, Bette A. ;
Jorgensen, Peter M. ;
Consiglio, Trisha ;
Jimenez, Ivan ;
Blake, John G. ;
Lohmann, Lucia G. ;
Montiel, Olga Martha .
JOURNAL OF BIOGEOGRAPHY, 2008, 35 (01) :105-116
[36]  
Lomolino M.V., 2010, Biogeography, VFourth
[37]  
Longino JT, 2002, ECOLOGY, V83, P689, DOI 10.1890/0012-9658(2002)083[0689:TAFOAT]2.0.CO
[38]  
2
[39]   Evaluating presence-absence models in ecology: the need to account for prevalence [J].
Manel, S ;
Williams, HC ;
Ormerod, SJ .
JOURNAL OF APPLIED ECOLOGY, 2001, 38 (05) :921-931
[40]   Effects of the number of presences on reliability and stability of MARS species distribution models: the importance of regional niche variation and ecological heterogeneity [J].
Mateo, Ruben G. ;
Felicisimo, Angel M. ;
Munoz, Jesus .
JOURNAL OF VEGETATION SCIENCE, 2010, 21 (05) :908-922