Effects of sample size on the performance of species distribution models

被引:2006
作者
Wisz, M. S. [1 ]
Hijmans, R. J. [2 ]
Li, J. [3 ]
Peterson, A. T. [4 ,5 ]
Graham, C. H. [6 ]
Guisan, A. [7 ]
机构
[1] Univ Aarhus, Dept Arctic Environm, Natl Environm Res Inst, Roskilde, Denmark
[2] Int Rice Res Inst, Los Banos, Laguna, Philippines
[3] Dept Marine & Coastal Environm, Canberra, ACT, Australia
[4] Univ Kansas, Museum Nat Hist, Lawrence, KS 66045 USA
[5] Univ Kansas, Biodivers Res Ctr, Lawrence, KS 66045 USA
[6] SUNY Stony Brook, Dept Ecol & Evolut, Stony Brook, NY 11794 USA
[7] Univ Lausanne, Dept Ecol & Evolut, Lausanne, Switzerland
关键词
ecological niche model; MAXENT; model comparison; OM-GARP; sample size; species distribution model;
D O I
10.1111/j.1472-4642.2008.00482.x
中图分类号
X176 [生物多样性保护];
学科分类号
090705 ;
摘要
A wide range of modelling algorithms is used by ecologists, conservation practitioners, and others to predict species ranges from point locality data. Unfortunately, the amount of data available is limited for many taxa and regions, making it essential to quantify the sensitivity of these algorithms to sample size. This is the first study to address this need by rigorously evaluating a broad suite of algorithms with independent presence-absence data from multiple species and regions. We evaluated predictions from 12 algorithms for 46 species (from six different regions of the world) at three sample sizes (100, 30, and 10 records). We used data from natural history collections to run the models, and evaluated the quality of model predictions with area under the receiver operating characteristic curve (AUC). With decreasing sample size, model accuracy decreased and variability increased across species and between models. Novel modelling methods that incorporate both interactions between predictor variables and complex response shapes (i.e. GBM, MARS-INT, BRUTO) performed better than most methods at large sample sizes but not at the smallest sample sizes. Other algorithms were much less sensitive to sample size, including an algorithm based on maximum entropy (MAXENT) that had among the best predictive power across all sample sizes. Relative to other algorithms, a distance metric algorithm (DOMAIN) and a genetic algorithm (OM-GARP) had intermediate performance at the largest sample size and among the best performance at the lowest sample size. No algorithm predicted consistently well with small sample size (n < 30) and this should encourage highly conservative use of predictions based on small sample size and restrict their use to exploratory modelling.
引用
收藏
页码:763 / 773
页数:11
相关论文
共 49 条
[1]   Geographical distributions of spiny pocket mice in South America:: insights from predictive models [J].
Anderson, RP ;
Gómez-Laverde, M ;
Peterson, AT .
GLOBAL ECOLOGY AND BIOGEOGRAPHY, 2002, 11 (02) :131-141
[2]   Validation of species-climate impact models under climate change [J].
Araújo, MB ;
Pearson, RG ;
Thuiller, W ;
Erhard, M .
GLOBAL CHANGE BIOLOGY, 2005, 11 (09) :1504-1513
[3]   Evaluation of statistical models used for predicting plant species distributions: Role of artificial data and theory [J].
Austin, M. P. ;
Belbin, L. ;
Meyers, J. A. ;
Doherty, M. D. ;
Luoto, M. .
ECOLOGICAL MODELLING, 2006, 199 (02) :197-216
[4]   Spatial prediction of species distribution: an interface between ecological theory and statistical modelling [J].
Austin, MP .
ECOLOGICAL MODELLING, 2002, 157 (2-3) :101-118
[5]   Predicting distributional change, with application to bird distributions in northeast Scotland [J].
Buckland, ST ;
Elston, DA ;
Beaney, SJ .
GLOBAL ECOLOGY AND BIOGEOGRAPHY LETTERS, 1996, 5 (02) :66-84
[6]  
Busby J.R., 1991, NATURE CONSERVATION, P64, DOI [DOI 10.1046/J.1365-294X.2001.01244.X, DOI 10.1590/2175-7860201869437]
[7]   DOMAIN - A FLEXIBLE MODELING PROCEDURE FOR MAPPING POTENTIAL DISTRIBUTIONS OF PLANTS AND ANIMALS [J].
CARPENTER, G ;
GILLISON, AN ;
WINTER, J .
BIODIVERSITY AND CONSERVATION, 1993, 2 (06) :667-680
[8]   The effects of scale and sample size on the accuracy of spatial predictions of tiger beetle (Cicindelidae) species richness [J].
Carroll, SS ;
Pearson, DL .
ECOGRAPHY, 1998, 21 (04) :401-414
[9]  
Chambers J., 1983, GRAPHICAL METHODS DA
[10]  
Crawley M.J., 2002, STAT COMPUTING INTRO