Background sampling for multi-scale ensemble habitat selection modeling: Does the number of points matter?

被引:29
作者
Hysen, Logan [1 ]
Nayeri, Danial [1 ]
Cushman, Samuel [2 ]
Wan, Ho Yi [1 ]
机构
[1] Calif State Polytech Univ Humboldt, Dept Wildlife, 1 Harpst St, Arcata, CA 95521 USA
[2] USDA, Rocky Mt Res Stn, Flagstaff, AZ USA
关键词
Biomod; Ecological niche; Habitat suitability; Presence-absence; Species distribution model; Presence-only; SPECIES DISTRIBUTION MODELS; PSEUDO-ABSENCES; PREDICTION; FOREST; RISK; PERFORMANCE; THRESHOLDS; ACCURACY;
D O I
10.1016/j.ecoinf.2022.101914
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
Ensemble habitat selection modeling is becoming a popular approach among ecologists to answer different questions. Since we are still in the early stages of development and application of ensemble modeling, there remain many questions regarding performance and parameterization. One important gap, which this paper addresses, is how the number of background points used to train models influences the performance of the ensemble model. We used an empirical presence-only dataset and three different selections of background points to train scale-optimized habitat selection models using six modeling algorithms (GLM, GAM, MARS, ANN, Random Forest, and MaxEnt). We tested four ensemble models using different combinations of the component models: (a) equal numbers of background points and presences, (b) background points equaled ten times the number of presences, (c) 10,000 background points, and (d) optimized background points for each component model. Among regression-based approaches, MARS performed best when built with 10,000 background points. Among machine learning models, RF performed the best when built with equal presences and background points. Among the four ensemble models, AUC indicated that the best performing model was the ensemble with each component model including the optimized number of background points, while TSS increased as the number of background points models increased. We found that an ensemble of models, each trained with an optimal number of background points, outperformed ensembles of models trained with the same number of background points, although differences in performance were slight. When using a single modeling method, RF with equal number of presences and background points can perform better than an ensemble model, but the performance fluctuates when the number of background points is not properly selected. On the other hand, ensemble modeling provides consistently high accuracy regardless of background point sampling approach. Further, optimizing the number of background points for each component model within an ensemble model can provide the best model improvement. We suggest evaluating more models across multiple species to investigate how background point selection might affect ensemble models in different scenarios.
引用
收藏
页数:8
相关论文
共 46 条
[1]   Assessing the accuracy of species distribution models: prevalence, kappa and the true skill statistic (TSS) [J].
Allouche, Omri ;
Tsoar, Asaf ;
Kadmon, Ronen .
JOURNAL OF APPLIED ECOLOGY, 2006, 43 (06) :1223-1232
[2]  
[Anonymous], 2011, REV REC PLAN NO SPOT
[3]   Reducing uncertainty in projections of extinction risk from climate change [J].
Araújo, MB ;
Whittaker, RJ ;
Ladle, RJ ;
Erhard, M .
GLOBAL ECOLOGY AND BIOGEOGRAPHY, 2005, 14 (06) :529-538
[4]   Ensemble forecasting of species distributions [J].
Araujo, Miguel B. ;
New, Mark .
TRENDS IN ECOLOGY & EVOLUTION, 2007, 22 (01) :42-47
[5]   Meta-replication, sampling bias, and multi-scale model selection: A case study on snow leopard (Panthera uncia) in western China [J].
Atzeni, Luciano ;
Cushman, Samuel A. ;
Bai, Defeng ;
Wang, Jun ;
Chen, Pengju ;
Shi, Kun ;
Riordan, Philip .
ECOLOGY AND EVOLUTION, 2020, 10 (14) :7686-7712
[6]   Selecting pseudo-absences for species distribution models: how, where and how many? [J].
Barbet-Massin, Morgane ;
Jiguet, Frederic ;
Albert, Cecile Helene ;
Thuiller, Wilfried .
METHODS IN ECOLOGY AND EVOLUTION, 2012, 3 (02) :327-338
[7]   COMBINATION OF FORECASTS [J].
BATES, JM ;
GRANGER, CWJ .
OPERATIONAL RESEARCH QUARTERLY, 1969, 20 (04) :451-&
[8]  
Burnham K. P., 2002, Model selection and multimodel inference: A practical informationtheoretic approach
[9]   Effects of non-representative sampling design on multi-scale habitat models: flammulated owls in the Rocky Mountains. [J].
Chiaverini, Luca ;
Wan, Ho Yi ;
Hahn, Beth ;
Cilimburg, Amy ;
Wasserman, Tzeidle N. ;
Cushman, Samuel A. .
ECOLOGICAL MODELLING, 2021, 450 (450)
[10]   Ensemble models predict Important Bird Areas in southern Africa will become less effective for conserving endemic birds under climate change [J].
Coetzee, Bernard W. T. ;
Robertson, Mark P. ;
Erasmus, Barend F. N. ;
van Rensburg, Berndt J. ;
Thuiller, Wilfried .
GLOBAL ECOLOGY AND BIOGEOGRAPHY, 2009, 18 (06) :701-710