Background sampling for multi-scale ensemble habitat selection modeling: Does the number of points matter?

Cited by: 29
Authors
Hysen, Logan [1]
Nayeri, Danial [1]
Cushman, Samuel [2]
Wan, Ho Yi [1]
Affiliations
[1] Calif State Polytech Univ Humboldt, Dept Wildlife, 1 Harpst St, Arcata, CA 95521 USA
[2] USDA, Rocky Mt Res Stn, Flagstaff, AZ USA
Keywords
Biomod; Ecological niche; Habitat suitability; Presence-absence; Species distribution model; Presence-only; SPECIES DISTRIBUTION MODELS; PSEUDO-ABSENCES; PREDICTION; FOREST; RISK; PERFORMANCE; THRESHOLDS; ACCURACY
DOI
10.1016/j.ecoinf.2022.101914
Chinese Library Classification
Q14 [Ecology (Bioecology)]
Discipline classification codes
071012; 0713
Abstract
Ensemble habitat selection modeling is becoming a popular approach among ecologists for addressing a range of questions. Because ensemble modeling is still at an early stage of development and application, many questions remain about its performance and parameterization. One important gap, which this paper addresses, is how the number of background points used to train models influences the performance of the ensemble model. We used an empirical presence-only dataset and three different selections of background points to train scale-optimized habitat selection models using six modeling algorithms (GLM, GAM, MARS, ANN, Random Forest, and MaxEnt). We tested four ensemble models using different combinations of the component models: (a) equal numbers of background points and presences, (b) ten times as many background points as presences, (c) 10,000 background points, and (d) the optimized number of background points for each component model. Among regression-based approaches, MARS performed best when built with 10,000 background points. Among machine learning models, Random Forest (RF) performed best when built with equal numbers of presences and background points. Among the four ensemble models, AUC indicated that the best-performing model was the ensemble in which each component model used its optimized number of background points, while TSS increased as the number of background points increased. We found that an ensemble of models, each trained with an optimal number of background points, outperformed ensembles of models trained with the same number of background points, although differences in performance were slight. When using a single modeling method, RF with equal numbers of presences and background points can perform better than an ensemble model, but its performance fluctuates when the number of background points is not properly selected. Ensemble modeling, on the other hand, provides consistently high accuracy regardless of the background point sampling approach. Further, optimizing the number of background points for each component model within an ensemble model can provide the best model improvement. We suggest evaluating more models across multiple species to investigate how background point selection might affect ensemble models in different scenarios.
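The three fixed background-sampling schemes and the two evaluation metrics named in the abstract (AUC and TSS) can be illustrated with a minimal Python sketch. This is not the authors' code, which is not reproduced in this record; the function names `background_sizes`, `tss`, and `auc`, the 0.5 threshold, and the toy labels and scores are all hypothetical and chosen only for illustration.

```python
def background_sizes(n_presences, fixed=10_000):
    """Background-point counts for the three fixed schemes in the abstract.

    Scheme names are hypothetical labels, not taken from the paper.
    """
    return {
        "equal": n_presences,            # (a) as many background points as presences
        "ten_times": 10 * n_presences,   # (b) ten times the number of presences
        "fixed_10000": fixed,            # (c) a fixed 10,000 background points
    }


def tss(labels, predictions):
    """True skill statistic: sensitivity + specificity - 1 (binary 0/1 inputs)."""
    pairs = list(zip(labels, predictions))
    tp = sum(1 for y, p in pairs if y == 1 and p == 1)
    fn = sum(1 for y, p in pairs if y == 1 and p == 0)
    tn = sum(1 for y, p in pairs if y == 0 and p == 0)
    fp = sum(1 for y, p in pairs if y == 0 and p == 1)
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return sensitivity + specificity - 1


def auc(labels, scores):
    """AUC as the probability that a random presence outscores a random
    background point, counting ties as one half."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Toy data (assumed): 1 = presence, 0 = background point.
labels = [1, 1, 1, 0, 0, 0, 0]
scores = [0.9, 0.8, 0.4, 0.7, 0.3, 0.2, 0.1]
# An arbitrary 0.5 threshold converts suitability scores to binary predictions.
predictions = [1 if s >= 0.5 else 0 for s in scores]

print(background_sizes(150))
print(tss(labels, predictions))
print(auc(labels, scores))
```

AUC is computed from the continuous scores, whereas TSS depends on the threshold used to binarize them, which is one reason the two metrics can rank models differently.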
Pages: 8