Predicting species abundance using machine learning approach: a comparative assessment of random forest spatial variants and performance metrics

被引:2
作者
Mushagalusa, Ciza Arsene [1 ,2 ]
Fandohan, Adande Belarmain [3 ]
Kakai, Romain Glele [1 ]
机构
[1] Univ Abomey Calavi, Fac Sci Agron, Lab Biomath & Estimat Forestieres, 04 PB 1525, Cotonou, Benin
[2] Univ Evangel Africa UEA, Fac Agr & Environm Sci, Bukavu 3323, DEM REP CONGO
[3] Univ Natl Agr, Ecole Foresterie Trop, Unite Rech Foresterie & Conservat Bioressources, BP 43, Ketou, Benin
关键词
Ecological modelling; Machine learning; Random forest; Spatial analysis; Population estimation; GEOGRAPHICALLY WEIGHTED REGRESSION; IMPERFECT DETECTION; LINEAR-MODEL; CLIMATE; CLASSIFICATION; INTERPOLATION; AUTOCORRELATION; DEPENDENCE; DIVERSITY; POWERFUL;
D O I
10.1007/s40808-024-02055-7
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
For informed decision-making in biodiversity conservation and ecological management, accurate predictions of species abundance are essential. This study aimed to assess the predictive performance of random forest (RF) spatial variants in modelling species abundance distribution compared to standard RF, Poisson, their hybrid methods with ordinary kriging (OK), and the random generalised linear model (RGLM). Model performance in abundance modelling has rarely been quantified using a comprehensive index, except the existing single statistical indices. Therefore, modified Taylor diagrams were used to evaluate the model's overall ability to predict species abundance spatial patterns, taking into account abundance class and detection probability. An exponential correlation function was used to generate spatially correlated random effects with and without a quadratic term and two variation strengths. Species abundance class and the relationship between abundance and independent variables determine which RF spatial variant performs the best. Spatial RF variants outperform conventional modelling in terms of prediction accuracy and power, particularly when spatial autocorrelation and species detection probabilities are high. RF spatial variants were less precise for common species than RGLM and GLM-OK, which better predicted species abundance for low or no spatial autocorrelation cases. However, none of the models outperformed the others for all prediction goals, highlighting the need for combining performance metrics to evaluate species abundance distribution models. The study highlights the importance of model specification in ecological research and cautions against the use of RF algorithms as a black box.
引用
收藏
页码:5145 / 5171
页数:27
相关论文
共 121 条
[11]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[12]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[13]  
Breiman L, 1984, Classification and Regression Trees, V1st, DOI DOI 10.1201/9781315139470
[14]   Spatial prediction models for landslide hazards: review, comparison and evaluation [J].
Brenning, A .
NATURAL HAZARDS AND EARTH SYSTEM SCIENCES, 2005, 5 (06) :853-862
[15]   Model selection and assessment for multi-species occupancy models [J].
Broms, Kristin M. ;
Hooten, Mevin B. ;
Fitzpatrick, Ryan M. .
ECOLOGY, 2016, 97 (07) :1759-1770
[16]   Geographically weighted regression - modelling spatial non-stationarity [J].
Brunsdon, C ;
Fotheringham, S ;
Charlton, M .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES D-THE STATISTICIAN, 1998, 47 :431-443
[17]  
Cameron A. C., 2005, Microeconometrics: Methods and Applications
[18]   Vertebrates on the brink as indicators of biological annihilation and the sixth mass extinction [J].
Ceballos, Gerardo ;
Ehrlich, Paul R. ;
Raven, Peter H. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2020, 117 (24) :13596-13602
[19]   Top predators govern multitrophic diversity effects in tritrophic food webs [J].
Ceulemans, Ruben ;
Guill, Christian ;
Gaedke, Ursula .
ECOLOGY, 2021, 102 (07)
[20]  
Chiles J.-C., 2012, GEOSTATISITCS MODELI, P28, DOI [10.1002/9781118136188.ch2, DOI 10.1002/9781118136188.CH2]