A Comprehensive Evaluation of Machine Learning Algorithms for Digital Soil Organic Carbon Mapping on a National Scale

被引:0
作者
Radocaj, Dorijan [1 ]
Jug, Danijel [1 ]
Jug, Irena [1 ]
Jurisic, Mladen [1 ]
机构
[1] Univ Josip Juraj Strossmayer Osijek, Fac Agrobiotechn Sci Osijek, Vladimira Preloga 1, Osijek 31000, Croatia
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 21期
关键词
random forest; web of science core collection topic search; LUCAS dataset; environmental covariates; digital soil mapping; remote sensing; PREDICTION;
D O I
10.3390/app14219990
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The aim of this study was to narrow the research gap of ambiguity in which machine learning algorithms should be selected for evaluation in digital soil organic carbon (SOC) mapping. This was performed by providing a comprehensive assessment of prediction accuracy for 15 frequently used machine learning algorithms in digital SOC mapping based on studies indexed in the Web of Science Core Collection (WoSCC), providing a basis for algorithm selection in future studies. Two study areas, including mainland France and the Czech Republic, were used in the study based on 2514 and 400 soil samples from the LUCAS 2018 dataset. Random Forest was first ranked for France (mainland) and then ranked for the Czech Republic regarding prediction accuracy; the coefficients of determination were 0.411 and 0.249, respectively, which was in accordance with its dominant appearance in previous studies indexed in the WoSCC. Additionally, the K-Nearest Neighbors and Gradient Boosting Machine regression algorithms indicated, relative to their frequency in studies indexed in the WoSCC, that they are underrated and should be more frequently considered in future digital SOC studies. Future studies should consider study areas not strictly related to human-made administrative borders, as well as more interpretable machine learning and ensemble machine learning approaches.
引用
收藏
页数:15
相关论文
共 67 条
  • [61] Assessing the predictive capability of ensemble tree methods for landslide susceptibility mapping using XGBoost, gradient boosting machine, and random forest
    Sahin, Emrehan Kutlug
    [J]. SN APPLIED SCIENCES, 2020, 2 (07):
  • [62] Support Vector Machine Versus Random Forest for Remote Sensing Image Classification: A Meta-Analysis and Systematic Review
    Sheykhmousa, Mohammadreza
    Mahdianpari, Masoud
    Ghanbari, Hamid
    Mohammadimanesh, Fariba
    Ghamisi, Pedram
    Homayouni, Saeid
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2020, 13 : 6308 - 6325
  • [63] Therneau T., Rpart: Recursive Partitioning and Regression Trees, R Package Version 4.1.23
  • [64] How to compare sampling designs for mapping?
    Wadoux, Alexandre M. J. -C.
    Brus, Dick J.
    [J]. EUROPEAN JOURNAL OF SOIL SCIENCE, 2021, 72 (01) : 35 - 46
  • [65] Using the Negative Soil Adjustment Factor of Soil Adjusted Vegetation Index (SAVI) to Resist Saturation Effects and Estimate Leaf Area Index (LAI) in Dense Vegetation Areas
    Zhen, Zhijun
    Chen, Shengbo
    Yin, Tiangang
    Chavanon, Eric
    Lauret, Nicolas
    Guilleux, Jordan
    Henke, Michael
    Qin, Wenhan
    Cao, Lisai
    Li, Jian
    Lu, Peng
    Gastellu-Etchegorry, Jean-Philippe
    [J]. SENSORS, 2021, 21 (06) : 1 - 15
  • [66] Random Forests and Cubist Algorithms for Predicting Shear Strengths of Rockfill Materials
    Zhou, Jian
    Li, Enming
    Wei, Haixia
    Li, Chuanqi
    Qiao, Qiuqiu
    Armaghani, Danial Jahed
    [J]. APPLIED SCIENCES-BASEL, 2019, 9 (08):
  • [67] Estimation of maize above-ground biomass based on stem-leaf separation strategy integrated with LiDAR and optical remote sensing data
    Zhu, Yaohui
    Zhao, Chunjiang
    Yang, Hao
    Yang, Guijun
    Han, Liang
    Li, Zhenhai
    Feng, Haikuan
    Xu, Bo
    Wu, Jintao
    Lei, Lei
    [J]. PEERJ, 2019, 7