A Comprehensive Evaluation of Machine Learning Algorithms for Digital Soil Organic Carbon Mapping on a National Scale

被引:0
作者
Radocaj, Dorijan [1 ]
Jug, Danijel [1 ]
Jug, Irena [1 ]
Jurisic, Mladen [1 ]
机构
[1] Univ Josip Juraj Strossmayer Osijek, Fac Agrobiotechn Sci Osijek, Vladimira Preloga 1, Osijek 31000, Croatia
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 21期
关键词
random forest; web of science core collection topic search; LUCAS dataset; environmental covariates; digital soil mapping; remote sensing; PREDICTION;
D O I
10.3390/app14219990
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The aim of this study was to narrow the research gap of ambiguity in which machine learning algorithms should be selected for evaluation in digital soil organic carbon (SOC) mapping. This was performed by providing a comprehensive assessment of prediction accuracy for 15 frequently used machine learning algorithms in digital SOC mapping based on studies indexed in the Web of Science Core Collection (WoSCC), providing a basis for algorithm selection in future studies. Two study areas, including mainland France and the Czech Republic, were used in the study based on 2514 and 400 soil samples from the LUCAS 2018 dataset. Random Forest was first ranked for France (mainland) and then ranked for the Czech Republic regarding prediction accuracy; the coefficients of determination were 0.411 and 0.249, respectively, which was in accordance with its dominant appearance in previous studies indexed in the WoSCC. Additionally, the K-Nearest Neighbors and Gradient Boosting Machine regression algorithms indicated, relative to their frequency in studies indexed in the WoSCC, that they are underrated and should be more frequently considered in future digital SOC studies. Future studies should consider study areas not strictly related to human-made administrative borders, as well as more interpretable machine learning and ensemble machine learning approaches.
引用
收藏
页数:15
相关论文
共 67 条
  • [1] Digital Mapping of Soil Properties Using Ensemble Machine Learning Approaches in an Agricultural Lowland Area of Lombardy, Italy
    Adeniyi, Odunayo David
    Brenning, Alexander
    Bernini, Alice
    Brenna, Stefano
    Maerker, Michael
    [J]. LAND, 2023, 12 (02)
  • [2] Sentinel-1 soil moisture at 1 km resolution: a validation study
    Balenzano, Anna
    Mattia, Francesco
    Satalino, Giuseppe
    Lovergine, Francesco P.
    Palmisano, Davide
    Peng, Jian
    Marzahn, Philip
    Wegmuller, Urs
    Cartus, Oliver
    Dabrowska-Zielinska, Katarzyna
    Musial, Jan P.
    Davidson, Malcolm W. J.
    Pauwels, Valentijn R. N.
    Cosh, Michael H.
    McNairn, Heather
    Johnson, Joel T.
    Walker, Jeffrey P.
    Yueh, Simon H.
    Entekhabi, Dara
    Kerr, Yann H.
    Jackson, Thomas J.
    [J]. REMOTE SENSING OF ENVIRONMENT, 2021, 263
  • [3] Machine learning based soil maps for a wide range of soil properties for the forested area of Switzerland
    Baltensweiler, Andri
    Walthert, Lorenz
    Hanewinkel, Marc
    Zimmermann, Stephan
    Nussbaum, Madlene
    [J]. GEODERMA REGIONAL, 2021, 27
  • [4] Using local ensemble models and Landsat bare soil composites for large-scale soil organic carbon maps in cropland
    Broeg, Tom
    Don, Axel
    Gocht, Alexander
    Scholten, Thomas
    Taghizadeh-Mehrjardi, Ruhollah
    Erasmi, Stefan
    [J]. GEODERMA, 2024, 444
  • [5] Transferability of Covariates to Predict Soil Organic Carbon in Cropland Soils
    Broeg, Tom
    Blaschek, Michael
    Seitz, Steffen
    Taghizadeh-Mehrjardi, Ruhollah
    Zepp, Simone
    Scholten, Thomas
    [J]. REMOTE SENSING, 2023, 15 (04)
  • [6] Root mean square error (RMSE) or mean absolute error (MAE)? - Arguments against avoiding RMSE in the literature
    Chai, T.
    Draxler, R. R.
    [J]. GEOSCIENTIFIC MODEL DEVELOPMENT, 2014, 7 (03) : 1247 - 1250
  • [7] Characterization of field-scale soil variation using a stepwise multi-sensor fusion approach and a cost-benefit analysis
    Chatterjee, Sumanta
    Hartemink, Alfred E.
    Triantafilis, John
    Desai, Ankur R.
    Soldat, Doug
    Zhu, Jun
    Townsend, Philip A.
    Zhang, Yakun
    Huang, Jingyi
    [J]. CATENA, 2021, 201
  • [8] Chen T., Xgboost: Extreme Gradient Boosting
  • [9] A daily merged MODIS Aqua-Terra land surface temperature data set for the conterminous United States
    Crosson, William L.
    Al-Hamdan, Mohammad Z.
    Hemmings, Sarah N. J.
    Wade, Gina M.
    [J]. REMOTE SENSING OF ENVIRONMENT, 2012, 119 : 315 - 324
  • [10] Influence of soil properties, topography, and land cover on soil organic carbon and total nitrogen concentration: A case study in Qinghai-Tibet plateau based on random forest regression and structural equation modeling
    Dai, Lijun
    Ge, Jingsong
    Wang, Lingqing
    Zhang, Qian
    Liang, Tao
    Bolan, Nanthi
    Lischeid, Gunnar
    Rinklebe, Joerg
    [J]. SCIENCE OF THE TOTAL ENVIRONMENT, 2022, 821