Spatial prediction of ground substrate thickness in shallow mountain area based on machine learning model

被引:0
作者
Zhu, Xiaosong [1 ]
Pei, Xiaolong [1 ]
Yang, Siqi [2 ,3 ]
Wang, Wei [1 ]
Dong, Yue [1 ]
Fang, Mengyang [2 ,3 ]
Liu, Wenjie [3 ,4 ]
Jiang, Lingxiu [2 ,3 ]
机构
[1] China Geol Survey, Langfang Comprehens Survey Ctr Nat Resources, Langfang, Peoples R China
[2] China Geol Survey, Haikou Marine Geol Survey Ctr, Haikou, Peoples R China
[3] Sanya Land Sea Interface Crit Zone Field Sci Obser, Sanya, Peoples R China
[4] Hainan Univ, Sch Ecol & Environm, Haikou, Peoples R China
关键词
ground substrate; machine learning; parameter optimization; model validation; thickness prediction; SOIL DEPTH;
D O I
10.3389/feart.2024.1455124
中图分类号
P [天文学、地球科学];
学科分类号
07 ;
摘要
Introduction The thickness of ground substrate in shallow mountainous areas is a crucial indicator for substrate investigations and a key factor in evaluating substrate quality and function. Reliable data acquisition methods are essential for effective investigation.Methods This study utilizes six machine learning algorithms-Gradient Boosting Machine (GB), Random Forest (RF), AdaBoost Regressor (AB), Neural Network (NN), Support Vector Machine (SVM), and k-Nearest Neighbors (kNN)-to predict ground substrate thickness. Grid search optimization was employed to fine-tune model parameters. The models' performances were evaluated using four metrics: mean squared error (MSE), root mean squared error (RMSE), mean absolute error (MAE), and the coefficient of determination (R2). The optimal parameter combinations for each model were then used to calculate the spatial distribution of ground substrate thickness in the study area.Results The results indicate that after parameter optimization, all models showed significant reductions in the MSE, RMSE, and MAE, while R2 values increased substantially. Under optimal parameters, the RF model achieved an MSE of 1,589, RMSE of 39.8, MAE of 26.5, and an R2 of 0.63, with a Pearson correlation coefficient of 0.80, outperforming the other models. Therefore, parameter tuning is a necessary step in using machine learning models to predict ground substrate thickness, and the performance of all six models improved significantly after tuning. Overall, ensemble learning models provided better predictive performance than other machine learning models, with the RF model demonstrating the best accuracy and robustness.Discussion Moreover, further attention is required on the characteristics of sample data and environmental variables in machine learning-based predictions.
引用
收藏
页数:12
相关论文
共 50 条
  • [11] [贾磊 Jia Lei], 2022, [西北地质, Northwestern Geology], V55, P140
  • [12] [金昭 Jin Zhao], 2022, [地理研究, Geographical Research], V41, P1731
  • [13] Three-dimensional mapping of soil organic matter content using soil type-specific depth functions
    Kempen, B.
    Brus, D. J.
    Stoorvogel, J. J.
    [J]. GEODERMA, 2011, 162 (1-2) : 107 - 123
  • [14] Digital mapping of soil carbon fractions with machine learning
    Keskin, Hamza
    Grunwald, Sabine
    Harris, Willie G.
    [J]. GEODERMA, 2019, 339 (40-58) : 40 - 58
  • [15] [蓝雪春 Lan Xuechun], 2017, [中国水土保持科学, Science of Soil and Water Conservation], V15, P141
  • [16] [李响 Li Xiang], 2023, [地质通报, Geological Bulletin of China], V42, P68
  • [17] [刘金涛 Liu Jintao], 2024, [土壤学报, Acta Pedologica Sinica], V61, P319
  • [18] [刘金涛 Liu Jintao], 2019, [水科学进展, Advances in Water Science], V30, P112
  • [19] A simple geomorphic-based analytical model for predicting the spatial distribution of soil thickness in headwater hillslopes and catchments
    Liu, Jintao
    Chen, Xi
    Lin, Henry
    Liu, Hu
    Song, Huiqing
    [J]. WATER RESOURCES RESEARCH, 2013, 49 (11) : 7733 - 7746
  • [20] Assessing soil organic carbon stocks under current and potential forest cover using digital soil mapping and spatial generalisation
    Ottoy, Sam
    De Vos, Bruno
    Sindayihebura, Anicet
    Hermy, Martin
    Van Orshoven, Jos
    [J]. ECOLOGICAL INDICATORS, 2017, 77 : 139 - 150