Effect of training sample size, sampling design and prediction model on soil mapping with proximal sensing data for precision liming

Cited by: 10
Authors
Schmidinger, Jonas [1, 2]
Schroeter, Ingmar [3]
Boenecke, Eric [4]
Gebbers, Robin [5]
Ruehlmann, Joerg [4]
Kramer, Eckart [3]
Mulder, Vera L. [6]
Heuvelink, Gerard B. M. [6, 7]
Vogel, Sebastian [1]
Affiliations
[1] Leibniz Inst Agr Engn & Bioecon ATB, Dept Agromechatron, Potsdam, Germany
[2] Osnabruck Univ, Joint Lab Artificial Intelligence & Data Sci, Osnabruck, Germany
[3] Eberswalde Univ Sustainable Dev, Landscape Management & Nat Conservat, Eberswalde, Germany
[4] Leibniz Inst Vegetable & Ornamental Crops, Next Generat Hort Syst, Grossbeeren, Germany
[5] Martin Luther Univ Halle Wittenberg, Inst Agr & Nutr Sci, Halle, Germany
[6] Wageningen Univ & Res, Soil Geog & Landscape Grp, Wageningen, Netherlands
[7] ISRIC World Soil Informat Wageningen, Wageningen, Netherlands
Keywords
Variable rate soil liming; Digital soil mapping; Training sample size; Precision agriculture; Sampling designs; Proximal soil sensing; Spatial prediction; Scale
DOI
10.1007/s11119-024-10122-3
Chinese Library Classification
S [Agricultural sciences]
Subject classification code
09
Abstract
Site-specific estimation of lime requirement requires high-resolution maps of soil organic carbon (SOC), clay and pH. These maps can be generated with digital soil mapping models fitted on covariates observed by proximal soil sensors. However, the quality of the derived maps depends on the applied methodology. We assessed the effects of (i) training sample size (5-100); (ii) sampling design (simple random sampling (SRS), conditioned Latin hypercube sampling (cLHS) and k-means sampling (KM)); and (iii) prediction model (multiple linear regression (MLR) and random forest (RF)) on the prediction performance for the three above-mentioned soil properties. The case study is based on conditional geostatistical simulations using 250 soil samples from a 51 ha field in Eastern Germany. Lin's concordance correlation coefficient (CCC) and root-mean-square error (RMSE) were used to evaluate model performance. The results show that relative improvements in RMSE and CCC decreased exponentially as training sample size increased. The lowest median RMSE values were obtained with 100 training observations: 1.73%, 0.21% and 0.3 for clay, SOC and pH, respectively. However, a sample size of only 10 already yielded models of moderate quality (CCC > 0.65) for all three soil properties. cLHS and KM performed significantly better than SRS. MLR showed lower median RMSE values than RF for SOC and pH at smaller sample sizes, but RF outperformed MLR if at least 25-30 or 75-100 soil samples were used for SOC or pH, respectively. For clay, the median RMSE was lower with RF, regardless of sample size.
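For readers unfamiliar with the two evaluation metrics named in the abstract, the following is a minimal sketch of RMSE and Lin's CCC using their standard textbook definitions (population variances and covariance). It is not the authors' code; the function names and the pH values are hypothetical and chosen only for illustration.

```python
import math


def rmse(observed, predicted):
    """Root-mean-square error between paired observations and predictions."""
    n = len(observed)
    return math.sqrt(sum((o - p) ** 2 for o, p in zip(observed, predicted)) / n)


def lins_ccc(observed, predicted):
    """Lin's concordance correlation coefficient (population moments, 1/n)."""
    n = len(observed)
    mean_o = sum(observed) / n
    mean_p = sum(predicted) / n
    var_o = sum((o - mean_o) ** 2 for o in observed) / n
    var_p = sum((p - mean_p) ** 2 for p in predicted) / n
    cov = sum((o - mean_o) * (p - mean_p)
              for o, p in zip(observed, predicted)) / n
    # Unlike Pearson's r, the denominator penalises a shift in the mean
    # and a difference in variance between predictions and observations.
    return 2 * cov / (var_o + var_p + (mean_o - mean_p) ** 2)


# Hypothetical pH values, for illustration only (not data from the study).
observed = [5.8, 6.1, 6.4, 6.9, 7.2]
predicted = [6.0, 6.0, 6.5, 6.7, 7.3]

print("RMSE:", rmse(observed, predicted))
print("CCC:", lins_ccc(observed, predicted))
```

A CCC of 1 means predictions fall exactly on the 1:1 line. A constant bias lowers CCC even when the correlation is perfect, which is one reason it is often reported alongside RMSE.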
Pages: 1529-1555
Number of pages: 27
Related papers
2 in total
  • [1] Effect of training sample size, sampling design and prediction model on soil mapping with proximal sensing data for precision liming
    Jonas Schmidinger
    Ingmar Schröter
    Eric Bönecke
    Robin Gebbers
    Joerg Ruehlmann
    Eckart Kramer
    Vera L. Mulder
    Gerard B. M. Heuvelink
    Sebastian Vogel
    Precision Agriculture, 2024, 25 : 1529 - 1555
  • [2] Detecting spatial structures in throughfall data: The effect of extent, sample size, sampling design, and variogram estimation method
    Voss, Sebastian
    Zimmermann, Beate
    Zimmermann, Alexander
    JOURNAL OF HYDROLOGY, 2016, 540 : 527 - 537