Quantifying uncertainty in the prediction of soil properties using mid-infrared spectra

被引:0
作者
Omondiagbe, Osayande Pascal [1 ]
Roudier, Pierre [2 ]
Lilburne, Linda [1 ]
Ma, Yuxin [2 ]
Mcneill, Stephen [1 ]
机构
[1] Manaaki Whenua Landcare Res, Box 69040, Lincoln 7640, New Zealand
[2] Manawatu Mail Ctr, Manaaki Whenua Landcare Res, Private Bag 11052, Palmerston North 4442, New Zealand
关键词
Soil spectroscopy; Uncertainty quantification; Mid-infrared; Bayesian convolution neural networks; Generalised additive models; Partial least-squares regression; SPECTROSCOPY; MODELS;
D O I
10.1016/j.geoderma.2024.116954
中图分类号
S15 [土壤学];
学科分类号
0903 ; 090301 ;
摘要
Soil's pivotal role in environmental and agricultural processes underscores the importance of accurate soil property predictions for informed decisions and sustainable land management. Spectroscopic techniques, particularly mid-infrared (MIR) spectroscopy, have emerged as rapid and non-destructive tools for soil analysis. Despite advances in predicting soil properties using spectroscopy, quantifying prediction uncertainties has often been overlooked. Accurate uncertainty quantification helps risk assessment and decision-making processes. This study introduces an enhanced version of the variational inference technique to capture uncertainty when using Bayesian Convolutional Neural Networks (Bayesian CNNs). This Bayesian CNNs method was evaluated against two other methods - Bootstrapped Partial Least-Squares regression (Bootsrapped PLS) and Generalised Additive Models (GAM) for their ability to quantify uncertainty in six soil property predictions (clay, sand, silt, pH, phosphorus retention, and carbon) based on MIR spectroscopy. In terms of predictive performance and quality of prediction, our evaluation indicated that both GAMs and Bayesian CNNs outperformed PLS-BS for all six soil properties. The ability of GAMs and Bayesian CNNs to capture non-linear relationships in the data allowed for better fitting to the underlying patterns. Bayesian CNNs, in particular, demonstrated superior performance by combining accurate predictions with robust uncertainty quantification. Our results also showed that, on our dataset, bootstrapping failed to provide satisfactory prediction intervals. We suggest therefore that the evaluation of models should extend beyond standard validation metrics, which typically focuses on prediction accuracy, to include an assessment of the predicted uncertainty.
引用
收藏
页数:16
相关论文
共 43 条
  • [11] MONTE-CARLO SAMPLING METHODS USING MARKOV CHAINS AND THEIR APPLICATIONS
    HASTINGS, WK
    [J]. BIOMETRIKA, 1970, 57 (01) : 97 - &
  • [12] Modelling soil variation: past, present, and future
    Heuvelink, GBM
    Webster, R
    [J]. GEODERMA, 2001, 100 (3-4) : 269 - 301
  • [13] Bootstrapping Not Independent and Not Identically Distributed Data
    Hrba, Martin
    Maciak, Matus
    Pestova, Barbora
    Pesta, Michal
    [J]. MATHEMATICS, 2022, 10 (24)
  • [14] CALIBRATION OF PROCESS-ORIENTED MODELS
    JANSSEN, PHM
    HEUBERGER, PSC
    [J]. ECOLOGICAL MODELLING, 1995, 83 (1-2) : 55 - 66
  • [15] Kendall Alex, 2017, Advances in Neural Information Processing Systems, V30
  • [16] Modelling soil carbon fractions with visible near-infrared (VNIR) and mid-infrared (MIR) spectroscopy
    Knox, N. M.
    Grunwald, S.
    McDowell, M. L.
    Bruland, G. L.
    Myers, D. B.
    Harris, W. G.
    [J]. GEODERMA, 2015, 239 : 229 - 239
  • [17] Konishi S, 2008, SPRINGER SER STAT, P1, DOI 10.1007/978-0-387-71887-3
  • [18] Kuhn M., 2013, Applied Predictive Modeling, V26
  • [19] Lakshminarayanan B, 2017, ADV NEUR IN, V30
  • [20] A CONCORDANCE CORRELATION-COEFFICIENT TO EVALUATE REPRODUCIBILITY
    LIN, LI
    [J]. BIOMETRICS, 1989, 45 (01) : 255 - 268