Strategies for efficient estimation of soil organic content at the local scale based on a national spectral database

Times cited: 10
Authors
Li, Hongyi [1]
Li, Yuheng [1]
Yang, Meihua [2]
Chen, Songchao [3]
Shi, Zhou [3]
Affiliations
[1] Jiangxi Univ Finance & Econ, Sch Tourism & Urban Management, Dept Land Resource Management, Nanchang, PR, Peoples R China
[2] Yuzhang Normal Univ, Inst Ecosyst, Nanchang, Jiangxi, Peoples R China
[3] Zhejiang Univ, Inst Agr Remote Sensing & Informat Technol Applic, Coll Environm & Resource Sci, Hangzhou, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
efficient estimation; local scale; national spectral database; soil organic content; strategy; NEAR-INFRARED SPECTROSCOPY; PREDICTION; LIBRARIES; REGRESSION; SPIKING; CARBON;
DOI
10.1002/ldr.4223
Chinese Library Classification
X [Environmental science, safety science]
Discipline classification codes
08; 0830
Abstract
Soil function degradation threatens the sustainable management of soil resources, and soil organic matter (SOM) is a vital factor. Powerful measuring tools will become very important, especially in areas where data are poor or absent. The China Soil Visible and Near-Infrared (vis-NIR) Spectroscopy Library (CSSL) archive could help provide a solution for faster and less costly measurement of SOM. The aim of this article was to compare SOM prediction performance under three strategies: (i) general global partial least squares regression (PLSR) using CSSL with and without spiking samples; (ii) memory-based learning (MBL) using CSSL with and without spiking samples; and (iii) general PLSR using only spiking samples to predict SOM in the target area. When using spiked subsets, we also investigated the prediction performance of extra-weighted subsets (several copies of the spiking samples). A series of spiking subsets were randomly selected from the total spiking samples, which had been selected from the target sites by conditioned Latin hypercube sampling (cLHS). Using only vis-NIR data, we calculated the mean squared Euclidean distance (msd) between the estimated probability density functions (pdfs) of the principal components (PCs) of the vis-NIR spectra from the validation dataset and from the spiking subsets, and statistically inferred the optimal sampling set size to be 30. Our study showed that global PLSR using CSSL spiked with the statistically optimal local samples can achieve higher prediction performance [with a mean root mean square error (RMSE) of 5.75]. MBL spiked with five extra-weighted optimal spiking samples achieved the best accuracy, with an RMSE of 3.98, an R² of 0.70, a bias of 0.04, and a Lin's concordance correlation coefficient (LCCC) of 0.81. The msd is a simple and effective method to determine an adequate spiking set size using only vis-NIR data.
These accurate predictions demonstrated the usefulness of statistically representative spiking and MBL for SOM determination with large soil spectral libraries, a combination that is currently lacking in the large soil spectral libraries in use.
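The msd criterion described in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact formula, so everything here is an assumption made for illustration: the function names (`gaussian_kde_1d`, `msd_between_sets`), the number of PCs, the grid size, the Gaussian kernel, and the normal-reference bandwidth rule are all hypothetical choices, not the authors' implementation. The sketch projects both sets onto PCs fitted on the reference (validation) spectra, estimates a density per PC on a shared grid, and averages the squared differences.

```python
import numpy as np


def gaussian_kde_1d(samples, grid, bandwidth):
    """Evaluate a Gaussian kernel density estimate of `samples` on `grid`."""
    z = (grid[:, None] - samples[None, :]) / bandwidth
    norm = samples.size * bandwidth * np.sqrt(2.0 * np.pi)
    return np.exp(-0.5 * z ** 2).sum(axis=1) / norm


def msd_between_sets(reference, subset, n_components=3, n_grid=200):
    """Mean squared distance between per-PC density estimates of two sets.

    `reference` and `subset` are (n_samples, n_wavelengths) arrays.
    Hypothetical helper: PCs are fitted on `reference` only.
    """
    mean = reference.mean(axis=0)
    # PCA via SVD of the centred reference spectra.
    _, _, vt = np.linalg.svd(reference - mean, full_matrices=False)
    loadings = vt[:n_components].T
    ref_scores = (reference - mean) @ loadings
    sub_scores = (subset - mean) @ loadings

    sq_diffs = []
    for k in range(n_components):
        ref_k, sub_k = ref_scores[:, k], sub_scores[:, k]
        grid = np.linspace(min(ref_k.min(), sub_k.min()),
                           max(ref_k.max(), sub_k.max()), n_grid)
        # Assumed bandwidth: normal-reference rule on the reference scores,
        # shared by both sets so the two densities are comparable.
        bw = 1.06 * ref_k.std(ddof=1) * ref_k.size ** (-0.2)
        d_ref = gaussian_kde_1d(ref_k, grid, bw)
        d_sub = gaussian_kde_1d(sub_k, grid, bw)
        sq_diffs.append(np.mean((d_ref - d_sub) ** 2))
    return float(np.mean(sq_diffs))


if __name__ == "__main__":
    # Synthetic stand-in for spectra; real use would load vis-NIR data.
    rng = np.random.default_rng(0)
    spectra = rng.normal(size=(200, 50))
    for size in (5, 10, 30, 100):
        idx = rng.choice(spectra.shape[0], size=size, replace=False)
        print(size, msd_between_sets(spectra, spectra[idx]))
```

In a workflow like the one the abstract outlines, the msd would be computed for repeated random subsets at each candidate size, and the size beyond which it stops decreasing meaningfully would be taken as adequate; how that cut-off is inferred statistically is not specified in the abstract.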
Pages: 1649-1661
Number of pages: 13