SoilGrids250m: Global gridded soil information based on machine learning

被引:2641
作者
Hengl, Tomislav [1 ]
de Jesus, Jorge Mendes [1 ]
Heuvelink, Gerard B. M. [1 ]
Gonzalez, Maria Ruiperez [1 ]
Kilibarda, Milan [2 ]
Blagotic, Aleksandar [3 ]
Shangguan, Wei [4 ]
Wright, Marvin N. [5 ]
Geng, Xiaoyuan [6 ]
Bauer-Marschallinger, Bernhard [7 ]
Guevara, Mario Antonio [8 ]
Vargas, Rodrigo [8 ]
MacMillan, Robert A. [9 ]
Batjes, Niels H. [1 ]
Leenaars, Johan G. B. [1 ]
Ribeiro, Eloi [1 ]
Wheeler, Ichsani [10 ]
Mantel, Stephan [1 ]
Kempen, Bas [1 ]
机构
[1] ISRIC World Soil Informat, Wageningen, Netherlands
[2] Univ Belgrade, Fac Civil Engn, Belgrade, Serbia
[3] GILab Ltd, Belgrade, Serbia
[4] Sun Yat Sen Univ, Sch Atmospher Sci, Guangzhou, Guangdong, Peoples R China
[5] Inst Med Biometrie & Stat, Lubeck, Germany
[6] Agr & Agri Food Canada, Ottawa, ON, Canada
[7] Vienna Univ Technol, Dept Geodesy & Geoinformat, Vienna, Austria
[8] Univ Delaware, Newark, DE USA
[9] LandMapper Environm Solut Inc, Edmonton, AB, Canada
[10] Envirometrix Inc, Wageningen, Netherlands
来源
PLOS ONE | 2017年 / 12卷 / 02期
关键词
SPATIOTEMPORAL INTERPOLATION; ORGANIC-CARBON; DATABASE; PREDICTIONS; DATASETS; FORESTS; MODELS; WATER; MAPS; TOOL;
D O I
10.1371/journal.pone.0169748
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
This paper describes the technical development and accuracy assessment of the most recent and improved version of the SoilGrids system at 250m resolution (June 2016 update). SoilGrids provides global predictions for standard numeric soil properties (organic carbon, bulk density, Cation Exchange Capacity (CEC), pH, soil texture fractions and coarse fragments) at seven standard depths (0, 5, 15, 30, 60, 100 and 200 cm), in addition to predictions of depth to bedrock and distribution of soil classes based on the World Reference Base (WRB) and USDA classification systems (ca. 280 raster layers in total). Predictions were based on ca. 150,000 soil profiles used for training and a stack of 158 remote sensing-based soil covariates (primarily derived from MODIS land products, SRTM DEM derivatives, climatic images and global landform and lithology maps), which were used to fit an ensemble of machine learning methods D random forest and gradient boosting and/or multinomial logistic regression D as implemented in the R packages ranger, xgboost, nnet and caret. The results of 10 -fold cross-validation show that the ensemble models explain between 56% (coarse fragments) and 83% (pH) of variation with an overall average of 61%. Improvements in the relative accuracy considering the amount of variation explained, in comparison to the previous version of SoilGrids at 1 km spatial resolution, range from 60 to 230%. Improvements can be attributed to: (1) the use of machine learning instead of linear regression, (2) to considerable investments in preparing finer resolution covariate layers and (3) to insertion of additional soil profiles. Further development of SoilGrids could include refinement of methods to incorporate input uncertainties and derivation of posterior probability distributions (per pixel), and further automation of spatial modeling so that soil maps can be generated for potentially hundreds of soil variables. Another area of future research is the development of methods for multiscale merging of SoilGrids predictions with local and/or national gridded soil products (e.g. up to 50 m spatial resolution) so that increasingly more accurate, complete and consistent global soil information can be produced. SoilGrids are available under the Open Data Base License.
引用
收藏
页数:40
相关论文
共 89 条
  • [1] Combining Soil Databases for Topsoil Organic Carbon Mapping in Europe
    Aksoy, Ece
    Yigini, Yusuf
    Montanarella, Luca
    [J]. PLOS ONE, 2016, 11 (03):
  • [2] [Anonymous], 2005, R news, DOI DOI 10.1007/978-1-4614-7618-4_2
  • [3] [Anonymous], APPLIED PREDICTIVE M
  • [4] [Anonymous], 2011, FUND INF THEOR COD
  • [5] [Anonymous], THESIS
  • [6] GlobalSoilMap: Toward a Fine-Resolution Global Grid of Soil Properties
    Arrouays, Dominique
    Grundy, Michael G.
    Hartemink, Alfred E.
    Hempel, Jonathan W.
    Heuvelink, Gerard B. M.
    Hong, S. Young
    Lagacherie, Philippe
    Lelyk, Glenn
    McBratney, Alexander B.
    McKenzie, Neil J.
    Mendonca-Santos, Maria D. L.
    Minasny, Budiman
    Montanarella, Luca
    Odeh, Inakwu O. A.
    Sanchez, Pedro A.
    Thompson, James A.
    Zhang, Gan-Lin
    [J]. ADVANCES IN AGRONOMY, VOL 125, 2014, 125 : 93 - +
  • [7] UNCERTAINTIES OF A TANDEM-X DERIVED DIGITAL SURFACE MODEL A CASE STUDY FROM THE RODA CATCHMENT, GERMANY
    Baade, Jussi
    Schmullius, Christiane
    [J]. 2014 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2014,
  • [8] Harmonized soil profile data for applications at global and continental scales: updates to the WISE database
    Batjes, N. H.
    [J]. SOIL USE AND MANAGEMENT, 2009, 25 (02) : 124 - 127
  • [9] Optimisation of global grids for high-resolution remote sensing data
    Bauer-Marschallinger, Bernhard
    Sabel, Daniel
    Wagner, Wolfgang
    [J]. COMPUTERS & GEOSCIENCES, 2014, 72 : 84 - 93
  • [10] Soil mapping, classification, and pedologic modeling: History and future directions
    Brevik, Eric C.
    Calzolari, Costanza
    Miller, Bradley A.
    Pereira, Paulo
    Kabala, Cezary
    Baumgarten, Andreas
    Jordan, Antonio
    [J]. GEODERMA, 2016, 264 : 256 - 274