Predicting and Mapping of Soil Organic Carbon Using Machine Learning Algorithms in Northern Iran

被引:157
作者
Emadi, Mostafa [1 ]
Taghizadeh-Mehrjardi, Ruhollah [2 ,3 ]
Cherati, Ali [4 ]
Danesh, Majid [1 ]
Mosavi, Amir [5 ,6 ,7 ]
Scholten, Thomas [2 ,8 ,9 ]
机构
[1] Sari Agr Sci & Nat Resources Univ, Coll Crop Sci, Dept Soil Sci, Sari 4818168984, Iran
[2] Univ Tubingen, Dept Geosci Soil Sci & Geomorphol, D-72070 Tubingen, Germany
[3] Ardakan Univ, Fac Agr & Nat Resources, Ardakan 8951656767, Iran
[4] AREEO, Mazandaran Agr & Nat Resources Res & Educ Ctr, Soil & Water Res Dept, Sari 4849155356, Iran
[5] Tech Univ Dresden, Fac Civil Engn, D-01069 Dresden, Germany
[6] Duy Tan Univ, Inst Res & Dev, Da Nang 550000, Vietnam
[7] J Selye Univ, Dept Informat, Komarno 94501, Slovakia
[8] Univ Tubingen, CRC 1070, Ressource Cultures, D-72070 Tubingen, Germany
[9] Univ Tubingen, DFG Cluster Excellence Machine Learning, D-72070 Tubingen, Germany
关键词
soil organic carbon; carbon sequestration; machine learning; deep neural networks; susceptibility; big data; mapping; soil informatics; geochemistry; remote sensing; deep learning; data science; system science; ARTIFICIAL NEURAL-NETWORK; SPATIAL PREDICTION; SEMIARID RANGELANDS; GENETIC ALGORITHM; REGRESSION TREE; RANDOM FORESTS; MATTER CONTENT; GRADIENT; MODELS; STOCKS;
D O I
10.3390/rs12142234
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Estimation of the soil organic carbon (SOC) content is of utmost importance in understanding the chemical, physical, and biological functions of the soil. This study proposes machine learning algorithms of support vector machines (SVM), artificial neural networks (ANN), regression tree, random forest (RF), extreme gradient boosting (XGBoost), and conventional deep neural network (DNN) for advancing prediction models of SOC. Models are trained with 1879 composite surface soil samples, and 105 auxiliary data as predictors. The genetic algorithm is used as a feature selection approach to identify effective variables. The results indicate that precipitation is the most important predictor driving 14.9% of SOC spatial variability followed by the normalized difference vegetation index (12.5%), day temperature index of moderate resolution imaging spectroradiometer (10.6%), multiresolution valley bottom flatness (8.7%) and land use (8.2%), respectively. Based on 10-fold cross-validation, the DNN model reported as a superior algorithm with the lowest prediction error and uncertainty. In terms of accuracy, DNN yielded a mean absolute error of 0.59%, a root mean squared error of 0.75%, a coefficient of determination of 0.65, and Lin's concordance correlation coefficient of 0.83. The SOC content was the highest in udic soil moisture regime class with mean values of 3.71%, followed by the aquic (2.45%) and xeric (2.10%) classes, respectively. Soils in dense forestlands had the highest SOC contents, whereas soils of younger geological age and alluvial fans had lower SOC. The proposed DNN (hidden layers = 7, and size = 50) is a promising algorithm for handling large numbers of auxiliary data at a province-scale, and due to its flexible structure and the ability to extract more information from the auxiliary data surrounding the sampled observations, it had high accuracy for the prediction of the SOC base-line map and minimal uncertainty.
引用
收藏
页数:29
相关论文
共 115 条
  • [1] Prediction of organic potato yield using tillage systems and soil properties by artificial neural network (ANN) and multiple linear regressions (MLR)
    Abrougui, Khaoula
    Gabsi, Karim
    Mercatoris, Benoit
    Khemis, Chiheb
    Amami, Roua
    Chehaibi, Sayed
    [J]. SOIL & TILLAGE RESEARCH, 2019, 190 : 202 - 208
  • [2] Digital Mapping of Topsoil Carbon Content and Changes in the Driftless Area of Wisconsin, USA
    Adhikari, Kabindra
    Hartemink, Alfred E.
    [J]. SOIL SCIENCE SOCIETY OF AMERICA JOURNAL, 2015, 79 (01) : 155 - 164
  • [3] Total soil organic carbon and carbon sequestration potential in Nigeria
    Akpa, Stephen I. C.
    Odeh, Inakwu O. A.
    Bishop, Thomas F. A.
    Hartemink, Alfred E.
    Amapu, Ishaku Y.
    [J]. GEODERMA, 2016, 271 : 202 - 215
  • [4] Calibration and Testing of the Aquacrop Model for Rice under Water and Nitrogen Management
    Amiri, Ebrahim
    [J]. COMMUNICATIONS IN SOIL SCIENCE AND PLANT ANALYSIS, 2016, 47 (03) : 387 - 403
  • [5] [Anonymous], 2020, REMOTE SENS BASEL, DOI DOI 10.3390/RS12071095
  • [6] [Anonymous], 2005, SOILS IRAN NEW ACHIE
  • [7] Evaluating machine learning approaches for the interpolation of monthly air temperature at Mt. Kilimanjaro, Tanzania
    Appelhans, Tim
    Mwangomo, Ephraim
    Hardy, Douglas R.
    Hemp, Andreas
    Nauss, Thomas
    [J]. SPATIAL STATISTICS, 2015, 14 : 91 - 113
  • [8] Arno Candel Viraj Parmar., 2016, Deep learning with H2O. H2O
  • [9] Spatial prediction of soil properties in temperate mountain regions using support vector regression
    Ballabio, Cristiano
    [J]. GEODERMA, 2009, 151 (3-4) : 338 - 350
  • [10] Multi-scale digital terrain analysis and feature selection for digital soil mapping
    Behrens, Thorsten
    Zhu, A-Xing
    Schmidt, Karsten
    Scholten, Thomas
    [J]. GEODERMA, 2010, 155 (3-4) : 175 - 185