Predicting and Mapping of Soil Organic Carbon Using Machine Learning Algorithms in Northern Iran

被引:187
作者
Emadi, Mostafa [1 ]
Taghizadeh-Mehrjardi, Ruhollah [2 ,3 ]
Cherati, Ali [4 ]
Danesh, Majid [1 ]
Mosavi, Amir [5 ,6 ,7 ]
Scholten, Thomas [2 ,8 ,9 ]
机构
[1] Sari Agr Sci & Nat Resources Univ, Coll Crop Sci, Dept Soil Sci, Sari 4818168984, Iran
[2] Univ Tubingen, Dept Geosci Soil Sci & Geomorphol, D-72070 Tubingen, Germany
[3] Ardakan Univ, Fac Agr & Nat Resources, Ardakan 8951656767, Iran
[4] AREEO, Mazandaran Agr & Nat Resources Res & Educ Ctr, Soil & Water Res Dept, Sari 4849155356, Iran
[5] Tech Univ Dresden, Fac Civil Engn, D-01069 Dresden, Germany
[6] Duy Tan Univ, Inst Res & Dev, Da Nang 550000, Vietnam
[7] J Selye Univ, Dept Informat, Komarno 94501, Slovakia
[8] Univ Tubingen, CRC 1070, Ressource Cultures, D-72070 Tubingen, Germany
[9] Univ Tubingen, DFG Cluster Excellence Machine Learning, D-72070 Tubingen, Germany
关键词
soil organic carbon; carbon sequestration; machine learning; deep neural networks; susceptibility; big data; mapping; soil informatics; geochemistry; remote sensing; deep learning; data science; system science; ARTIFICIAL NEURAL-NETWORK; SPATIAL PREDICTION; SEMIARID RANGELANDS; GENETIC ALGORITHM; REGRESSION TREE; RANDOM FORESTS; MATTER CONTENT; GRADIENT; MODELS; STOCKS;
D O I
10.3390/rs12142234
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Estimation of the soil organic carbon (SOC) content is of utmost importance in understanding the chemical, physical, and biological functions of the soil. This study proposes machine learning algorithms of support vector machines (SVM), artificial neural networks (ANN), regression tree, random forest (RF), extreme gradient boosting (XGBoost), and conventional deep neural network (DNN) for advancing prediction models of SOC. Models are trained with 1879 composite surface soil samples, and 105 auxiliary data as predictors. The genetic algorithm is used as a feature selection approach to identify effective variables. The results indicate that precipitation is the most important predictor driving 14.9% of SOC spatial variability followed by the normalized difference vegetation index (12.5%), day temperature index of moderate resolution imaging spectroradiometer (10.6%), multiresolution valley bottom flatness (8.7%) and land use (8.2%), respectively. Based on 10-fold cross-validation, the DNN model reported as a superior algorithm with the lowest prediction error and uncertainty. In terms of accuracy, DNN yielded a mean absolute error of 0.59%, a root mean squared error of 0.75%, a coefficient of determination of 0.65, and Lin's concordance correlation coefficient of 0.83. The SOC content was the highest in udic soil moisture regime class with mean values of 3.71%, followed by the aquic (2.45%) and xeric (2.10%) classes, respectively. Soils in dense forestlands had the highest SOC contents, whereas soils of younger geological age and alluvial fans had lower SOC. The proposed DNN (hidden layers = 7, and size = 50) is a promising algorithm for handling large numbers of auxiliary data at a province-scale, and due to its flexible structure and the ability to extract more information from the auxiliary data surrounding the sampled observations, it had high accuracy for the prediction of the SOC base-line map and minimal uncertainty.
引用
收藏
页数:29
相关论文
共 115 条
[31]   Soil Cd, Cr, Cu, Ni, Pb and Zn sorption and retention models using SVM: Variable selection and competitive model [J].
Gonzalez Costa, J. J. ;
Reigosa, M. J. ;
Matias, J. M. ;
Covelo, E. F. .
SCIENCE OF THE TOTAL ENVIRONMENT, 2017, 593 :508-522
[32]   Driving factors of soil organic carbon fractions over New South Wales, Australia [J].
Gray, Jonathan ;
Karunaratne, Senani ;
Bishop, Thomas ;
Wilson, Brian ;
Veeragathipillai, Manoharan .
GEODERMA, 2019, 353 :213-226
[33]   Factors Controlling Soil Organic Carbon Stocks with Depth in Eastern Australia [J].
Gray, Jonathan M. ;
Bishop, Thomas F. A. ;
Wilson, Brian R. .
SOIL SCIENCE SOCIETY OF AMERICA JOURNAL, 2015, 79 (06) :1741-1751
[34]   The effects of topography on forest soil characteristics in the Oregon Cascade Mountains (USA): Implications for the effects of climate change on soil properties [J].
Griffiths, R. P. ;
Madritch, M. D. ;
Swanson, A. K. .
FOREST ECOLOGY AND MANAGEMENT, 2009, 257 (01) :1-7
[35]   Soil organic carbon density and its driving factors in forest ecosystems across a northwestern province in China [J].
Guan, Jin-Hong ;
Deng, Lei ;
Zhang, Jian-Guo ;
He, Qiu-Yue ;
Shi, Wei-Yu ;
Li, Guoqing ;
Du, Sheng .
GEODERMA, 2019, 352 :1-12
[36]   Selection of terrain attributes and its scale dependency on soil organic carbon prediction [J].
Guo, Zhixing ;
Adhikari, Kabindra ;
Chellasamy, Menaka ;
Greve, Mette B. ;
Owens, Phillip R. ;
Greve, Mogens H. .
GEODERMA, 2019, 340 :303-312
[37]   Reducing the dimensionality of data with neural networks [J].
Hinton, G. E. ;
Salakhutdinov, R. R. .
SCIENCE, 2006, 313 (5786) :504-507
[38]  
Hong-tao Zhang, 2018, Procedia Engineering, V211, P1004, DOI 10.1016/j.proeng.2017.12.103
[39]   Estimation of soil organic matter content by modeling with artificial neural networks [J].
Honorato Fernandes, Mariele Monique ;
Coelho, Anderson Prates ;
Fernandes, Carolina ;
da Silva, Matheus Flavio ;
Dela Marta, Claudia Campos .
GEODERMA, 2019, 350 :46-51
[40]   A hybrid training approach for leaf area index estimation via Cubist and random forests machine-learning [J].
Houborg, Rasmus ;
McCabe, Matthew F. .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2018, 135 :173-188