Improving the Spatial Prediction of Soil Organic Carbon Content in Two Contrasting Climatic Regions by Stacking Machine Learning Models and Rescanning Covariate Space

Cited by: 141
Authors
Taghizadeh-Mehrjardi, Ruhollah [1,2]
Schmidt, Karsten [3,4]
Amirian-Chakan, Alireza [5]
Rentschler, Tobias [1,6]
Zeraatpisheh, Mojtaba [7]
Sarmadian, Fereydoon [8]
Valavi, Roozbeh [9]
Davatgar, Naser [10]
Behrens, Thorsten [1,4,6]
Scholten, Thomas [1,4,6]
Affiliations
[1] Univ Tubingen, Dept Geosci Soil Sci & Geomorphol, D-72070 Tubingen, Germany
[2] Ardakan Univ, Fac Agr & Nat Resources, Ardakan 8951656767, Iran
[3] Univ Tubingen, eSci Ctr, D-72070 Tubingen, Germany
[4] Univ Tubingen, DFG Cluster Excellence Machine Learning, D-72070 Tubingen, Germany
[5] Lorestan Univ, Dept Soil Sci, Khorramabad 6815144316, Iran
[6] Univ Tubingen, CRC ResourceCultures 1070, D-72070 Tubingen, Germany
[7] Henan Univ, Coll Environm & Planning, Key Lab Geospatial Technol Middle & Lower Yellow, Kaifeng 475004, Peoples R China
[8] Univ Tehran, Dept Soil Sci, Coll Agr, Karaj 7787131587, Iran
[9] Univ Melbourne, Sch BioSci, Quantitat & Appl Ecol Grp, Melbourne, Vic 3010, Australia
[10] Agr Res Educ & Extens Org, Soil & Water Res Inst, Karaj 3177993545, Iran
Keywords
digital soil mapping; machine learning models; stacking of models; spatial block cross-validation; deep learning; ARTIFICIAL NEURAL-NETWORKS; RANDOM FORESTS; SEMIARID RANGELANDS; PROFILE DEPTH; STOCKS; MATTER; REGRESSION; MAPS; TOPOGRAPHY; LANDSCAPE;
DOI
10.3390/rs12071095
Chinese Library Classification
X [Environmental Science, Safety Science];
Discipline classification code
08; 0830;
Abstract
Understanding the spatial distribution of soil organic carbon (SOC) content across different climatic regions will enhance our knowledge of carbon gains and losses due to climate change. However, little is known about the SOC content in the contrasting arid and sub-humid regions of Iran, whose complex SOC-landscape relationships pose a challenge to spatial analysis. Machine learning (ML) models within a digital soil mapping framework can capture such complex relationships. Current research focuses on ensemble ML models to increase prediction accuracy, and the usual ensemble methods are boosting or weighted averaging. This study proposes a novel ensemble technique: the stacking of multiple ML models through a meta-learning model. We first applied six state-of-the-art ML models (i.e., Cubist, random forests (RF), extreme gradient boosting (XGBoost), classical artificial neural networks (ANN), a neural network ensemble based on model averaging (AvNNet), and deep learning neural networks (DNN)) to predict and map the spatial distribution of SOC content at six soil depth intervals for both regions. We then tested the stacking of these models through a meta-learning model, with and without rescanning the covariate space, to maximize the prediction accuracy. Of the six ML models, the DNN yielded the best modeling accuracies, followed by RF, XGBoost, AvNNet, ANN, and Cubist. Importantly, the stacking of models indicated a significant improvement in the prediction of SOC content, especially when combined with rescanning the covariate space. For instance, for the upper 0-5 cm of the soil profiles, the RMSE values of SOC content prediction obtained by the proposed stacking approaches were 17% (arid site) and 9% (sub-humid site) lower than those obtained by the DNN models, the best individual models. This indicates that rescanning the original covariate space by a meta-learning model can extract more information and improve the SOC content prediction accuracy. Overall, our results suggest that stacking diverse sets of models could be used to estimate the spatial distribution of SOC content in different climatic regions more accurately.
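The difference between stacking with and without rescanning the covariate space can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes scikit-learn's `StackingRegressor`, whose `passthrough` option hands the original features to the meta-learner alongside the base-model predictions, as a rough analogue of rescanning. The two base learners, the linear meta-learner, and the synthetic data are stand-ins chosen for brevity, not the six models or the soil covariates used in the study.

```python
import numpy as np
from sklearn.datasets import make_regression
from sklearn.ensemble import RandomForestRegressor, StackingRegressor
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsRegressor

# Synthetic stand-in for environmental covariates (X) and SOC content (y).
X, y = make_regression(n_samples=300, n_features=8, noise=10.0, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0
)


def build_stack(rescan: bool) -> StackingRegressor:
    """Stack two base learners under a linear meta-learner.

    With rescan=True the meta-learner also receives the original
    covariates (scikit-learn's `passthrough`), an assumed analogue
    of rescanning the covariate space.
    """
    base_learners = [
        ("rf", RandomForestRegressor(n_estimators=50, random_state=0)),
        ("knn", KNeighborsRegressor(n_neighbors=5)),
    ]
    return StackingRegressor(
        estimators=base_learners,
        final_estimator=LinearRegression(),
        cv=5,  # out-of-fold base predictions train the meta-learner
        passthrough=rescan,
    )


rmse = {}         # test RMSE per variant
meta_inputs = {}  # number of inputs seen by the meta-learner
for rescan in (False, True):
    model = build_stack(rescan).fit(X_train, y_train)
    pred = model.predict(X_test)
    rmse[rescan] = float(np.sqrt(np.mean((y_test - pred) ** 2)))
    meta_inputs[rescan] = model.final_estimator_.coef_.shape[0]

print("meta-learner inputs:", meta_inputs)
print("test RMSE:", rmse)
```

Without rescanning, the meta-learner sees only one prediction per base model; with rescanning, it sees those predictions plus all eight covariates. Whether the extra inputs lower the RMSE depends on the data. The plain random K-fold used here for brevity would also need to be replaced by spatial block cross-validation for real soil data.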
Pages: 26