Improving pedotransfer functions for predicting soil mineral associated organic carbon by ensemble machine learning

被引:33
|
作者
Xiao, Yi [1 ,2 ]
Xue, Jie [2 ]
Zhang, Xianglin [2 ]
Wang, Nan [2 ]
Hong, Yongsheng [2 ]
Jiang, Yefeng [2 ]
Zhou, Yin [3 ]
Teng, Hongfen [4 ]
Hu, Bifeng [5 ]
Lugato, Emanuele [6 ]
Richer-de-Forges, Anne C. [7 ]
Arrouays, Dominique [7 ]
Shi, Zhou [2 ]
Chen, Songchao [1 ,2 ]
机构
[1] ZJU Hangzhou Global Sci & Technol Innovat Ctr, Hangzhou 311200, Peoples R China
[2] Zhejiang Univ, Inst Appl Remote Sensing & Informat Technol, Coll Environm & Resource Sci, Hangzhou 310058, Peoples R China
[3] Zhejiang Univ Finance & Econ, Inst Land & Urban Rural Dev, Hangzhou 310018, Peoples R China
[4] Wuhan Inst Technol, Sch Environm Ecol & Biol Engn, Wuhan 430205, Peoples R China
[5] Jiangxi Univ Finance & Econ, Sch Tourism & Urban Management, Dept Land Resource Management, Nanchang 330013, Peoples R China
[6] European Commiss, Joint Res Ctr JRC, Ispra, Italy
[7] INRAE, Unite InfoSol, F-45075 Orleans, France
关键词
Soil carbon fractions; Model ensemble; LUCAS Soil; Forward recursive feature selection; RESOLUTION MAP; MATTER; FRACTIONS; MODEL; STOCKS; CALIBRATION; DYNAMICS; NITROGEN; DATASET;
D O I
10.1016/j.geoderma.2022.116208
中图分类号
S15 [土壤学];
学科分类号
0903 ; 090301 ;
摘要
Soil organic carbon (SOC) sequestration is a promising natural climate solution for capturing atmospheric CO2, and it provides crucial co-benefits in improving soil functions and services at the same time. Given that SOC is not a single and uniform entity, a deep understanding of SOC response to environmental changes requires additional information on SOC fractions with distinct characteristics such as particulate organic carbon (POC) and mineral associated organic carbon (MAOC). Despite their great importance, POC and MAOC information is still scarce in the soil databases, particularly on a broad scale. Pedotransfer function (PTF) is a good strategy to estimate missing soil properties, while its application in SOC fractions has been poorly explored. Based on 352 representative mineral topsoil samples (0-20 cm) across Europe, we evaluated the potential of MAOC prediction using machine learning based PTF (random forest (RF), Cubist, and gradient boosted machine (GBM)) together with predictor selection methods (recursive feature elimination (RFE) and forward recursive feature selection (FRFS)). The repeated validation (100 times) showed that MAOC could be well predicted by machine learning based PTFs (R2 of 0.877-0.9, RMSE of 2.994-3.269 g kg- 1). RFE can effectively reduce the number of predictors from 21 to 12 with comparable performance to the models using all predictors. The proposed FRFS algorithm had the best model parsimony with only 6 predictors (SOC, silt + clay, nitrogen, nitrogen deposition, soil erosion and sand) and performed similar to or even better than RFE. In combination with FRFS, Cubist performed best among the three machine learning models (R2 of 0.9, RMSE of 2.994 g kg- 1). Our results also showed that five model ensemble methods had similar model performance and can improve model accuracy and robustness compared to a single machine learning model. This study provides a valuable reference for coupling PTF and legacy soil databases to increase the spatial coverage and the performance of machine learning based SOC fraction predictions.
引用
收藏
页数:8
相关论文
共 50 条
  • [41] Evaluation of pedotransfer functions for predicting particle density of soils with low organic matter contents
    Qin, Lu
    Lin, Lirong
    Ding, Shuwen
    Yi, Ceng
    Chen, Jiazhou
    Tian, Zhengchao
    GEODERMA, 2022, 416
  • [42] Improving carbon flux estimation in tea plantation ecosystems: A machine learning ensemble approach
    Raza, Ali
    Hu, Yongguang
    Lu, Yongzong
    EUROPEAN JOURNAL OF AGRONOMY, 2024, 160
  • [43] Ensemble of optimised machine learning algorithms for predicting surface soil moisture content at a global scale
    Han, Qianqian
    Zeng, Yijian
    Zhang, Lijie
    Cira, Calimanut-Ionut
    Prikaziuk, Egor
    Duan, Ting
    Wang, Chao
    Szabo, Brigitta
    Manfreda, Salvatore
    Zhuang, Ruodan
    Su, Bob
    GEOSCIENTIFIC MODEL DEVELOPMENT, 2023, 16 (20) : 5825 - 5845
  • [44] Machine learning in space and time for modelling soil organic carbon change
    Heuvelink, Gerard B. M.
    Angelini, Marcos E.
    Poggio, Laura
    Bai, Zhanguo
    Batjes, Niels H.
    van den Bosch, Rik
    Bossio, Deborah
    Estella, Sergio
    Lehmann, Johannes
    Olmedo, Guillermo F.
    Sanderman, Jonathan
    EUROPEAN JOURNAL OF SOIL SCIENCE, 2021, 72 (04) : 1607 - 1623
  • [45] Machine Learning Ensemble Modelling for Predicting Unemployment Duration
    Gabrikova, Barbora
    Svabova, Lucia
    Kramarova, Katarina
    APPLIED SCIENCES-BASEL, 2023, 13 (18):
  • [46] Improving the accuracy of soil organic carbon content prediction based on visible and near-infrared spectroscopy and machine learning
    Mingxing Xu
    Xianyao Chu
    Yesi Fu
    Changjiang Wang
    Shaohua Wu
    Environmental Earth Sciences, 2021, 80
  • [47] A comprehensive evaluation of pedotransfer functions for predicting soil water content in environmental modeling and ecosystem management
    Contreras, Cristina P.
    Bonilla, Carlos A.
    SCIENCE OF THE TOTAL ENVIRONMENT, 2018, 644 : 1580 - 1590
  • [48] Improving the accuracy of soil organic carbon content prediction based on visible and near-infrared spectroscopy and machine learning
    Xu, Mingxing
    Chu, Xianyao
    Fu, Yesi
    Wang, Changjiang
    Wu, Shaohua
    ENVIRONMENTAL EARTH SCIENCES, 2021, 80 (08)
  • [49] Developing pedotransfer functions using Sentinel-2 satellite spectral indices and Machine learning for estimating the surface soil moisture
    Sedaghat, Azadeh
    Shahrestani, Mahmoud Shabanpour
    Noroozi, Ali Akbar
    Nosratabad, Alireza Fallah
    Bayat, Hossein
    JOURNAL OF HYDROLOGY, 2022, 606
  • [50] Improving ensemble docking for drug discovery by machine learning
    Wong, Chung F.
    JOURNAL OF THEORETICAL & COMPUTATIONAL CHEMISTRY, 2019, 18 (03):