A novel intelligence approach based active and ensemble learning for agricultural soil organic carbon prediction using multispectral and SAR data fusion

被引:80
作者
Thu Thuy Nguyen [1 ]
Tien Dat Pham [2 ,4 ]
Chi Trung Nguyen [3 ]
Delfos, Jacob [4 ]
Archibald, Robert [4 ]
Kinh Bac Dang [5 ]
Ngoc Bich Hoang [6 ]
Guo, Wenshan [1 ]
Huu Hao Ngo [1 ,6 ]
机构
[1] Univ Technol Sydney, Ctr Technol Water & Wastewater, Sch Civil & Environm Engn, Sydney, NSW 2007, Australia
[2] Macquarie Univ, Dept Earth & Environm Sci, N Ryde, NSW 2109, Australia
[3] Univ New England, Fac Sci Agr Business & Law, UNE Business Sch, Elm Ave, Armidale, NSW 2351, Australia
[4] Astron Environm Serv, 129 Royal St, East Perth, WA 6004, Australia
[5] VNU Univ Sci, Fac Geog, 334 Nguyen Trai, Hanoi, Vietnam
[6] Nguyen Tat Thanh Univ, Inst Environm Sci, Ho Chi Minh City, Vietnam
关键词
SOC; Machine learning; Multi-sensor data fusion; Sentinel; 1; 2; SENTINEL-2; VEGETATION; INDEXES; AIRBORNE; TEXTURE; BANDS; RED;
D O I
10.1016/j.scitotenv.2021.150187
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Monitoring agricultural soil organic carbon (SOC) has played an essential role in sustainable agricultural management. Precise and robust prediction of SOC greatly contributes to carbon neutrality in the agricultural industry. To create more knowledge regarding the ability of remote sensing to monitor carbon soil, this research devises a state-of-the-art low cost machine learning model for quantifying agricultural soil carbon using active and ensemble-based decision tree learning combined with multi-sensor data fusion at a national and world scale. This work explores the use of Sentinel-1 (S1) C-band dual polarimetric synthetic aperture radar (SAR), Sentinel-2 (S2) multispectral data, and an innovative machine learning (ML) approach using an integration of active learning for land-use mapping and advanced Extreme Gradient Boosting (XGBoost) for robustness of the SOC estimates. The collected soil samples from a field survey in Western Australia were used for the model validation. The indicators including the coefficient of determination (R-2) and root - mean - square - error (RMSE) were applied to evaluate the model's performance. A numerous features computed from optical and SAR data fusion were employed to build and test the proposed model performance. The effectiveness of the proposed machine learning model was assessed by comparing with the two well-known algorithms such as Random Forests (RF) and Support Vector Machine (SVM) to predict agricultural SOC. Results suggest that a combination of S1 and S2 sensors could effectively estimate SOC in farming areas by using ML techniques. Satisfactory accuracy of the proposed XGBoost with optimal features was achieved the highest performance (R-2 = 0.870; RMSE - 1.818 tonC/ha) which outperformed RF and SVM. Thus, multi-sensor data fusion combined with the XGBoost lead to the best prediction results for agricultural SOC at 10 m spatial resolution. In short, this new approach could significantly contribute to various agricultural SOC retrieval studies globally. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:12
相关论文
共 58 条
  • [1] UAS-based soil carbon mapping using VIS-NIR (480-1000 nm) multi-spectral imaging: Potential and limitations
    Aldana-Jague, Emilien
    Heckrath, Goswin
    Macdonald, Andy
    van Wesemael, Bas
    Van Oost, Kristof
    [J]. GEODERMA, 2016, 275 : 55 - 66
  • [2] Remote Sensing Techniques for Soil Organic Carbon Estimation: A Review
    Angelopoulou, Theodora
    Tziolas, Nikolaos
    Balafoutis, Athanasios
    Zalidis, George
    Bochtis, Dionysis
    [J]. REMOTE SENSING, 2019, 11 (06)
  • [3] Breiman L., 2001, Machine Learning, V45, P5
  • [4] Evaluating the capability of the Sentinel 2 data for soil organic carbon prediction in croplands
    Castaldi, Fabio
    Hueni, Andreas
    Chabrillat, Sabine
    Ward, Kathrin
    Buttafuoco, Gabriele
    Bomans, Bart
    Vreys, Kristin
    Brell, Maximilian
    van Wesemael, Bas
    [J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2019, 147 : 267 - 282
  • [5] Evaluation of the potential of the current and forthcoming multispectral and hyperspectral imagers to estimate soil texture and organic carbon
    Castaldi, Fabio
    Palombo, Angelo
    Santini, Federico
    Pascucci, Simone
    Pignatti, Stefano
    Casa, Raffaele
    [J]. REMOTE SENSING OF ENVIRONMENT, 2016, 179 : 54 - 65
  • [6] XGBoost: A Scalable Tree Boosting System
    Chen, Tianqi
    Guestrin, Carlos
    [J]. KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 785 - 794
  • [7] The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation
    Chicco, Davide
    Jurman, Giuseppe
    [J]. BMC GENOMICS, 2020, 21 (01)
  • [8] SUPPORT-VECTOR NETWORKS
    CORTES, C
    VAPNIK, V
    [J]. MACHINE LEARNING, 1995, 20 (03) : 273 - 297
  • [9] Cristianini N., 2008, ENCY ALGORITHMS, V1, P928
  • [10] On the use of remote sensing techniques for monitoring spatio-temporal soil organic carbon dynamics in agricultural systems
    Croft, H.
    Kuhn, N. J.
    Anderson, K.
    [J]. CATENA, 2012, 94 : 64 - 74