Comparison of boosted regression tree and random forest models for mapping topsoil organic carbon concentration in an alpine ecosystem

被引:314
作者
Yang, Ren-Min [1 ,2 ]
Zhang, Gan-Lin [1 ,2 ]
Liu, Feng [1 ]
Lu, Yuan-Yuan [1 ,2 ]
Yang, Fan [1 ,2 ]
Yang, Fei [1 ,2 ]
Yang, Min [1 ,2 ]
Zhao, Yu-Guo [1 ]
Li, De-Cheng [1 ]
机构
[1] Chinese Acad Sci, Inst Soil Sci, State Key Lab Soil & Sustainable Agr, Nanjing 210008, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
Digital soil mapping; Soil organic carbon; Boosted regression tree; Random forest; Environmental variables; Tibetan Plateau; SOIL CARBON; SPATIAL-DISTRIBUTION; STOCKS; VEGETATION; STORAGE; IMAGERY; QUANTIFICATION; CLASSIFICATION; VARIABILITY; PREDICTION;
D O I
10.1016/j.ecolind.2015.08.036
中图分类号
X176 [生物多样性保护];
学科分类号
090705 ;
摘要
Soil organic carbon (SOC) plays an important role in soil fertility and carbon sequestration, and a better understanding of the spatial patterns of SOC is essential for soil resource management. In this study, we used boosted regression tree (BRT) and random forest (RF) models to map the distribution of topsoil organic carbon content at the northeastern edge of the Tibetan Plateau in China. A set 01 105 soil samples and 12 environmental variables (including topography, climate and vegetation) were analyzed. The performance of the models was evaluated using a 10-fold cross-validation procedure. Maps of the mean values and standard deviations of SOC were generated to illustrate model variability and uncertainty. The results indicate that the BRT and RF models exhibited very similar performance and yielded similar predicted distributions of SOC. The two models explained approximately 70% of the total SOC variability. The BRT and RF models robustly predicted the SOC at low observed SOC values, whereas they underestimated high observed SOC values. This underestimation may have been caused by biased distributions of soil samples in the SOC space. Vegetation-related variables were assigned the highest importance in both models, followed by climate and topography. Both models produced spatial distribution maps of SOC that were closely related to vegetation cover. The SOC content predicted by the BRT model was clearly higher than that of the RF model in areas with greater vegetation cover because the contributions of vegetation-related variables in the two models (65% and 43%, respectively) differed significantly. The predicted SOC content increased from the northwestern to the southeastern part of the study area, average values produced by the BRT and RF models were 27.3 g kg(-1) and 26.6 g kg(-1), respectively. We conclude that the BRT and RF methods should be calibrated and compared to obtain the best prediction of SOC spatial distribution in similar regions. In addition, vegetation variables, including those obtained from remote sensing imagery, should be taken as the main environmental indicators and explicitly included when generating SOC maps in Alpine environments. (C) 2015 Elsevier Ltd. All rights reserved.
引用
收藏
页码:870 / 878
页数:9
相关论文
共 65 条
[1]  
[Anonymous], 1993, An introduction to the bootstrap
[2]  
[Anonymous], 2014, Keys to Soil Taxonomy
[3]  
[Anonymous], 2001, KEYS CHIN SOIL TAX
[4]  
[Anonymous], 2012, BIOGEOSCIENCES, DOI DOI 10.5194/bg-9-2287-2012
[5]  
[Anonymous], LANDS TM MOS IM HEIH
[6]  
[Anonymous], 2011, LANDUSE LANDCOVER DA
[7]  
[Anonymous], 1995, SOIL SCI
[8]   Total carbon and nitrogen in the soils of the world [J].
Batjes, N. H. .
EUROPEAN JOURNAL OF SOIL SCIENCE, 2014, 65 (01) :10-21
[9]   Carbon losses from all soils across England and Wales 1978-2003 [J].
Bellamy, PH ;
Loveland, PJ ;
Bradley, RI ;
Lark, RM ;
Kirk, GJD .
NATURE, 2005, 437 (7056) :245-248
[10]   ESTIMATE OF ORGANIC-CARBON IN WORLD SOILS .2. [J].
BOHN, HL .
SOIL SCIENCE SOCIETY OF AMERICA JOURNAL, 1982, 46 (05) :1118-1119