Mapping Soil Properties of Africa at 250 m Resolution: Random Forests Significantly Improve Current Predictions

Cited by: 605
Authors
Hengl, Tomislav [1]
Heuvelink, Gerard B. M. [1]
Kempen, Bas [1]
Leenaars, Johan G. B. [1]
Walsh, Markus G. [2,3]
Shepherd, Keith D. [4]
Sila, Andrew [4]
MacMillan, Robert A. [5]
de Jesus, Jorge Mendes [1]
Tamene, Lulseged [6]
Tondoh, Jerome E. [4]
Affiliations
[1] ISRIC World Soil Informat, Wageningen, Netherlands
[2] Columbia Univ, Earth Inst, New York, NY 10027 USA
[3] Selian Agr Res Inst, Arusha, Tanzania
[4] World Agroforestry Ctr, Nairobi, Kenya
[5] LandMapper Environm Solut Inc, Edmonton, AB, Canada
[6] Ctr Int Agr Trop, Lilongwe, Malawi
Source
PLOS ONE | 2015 / Vol. 10 / Issue 06
Keywords
SURVEILLANCE; MODELS; CARBON;
DOI
10.1371/journal.pone.0125814
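Chinese Library Classification
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences]
Discipline classification codes
07; 0710; 09
Abstract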
80% of arable land in Africa has low soil fertility and suffers from physical soil problems. Additionally, significant amounts of nutrients are lost every year due to unsustainable soil management practices. This is partially the result of insufficient use of soil management knowledge. To help bridge the soil information gap in Africa, the Africa Soil Information Service (AfSIS) project was established in 2008. Over the period 2008-2014, the AfSIS project compiled two point data sets: the Africa Soil Profiles (legacy) database and the AfSIS Sentinel Site database. These data sets contain over 28,000 sampling locations and represent the most comprehensive soil sample data sets of the African continent to date. Using these point data sets in combination with a large number of covariates, we have generated a series of spatial predictions of soil properties relevant to agricultural management: organic carbon, pH, sand, silt and clay fractions, bulk density, cation-exchange capacity, total nitrogen, exchangeable acidity, Al content and exchangeable bases (Ca, K, Mg, Na). We specifically investigate differences between two predictive approaches: random forests and linear regression. Results of 5-fold cross-validation demonstrate that the random forests algorithm consistently outperforms the linear regression algorithm, with average decreases of 15-75% in Root Mean Squared Error (RMSE) across soil properties and depths. Fitting and running random forests models takes an order of magnitude more time, and the modelling success is sensitive to artifacts in the input data, but as long as quality-controlled point data are provided, an increase in soil mapping accuracy can be expected. Results also indicate that globally predicted soil classes (USDA Soil Taxonomy, especially Alfisols and Mollisols) help improve continental-scale soil property mapping and are among the most important predictors. This indicates a promising potential for transferring pedological knowledge from data-rich countries to countries with limited soil data.
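The comparison reported in the abstract rests on two quantities: a 5-fold cross-validated RMSE for each model, and the percentage decrease in RMSE of one model relative to the other. The following is a minimal, dependency-free Python sketch of that bookkeeping, not the authors' workflow. The data are synthetic, all function names are hypothetical, and a simple nearest-neighbour regressor stands in for random forests purely to provide a nonlinear competitor to linear regression. Pooling squared errors across folds is also an assumption; the paper may aggregate differently.

```python
import math
import random


def k_fold_indices(n, k, seed=0):
    """Shuffle 0..n-1 and deal the indices into k nearly equal folds."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]


def fit_linear(xs, ys):
    """Ordinary least squares for y = a + b*x; returns a predict function."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    b = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sxx
    a = my - b * mx
    return lambda x: a + b * x


def fit_knn(xs, ys, k=5):
    """Mean of the k nearest neighbours: a nonlinear stand-in, NOT a random forest."""
    pairs = list(zip(xs, ys))

    def predict(x):
        nearest = sorted(pairs, key=lambda p: abs(p[0] - x))[:k]
        return sum(y for _, y in nearest) / len(nearest)

    return predict


def cv_rmse(fit, xs, ys, k=5, seed=0):
    """Fit on k-1 folds, predict the held-out fold, pool squared errors, take the root."""
    sq_err = []
    for fold in k_fold_indices(len(xs), k, seed):
        held_out = set(fold)
        train = [i for i in range(len(xs)) if i not in held_out]
        model = fit([xs[i] for i in train], [ys[i] for i in train])
        sq_err += [(model(xs[i]) - ys[i]) ** 2 for i in fold]
    return math.sqrt(sum(sq_err) / len(sq_err))


def rmse_decrease_pct(rmse_baseline, rmse_new):
    """Percentage decrease in RMSE relative to the baseline model."""
    return 100.0 * (rmse_baseline - rmse_new) / rmse_baseline


# Synthetic, clearly nonlinear data (an assumption made only for illustration).
rng = random.Random(42)
xs = [rng.uniform(0.0, 10.0) for _ in range(200)]
ys = [math.sin(x) + rng.gauss(0.0, 0.1) for x in xs]

rmse_lm = cv_rmse(fit_linear, xs, ys)
rmse_knn = cv_rmse(fit_knn, xs, ys)
print(f"linear RMSE:    {rmse_lm:.3f}")
print(f"nonlinear RMSE: {rmse_knn:.3f}")
print(f"RMSE decrease:  {rmse_decrease_pct(rmse_lm, rmse_knn):.1f}%")
```

Both models are scored on exactly the same folds (same `seed`), so the difference in RMSE reflects the models rather than the split.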
Pages: 26