A high resolution map of soil types and physical properties for Cyprus: A digital soil mapping optimization

被引:113
作者
Camera, Corrado [1 ]
Zomeni, Zomenia [2 ]
Noller, Jay S. [3 ]
Zissimos, Andreas M. [2 ]
Christoforou, Irene C. [2 ]
Bruggeman, Adriana [1 ]
机构
[1] Cyprus Inst, Energy Environm & Water Res Ctr, CY-2121 Aglantzia, Lefkosia, Cyprus
[2] Minist Agr Rural Dev & Environm, Cyprus Geol Survey, Strovolos, Lefkosia, Cyprus
[3] Oregon State Univ, Dept Crop & Soil Sci, Corvallis, OR 97331 USA
关键词
Cyprus; Digital soil mapping; Model optimization; Random Forest; Soil landscape model; World Reference Base; CLASSIFICATION TREE ANALYSIS; ORGANIC-CARBON; RANDOM FOREST; GEOCHEMICAL PATTERNS; SPATIAL-DISTRIBUTION; VARIABLE IMPORTANCE; PREDICTION; REGION; LANDSCAPES; INTENSITY;
D O I
10.1016/j.geoderma.2016.09.019
中图分类号
S15 [土壤学];
学科分类号
0903 ; 090301 ;
摘要
Fine-resolution soil maps constitute important data for many different environmental studies. Digital soil mapping techniques represent a cost-effective method to obtain detailed information about soil types and soil properties over large areas. The main objective of the study was to extend predictions from 1:25,000 legacy soil surveys (including WRB soil groups, soil depth and soil texture classes) to the larger area of Cyprus. A multiple-trees classification technique, namely Random Forest (RF), was applied. Specific objectives were: (i) to analyze the role and importance of a large data set of environmental predictors, (ii) to investigate the effect of the number of training points, forest size (ntree), the numbers of predictors sampled per node (mtry) and tree size (nodesize) in RF; (iii) to compare RF-derived maps with maps derived with a multinomial logistic regression model, in terms of validation error (test set and independent profiles) and map uncertainty, using the confusion index and a newly developed reliability index. The optimized RF model was run using half of the input points available (over a million) and with ntree equal to 350. The mtry parameter was set to 5 (close to half the number of the environmental variables used) for both soil series and soil properties. The nodesize calibration showed no relevant performance increase and was kept at its default value (1). In terms of environmental variables, the model used 10 predictors, covering all the soil formation factors considered in the scorpan formula, to derive the three maps. Soil properties, derived from geochemistry data, showed a high importance in deriving soil groups, depths and texture. Random Forest constructed a better predictive model than multinomial logistic-regression, showing comparable predictive uncertainty but much lower validation error. The RF-derived maps show very low out of bag (OOB) errors (around 10% for both soil groups and soil properties) but relatively high validation error from independent profiles (45% for soil depth, 51% for soil texture). The resulting reliability index was low in the main mountainous area of Cyprus, where predictions were extrapolations as indicated by the multivariate environmental similarity surface, but medium to high in the main agricultural areas of the country. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:35 / 49
页数:15
相关论文
共 80 条
[1]   Total soil organic carbon and carbon sequestration potential in Nigeria [J].
Akpa, Stephen I. C. ;
Odeh, Inakwu O. A. ;
Bishop, Thomas F. A. ;
Hartemink, Alfred E. ;
Amapu, Ishaku Y. .
GEODERMA, 2016, 271 :202-215
[2]  
Allen R. G., 1998, FAO Irrigation and Drainage Paper
[3]  
[Anonymous], 2014, RANDOMFOREST BREIMAN
[4]  
[Anonymous], GEOMORPHOLOGY CYPRUS
[5]   Quantifying chemical weathering intensity and trace element release from two contrasting basalt profiles, Deccan Traps, India [J].
Babechuk, M. G. ;
Widdowson, M. ;
Kamber, B. S. .
CHEMICAL GEOLOGY, 2014, 363 :56-75
[6]   Land use and climate control the spatial distribution of soil types in the grasslands of Inner Mongolia [J].
Barthold, F. K. ;
Wiesmeier, M. ;
Breuer, L. ;
Frede, H-G ;
Wu, J. ;
Blank, F. B. .
JOURNAL OF ARID ENVIRONMENTS, 2013, 88 :194-205
[7]   Multi-scale digital terrain analysis and feature selection for digital soil mapping [J].
Behrens, Thorsten ;
Zhu, A-Xing ;
Schmidt, Karsten ;
Scholten, Thomas .
GEODERMA, 2010, 155 (3-4) :175-185
[8]   Modelling climate change impacts on and adaptation strategies for agriculture in Sardinia and Tunisia using AquaCrop and value-at-risk [J].
Bird, David Neil ;
Benabdallah, Sihem ;
Gouda, Nadine ;
Hummel, Franz ;
Koeberl, Judith ;
La Jeunesse, Isabelle ;
Meyer, Swen ;
Prettenthaler, Franz ;
Soddu, Antonino ;
Woess-Gallasch, Susanne .
SCIENCE OF THE TOTAL ENVIRONMENT, 2016, 543 :1019-1027
[9]  
Bolstad P. V., 1990, International Journal of Geographical Information Systems, V4, P399, DOI 10.1080/02693799008941555
[10]   Overview of random forest methodology and practical guidance with emphasis on computational biology and bioinformatics [J].
Boulesteix, Anne-Laure ;
Janitza, Silke ;
Kruppa, Jochen ;
Koenig, Inke R. .
WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2012, 2 (06) :493-507