Instance selection in digital soil mapping: a study case in Rio Grande do Sul, Brazil

被引:11
作者
Giasson, Elvio [1 ]
ten Caten, Alexandre [2 ]
Bagatini, Tatiane [1 ]
Bonfatti, Benito [1 ]
机构
[1] Univ Fed Rio Grande do Sul, BR-91540000 Porto Alegre, RS, Brazil
[2] UFSC, Curitibanos, SC, Brazil
来源
CIENCIA RURAL | 2015年 / 45卷 / 09期
关键词
soil survey; data sampling; model training; CLASSIFICATION TREE;
D O I
10.1590/0103-8478cr20140694
中图分类号
S3 [农学(农艺学)];
学科分类号
0901 ;
摘要
A critical issue in digital soil mapping (DSM) is the selection of data sampling method for model training. One emerging approach applies instance selection to reduce the size of the dataset by drawing only relevant samples in order to obtain a representative subset that is still large enough to preserve relevant information, but small enough to be easily handled by learning algorithms. Although there are suggestions to distribute data sampling as a function of the soil map unit (MU) boundaries location, there are still contradictions among research recommendations for locating samples either closer or more distant from soil MU boundaries. A study was conducted to evaluate instance selection methods based on spatially-explicit data collection using location in relation to soil MU boundaries as the main criterion. Decision tree analysis was performed for modeling digital soil class mapping using two different sampling schemes: a) selecting sampling points located outside buffers near soil MU boundaries, and b) selecting sampling points located within buffers near soil MU boundaries. Data was prepared for generating classification trees to include only data points located within or outside buffers with widths of 60, 120, 240, 360, 480, and 600m near MU boundaries. Instance selection methods using both spatial selection of methods was effective for reduced size of the dataset used for calibrating classification tree models, but failed to provide advantages to digital soil mapping because of potential reduction in the accuracy of classification tree models
引用
收藏
页码:1592 / 1598
页数:7
相关论文
共 19 条
[1]  
ABRAMS M., 1999, ASTER US HDB
[2]  
[Anonymous], 1999, Ph.D. Thesis
[3]  
[Anonymous], 2010, KEYS SOIL TAX
[4]  
[Anonymous], 2014, C4. 5: programs for machine learning
[5]  
[Anonymous], 1984, OLSHEN STONE CLASSIF, DOI 10.2307/2530946
[6]  
[Anonymous], 2011, ENCY MACHINE LEARNIN
[7]  
[Anonymous], 1998, Feature Extraction, Construction and Selection: A Data Mining Perspective
[8]  
Beven K.J., 1979, Hydrological Sciences Bulletin, V24, P43, DOI DOI 10.1080/02626667909491834
[9]   Extracting soil-landscape rules from previous soil surveys [J].
Bui, EN ;
Loughhead, A ;
Corner, R .
AUSTRALIAN JOURNAL OF SOIL RESEARCH, 1999, 37 (03) :495-508
[10]  
EMBRAPA, 2013, BRAZ SOIL CLASS SYST