Post-hoc Evaluation of Sample Size in a Regional Digital Soil Mapping Project

Cited by: 0

Authors:

Saurette, Daniel D. [1,2]

Heck, Richard J. [1]

Gillespie, Adam W. [1]

Berg, Aaron A. [3]

Biswas, Asim [1]

Affiliations:

[1] Univ Guelph, Sch Environm Sci, 50 Stone Rd East, Guelph, ON N1G 2W1, Canada

[2] Ontario Minist Agr Food & Agribusiness, 1 Stone Rd West, Guelph, ON N1G 2Y4, Canada

[3] Univ Guelph, Dept Geog Environm & Geomat, 50 Stone Rd East, Guelph, ON N1G 2W1, Canada

Source:

LAND | 2025 / Vol. 14 / Issue 03

Funding:

Natural Sciences and Engineering Research Council of Canada (NSERC);

Keywords:

sampling design; sample size; digital soil mapping; conventional soil mapping; divergence metrics; operational soil survey; CATION-EXCHANGE CAPACITY;

DOI:

10.3390/land14030545

Chinese Library Classification (CLC) number:

X [Environmental science, safety science];

Discipline classification code:

08 ; 0830 ;

Abstract:

The transition from conventional soil mapping (CSM) to digital soil mapping (DSM) not only affects the final map products, but it also affects the concepts of scale, resolution, and sampling intensity. This is critical because in the CSM approach, sampling intensity is intricately linked to the desired scale of soil map publication, which provided standardization of sampling. This is not the case for DSM where sample size varies widely by project, and sampling design studies have largely focused on where to sample without due consideration for sample size. Using a regional soil survey dataset with 1791 sampled and described soil profiles, we first extracted an external validation dataset using the conditioned Latin hypercube sampling (cLHS) algorithm and then created repeated (n = 10) sample plans of increasing size from the remaining calibration sites using the cLHS, feature space coverage sampling (FSCS), and simple random sampling (SRS). We then trained random forest (RF) models for four soil properties: pH, CEC, clay content, and SOC at five different depths. We identified the effective sample size based on the model learning curves and compared it to the optimal sample size determined from the Jensen-Shannon divergence (DJS) applied to the environmental covariates. Maps were then generated from models that used all the calibration points (reference maps) and from models that used the optimal sample size (optimal maps) for comparison. Our findings revealed that the optimal sample sizes based on the DJS analysis were closely aligned with the effective sample sizes from the model learning curves (815 for cLHS, 832 for FSCS, and 847 for SRS). 
Furthermore, the comparison of the optimal maps to the reference maps showed little difference in the global statistics (concordance correlation coefficient and root mean square error) and spatial trends of the data, confirming that the optimal sample size was sufficient for creating predictions of similar accuracy to the full calibration dataset. Finally, we conclude that the Ottawa soil survey project could have saved between CAD 330,500 and CAD 374,000 (CAD = Canadian dollars) if the tools for determining optimal sample size presented herein had existed during the project planning phase. This clearly illustrates the need for additional research in determining an optimal sample size for DSM and demonstrates that operationalization of DSM in public institutions requires a sound scientific basis for determining sample size.
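As a rough illustration of the divergence idea described in the abstract, the sketch below computes the Jensen-Shannon divergence between the covariate histograms of a sample and those of the full population, for samples of increasing size. It is not the authors' implementation: the synthetic covariates, the 25-bin histograms, the use of simple random sampling, and the helper names `js_divergence` and `mean_djs` are all assumptions made for illustration, and the rule used in the paper to pick an optimal size from the resulting curve is not stated in this record, so none is applied here.

```python
import numpy as np


def js_divergence(p, q):
    """Jensen-Shannon divergence (log base 2, so bounded in [0, 1])
    between two discrete distributions given as counts or probabilities."""
    p = np.asarray(p, dtype=float)
    q = np.asarray(q, dtype=float)
    p = p / p.sum()
    q = q / q.sum()
    m = 0.5 * (p + q)  # mixture distribution

    def kl(a, b):
        # Kullback-Leibler divergence, skipping bins where a == 0.
        mask = a > 0
        return np.sum(a[mask] * np.log2(a[mask] / b[mask]))

    return 0.5 * kl(p, m) + 0.5 * kl(q, m)


def mean_djs(population, sample, n_bins=25):
    """Average divergence over covariates (columns), using histogram
    bins defined on the population (n_bins is an assumed setting)."""
    values = []
    for j in range(population.shape[1]):
        edges = np.histogram_bin_edges(population[:, j], bins=n_bins)
        p, _ = np.histogram(population[:, j], bins=edges)
        q, _ = np.histogram(sample[:, j], bins=edges)
        values.append(js_divergence(p, q))
    return float(np.mean(values))


# Synthetic stand-in for a stack of environmental covariates (assumption).
rng = np.random.default_rng(42)
population = rng.normal(size=(5000, 3))

sizes = [25, 50, 100, 200, 400, 800, 1600]
curve = []
for n in sizes:
    # Ten repeated simple random samples per size, mirroring the n = 10
    # repetitions mentioned in the abstract.
    reps = []
    for _ in range(10):
        idx = rng.choice(len(population), size=n, replace=False)
        reps.append(mean_djs(population, population[idx]))
    curve.append(float(np.mean(reps)))

for n, d in zip(sizes, curve):
    print(f"n = {n:5d}  mean D_JS = {d:.4f}")
```

The printed curve should fall as the sample size grows, since larger samples reproduce the population histograms more closely; the point at which further samples bring little improvement is the kind of quantity the divergence analysis is meant to locate.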

Pages: 22
