Post-hoc Evaluation of Sample Size in a Regional Digital Soil Mapping Project

被引:0
|
作者
Saurette, Daniel D. [1 ,2 ]
Heck, Richard J. [1 ]
Gillespie, Adam W. [1 ]
Berg, Aaron A. [3 ]
Biswas, Asim [1 ]
机构
[1] Univ Guelph, Sch Environm Sci, 50 Stone Rd East, Guelph, ON N1G 2W1, Canada
[2] Ontario Minist Agr Food & Agribusiness, 1 Stone Rd West, Guelph, ON N1G 2Y4, Canada
[3] Univ Guelph, Dept Geog Environm & Geomat, 50 Stone Rd East, Guelph, ON N1G 2W1, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
sampling design; sample size; digital soil mapping; conventional soil mapping; divergence metrics; operational soil survey; CATION-EXCHANGE CAPACITY;
D O I
10.3390/land14030545
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The transition from conventional soil mapping (CSM) to digital soil mapping (DSM) not only affects the final map products, but it also affects the concepts of scale, resolution, and sampling intensity. This is critical because in the CSM approach, sampling intensity is intricately linked to the desired scale of soil map publication, which provided standardization of sampling. This is not the case for DSM where sample size varies widely by project, and sampling design studies have largely focused on where to sample without due consideration for sample size. Using a regional soil survey dataset with 1791 sampled and described soil profiles, we first extracted an external validation dataset using the conditioned Latin hypercube sampling (cLHS) algorithm and then created repeated (n = 10) sample plans of increasing size from the remaining calibration sites using the cLHS, feature space coverage sampling (FSCS), and simple random sampling (SRS). We then trained random forest (RF) models for four soil properties: pH, CEC, clay content, and SOC at five different depths. We identified the effective sample size based on the model learning curves and compared it to the optimal sample size determined from the Jensen-Shannon divergence (DJS) applied to the environmental covariates. Maps were then generated from models that used all the calibration points (reference maps) and from models that used the optimal sample size (optimal maps) for comparison. Our findings revealed that the optimal sample sizes based on the DJS analysis were closely aligned with the effective sample sizes from the model learning curves (815 for cLHS, 832 for FSCS, and 847 for SRS). Furthermore, the comparison of the optimal maps to the reference maps showed little difference in the global statistics (concordance correlation coefficient and root mean square error) and spatial trends of the data, confirming that the optimal sample size was sufficient for creating predictions of similar accuracy to the full calibration dataset. Finally, we conclude that the Ottawa soil survey project could have saved between CAD 330,500 and CAD 374,000 (CAD = Canadian dollars) if the determination of optimal sample size tools presented herein existed during the project planning phase. This clearly illustrates the need for additional research in determining an optimal sample size for DSM and demonstrates that operationalization of DSM in public institutions requires a sound scientific basis for determining sample size.
引用
收藏
页数:22
相关论文
共 30 条
  • [1] SAMPLE SIZE AND INTRASUBJECT COEFFICIENT OF VARIATION OF BIOEQUIVALENCE STUDIES: A POST-HOC ANALYSIS
    Chung, I.
    Lee, S.
    Yoon, S.
    Yu, K.
    Lee, H.
    Jang, I.
    Chung, J.
    CLINICAL PHARMACOLOGY & THERAPEUTICS, 2016, 99 : S27 - S27
  • [2] Sample Size Optimization for Digital Soil Mapping: An Empirical Example
    Saurette, Daniel D.
    Heck, Richard J.
    Gillespie, Adam W.
    Berg, Aaron A.
    Biswas, Asim
    LAND, 2024, 13 (03)
  • [3] Effects of sample size and covariate resolution on field-scale predictive digital mapping of soil carbon
    Saurette, Daniel D.
    Berg, Aaron A.
    Laamrani, Ahmed
    Heck, Richard J.
    Gillespie, Adam W.
    Voroney, Paul
    Biswas, Asim
    GEODERMA, 2022, 425
  • [4] Divergence metrics for determining optimal training sample size in digital soil mapping
    Saurette, Daniel D.
    Heck, Richard J.
    Gillespie, Adam W.
    Berg, Aaron A.
    Biswas, Asim
    GEODERMA, 2023, 436
  • [5] Adaptation of regional digital soil mapping for precision agriculture
    Mats Söderström
    Gustav Sohlenius
    Lars Rodhe
    Kristin Piikki
    Precision Agriculture, 2016, 17 : 588 - 607
  • [6] Adaptation of regional digital soil mapping for precision agriculture
    Soderstrom, Mats
    Sohlenius, Gustav
    Rodhe, Lars
    Piikki, Kristin
    PRECISION AGRICULTURE, 2016, 17 (05) : 588 - 607
  • [7] Consultants' forum: should post hoc sample size calculations be done?
    Walters, Stephen J.
    PHARMACEUTICAL STATISTICS, 2009, 8 (02) : 163 - 169
  • [8] Conditioned Latin Hypercube Sampling: Optimal Sample Size for Digital Soil Mapping of Arid Rangelands in Utah, USA
    Brungard, C. W.
    Boettinger, J. L.
    DIGITAL SOIL MAPPING: BRIDGING RESEARCH, ENVIRONMENTAL APPLICATION, AND OPERATION, 2010, 2 : 67 - 75
  • [9] Uncertainty analysis of sample locations within digital soil mapping approaches
    Grimm, Rosina
    Behrens, Thorsten
    GEODERMA, 2010, 155 (3-4) : 154 - 163
  • [10] Digital soil assessment for regional agricultural land evaluation
    Harms, B.
    Brough, D.
    Philip, S.
    Bartley, R.
    Clifford, D.
    Thomas, M.
    Willis, R.
    Gregory, L.
    GLOBAL FOOD SECURITY-AGRICULTURE POLICY ECONOMICS AND ENVIRONMENT, 2015, 5 : 25 - 36