Large-scale mapping of soil particle size distribution using legacy data and machine learning-based pedotransfer functions

被引:0
|
作者
Kassai, Piroska [1 ,2 ]
Kocsis, Mihaly [1 ,2 ]
Szatmari, Gabor [1 ,2 ]
Mako, Andras [1 ,2 ]
Meszaros, Janos [1 ,2 ]
Laborczi, Annamaria [1 ,2 ]
Magyar, Zoltan [3 ]
Takacs, Katalin [1 ,2 ]
Pasztor, Laszlo [1 ,2 ]
Szabo, Brigitta [1 ,2 ]
机构
[1] HUN REN Ctr Agr Res, Inst Soil Sci, Herman Otto Ut 15, H-1022 Budapest, Hungary
[2] Natl Lab Water Sci & Water Secur, Herman Otto Ut 15, H-1022 Budapest, Hungary
[3] Soil Control LLC, Ady Utca 55, H-8749 Bak, Hungary
基金
芬兰科学院;
关键词
Soil particle size distribution; Upper limit of soil plasticity; Pedotransfer function; Random forest kriging; Digital soil mapping; GEOSTATISTICS; UNCERTAINTY; INFORMATION; TEXTURE; FOREST;
D O I
10.1016/j.geoderma.2025.117178
中图分类号
S15 [土壤学];
学科分类号
0903 ; 090301 ;
摘要
Large-scale maps of particle size fractions (i.e., sand, silt, and clay contents) were created for a case study based on the newly developed Profile-level Database of the Hungarian Large-Scale Soil Mapping (Hungarian acronym: NATASA). This database combines data from previous surveys, offering potential to improve soil mapping accuracy. The database includes information on soil taxonomy and basic soil chemical and physical properties. However, this database contains no direct information on sand, silt and clay content, only an indirect parameter, namely, the upper limit of soil plasticity. Particle size distribution is crucial for various applications, such as assessing soil degradation, hydrology and fertility. To overcome this limitation, we developed pedotransfer functions (PTFs) to compute the particle size distribution from the soil properties available in the NATASA dataset (1,372 soil profiles). The PTFs were trained and tested on the Hungarian Detailed Soil Hydrophysical Database (3,970 soil profiles) using the random forest method. For the prediction model, i) additive log-ratio transformed clay, silt and sand content were used as the dependent variables, and ii) the upper limit of soil plasticity, soil type, calcium carbonate content, organic matter content and pH were included as independent variables. The results indicate that the R2 values of the PTFs are 0.69 for clay, 0.58 for silt, and 0.74 for sand content. Since the NATASA database contains soil information from different depths, we splined the data into six standard depth layers (0-5, 5-15, 15-30, 30-60, 60-100 and 100-200 cm depths). The spatial modelling was performed by random forest kriging (RFK) using environmental auxiliary variables. The R2 values of the RFK models range from 0.19 to 0.67 for clay content, from 0.49 to 0.62 for silt content and from 0.69 to 0.74 for sand content. We compared the high-resolution (25 m) maps with the global SoilGrids (250 m resolution) and the national DOSoReMI.hu soil maps (100 m resolution). Our high-resolution maps offer more detailed information on clay, silt and sand content vertically and horizontally compared to global and national soil maps. This enhanced detail will facilitate future assessments of soil texture-related processes in the area.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Countrywide mapping and assessment of organic carbon saturation in the topsoil using machine learning-based pedotransfer function with uncertainty propagation
    Szatmari, Gabor
    Pasztor, Laszlo
    Laborczi, Annamaria
    Illes, Gabor
    Bakacsi, Zsofia
    Zachary, Dora
    Filep, Tibor
    Szalai, Zoltan
    Jakab, Gergely
    CATENA, 2023, 227
  • [22] TGE: Machine Learning Based Task Graph Embedding for Large-Scale Topology Mapping
    Choi, Jong Youl
    Logan, Jeremy
    Wolf, Matthew
    Ostrouchov, George
    Kurc, Tahsin
    Liu, Qing
    Podhorszki, Norbert
    Klasky, Scott
    Romanus, Melissa
    Sun, Qian
    Parashar, Manish
    Churchill, Randy Michael
    Chang, C. S.
    2017 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2017, : 587 - 591
  • [23] Humanization of antibodies using a machine learning approach on large-scale repertoire data
    Marks, Claire
    Hummer, Alissa M.
    Chin, Mark
    Deane, Charlotte M.
    BIOINFORMATICS, 2021, 37 (22) : 4041 - 4047
  • [24] MACHINE LEARNING-BASED APPROACH FOR LANDSLIDE SUSCEPTIBILITY MAPPING USING MULTIMODAL DATA
    Ma, Xianping
    Pun, Man-On
    Liu, Ming
    Wang, Yang
    IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 5174 - 5177
  • [25] A machine learning-based framework for mapping hydrogen at the atomic scale
    Zhao, Qingkun
    Zhu, Qi
    Zhang, Zhenghao
    Yin, Binglun
    Gao, Huajian
    Zhou, Haofei
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2024, 121 (39)
  • [26] Mapping soil corrosivity potential to exclusion fencing using pedotransfer functions and open-source soil data
    Stiglingh, Andrea D.
    Mosley, Luke M.
    Smernik, Ronald J.
    Fitzpatrick, Robert W.
    SOIL USE AND MANAGEMENT, 2024, 40 (01)
  • [27] A method for using legacy data for metamodel-based design of large-scale systems
    Srivastava, A
    Hacker, K
    Lewis, K
    Simpson, TW
    STRUCTURAL AND MULTIDISCIPLINARY OPTIMIZATION, 2004, 28 (2-3) : 146 - 155
  • [28] A method for using legacy data for metamodel-based design of large-scale systems
    A. Srivastava
    K. Hacker
    K. Lewis
    T.W. Simpson
    Structural and Multidisciplinary Optimization, 2004, 28 : 146 - 155
  • [29] Machine Learning-Based Soft-Error-Rate Evaluation for Large-Scale Integrated Circuits
    Song, Ruiqiang
    Shao, Jinjin
    Chi, Yaqing
    Liang, Bin
    Chen, Jianjun
    Wu, Zhenyu
    ELECTRONICS, 2023, 12 (24)
  • [30] Machine learning-based evaluation technology of 3D spatial distribution of residual radioactivity in large-scale radioactive structures
    Lee, Ukjae
    Chang, Phillip
    Jung, Nam-Suk
    Jang, Jonghun
    Lee, Jimin
    Lee, Hee-Seock
    NUCLEAR ENGINEERING AND TECHNOLOGY, 2024, 56 (08) : 3199 - 3209