Accurate prediction of spatial distribution of soil heavy metal in complex mining terrain using an improved machine learning method

被引:0
|
作者
Han, Zhaoyang [1 ,2 ]
Wang, Jingyun [3 ]
Liao, Xiaoyong [1 ,2 ]
Yang, Jun [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Geog Sci & Nat Resources Res, Beijing 100101, Peoples R China
[2] Univ Chinese Acad Sci, Coll Resources & Environm, Beijing 100049, Peoples R China
[3] Shandong Inst Geol Sci, Jinan 250013, Peoples R China
关键词
Topographic complexity; Heavy metals; Spatial prediction; Machine learning; Feature selection; POLLUTION; ISFAHAN; REGION; CHINA; RIVER; IRON;
D O I
10.1016/j.jhazmat.2025.137994
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Accurate prediction of heavy metals (HMs) spatial distribution in mining areas is crucial for pollution management. However, predicting the spatial distribution of HMs remains a significant challenge in mining areas with complex terrain and variable contaminant transport pathways. This study aims to optimize the spatial prediction of arsenic (As) distribution in the Shimen realgar mining area, the largest in Asia, by integrating machine learning models with kriging interpolation and feature selection techniques. The results show that the Random Forest (RF) model achieved the best performance in predicting soil As concentration, with an R2 of 0.84 for the test data. Incorporating environmental variables improved the spatial prediction accuracy, with RF (R2 = 0.76, RMSE = 24.68 mg/kg) and Random Forest Regression Kriging (RFRK) (R2 = 0.78, RMSE = 23.46 mg/kg) outperforming ordinary kriging and geographically weighted regression kriging. Importance analysis and recursive feature elimination further optimized the model, leading to a 5 % increase in R2 and a reduction of RMSE by 8 %-12.4 %. The optimized RFRK model accurately captured the spatial distribution of As in the mining area, revealing the outward diffusion pattern of As from the smelting plant. The findings highlight the critical role of feature selection in improving prediction accuracy in highly polluted and complex terrain regions, an aspect that has often been overlooked in previous studies. This study provides a practical framework for spatial prediction of contaminants in similar areas, enhancing the understanding of pollution distribution.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] A hybrid framework for delineating the migration route of soil heavy metal pollution by heavy metal similarity calculation and machine learning method
    Wang, Feng
    Huo, Lili
    Li, Yue
    Wu, Lina
    Zhang, Yanqiu
    Shi, Guoliang
    An, Yi
    SCIENCE OF THE TOTAL ENVIRONMENT, 2023, 858
  • [22] Uncovering soil heavy metal pollution hotspots and influencing mechanisms through machine learning and spatial analysis
    Song, Xiaoyong
    Sun, Yao
    Wang, Huijuan
    Huang, Xinmiao
    Han, Zilin
    Shu, Yilan
    Wu, Jiaheng
    Zhang, Zhenglin
    Zhong, Qicheng
    Li, Rongxi
    Fan, Zhengqiu
    ENVIRONMENTAL POLLUTION, 2025, 370
  • [23] Accurate prediction of the energetics of weakly bound complexes using the machine learning method kriging
    Peter I. Maxwell
    Paul L. A. Popelier
    Structural Chemistry, 2017, 28 : 1513 - 1523
  • [24] Accurate prediction of the energetics of weakly bound complexes using the machine learning method kriging
    Maxwell, Peter I.
    Popelier, Paul L. A.
    STRUCTURAL CHEMISTRY, 2017, 28 (05) : 1513 - 1523
  • [25] Improved soil carbon stock spatial prediction in a Mediterranean soil erosion site through robust machine learning techniques
    Mosaid, Hassan
    Barakat, Ahmed
    John, Kingsley
    Faouzi, Elhousna
    Bustillo, Vincent
    El Garnaoui, Mohamed
    Heung, Brandon
    ENVIRONMENTAL MONITORING AND ASSESSMENT, 2024, 196 (02)
  • [26] A novel spatial prediction method for soil heavy metal based on unbiased conditional kernel density estimation
    Liu, Shuoyu
    Wang, Liping
    Liu, Dongsheng
    Diao, Jingping
    Jiang, Yan
    SCIENCE OF THE TOTAL ENVIRONMENT, 2024, 952
  • [27] Spatial distribution of heavy metal contamination and uncertainty-based human health risk in the aquatic environment using multivariate statistical method
    Li, Jing
    Chen, Yizhong
    Lu, Hongwei
    Zhai, Weiyao
    ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH, 2021, 28 (18) : 22804 - 22822
  • [28] Spatial distribution and risk assessment of heavy metal contamination in soil-crop systems near gold mining areas
    Dai, Da-kai
    Zhou, Jun
    ENVIRONMENTAL GEOCHEMISTRY AND HEALTH, 2025, 47 (06)
  • [29] Spatial prediction of soil micronutrients using machine learning algorithms integrated with multiple digital covariates
    Ali Keshavarzi
    Fuat Kaya
    Levent Başayiğit
    Yeboah Gyasi-Agyei
    Jesús Rodrigo-Comino
    Andrés Caballero-Calvo
    Nutrient Cycling in Agroecosystems, 2023, 127 : 137 - 153
  • [30] Spatial Prediction of Soil Organic Carbon Stock in the Moroccan High Atlas Using Machine Learning
    Meliho, Modeste
    Boulmane, Mohamed
    Khattabi, Abdellatif
    Dansou, Caleb Efelic
    Orlando, Collins Ashianga
    Mhammdi, Nadia
    Noumonvi, Koffi Dodji
    REMOTE SENSING, 2023, 15 (10)