Accurate prediction of spatial distribution of soil heavy metal in complex mining terrain using an improved machine learning method

被引:0
|
作者
Han, Zhaoyang [1 ,2 ]
Wang, Jingyun [3 ]
Liao, Xiaoyong [1 ,2 ]
Yang, Jun [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Geog Sci & Nat Resources Res, Beijing 100101, Peoples R China
[2] Univ Chinese Acad Sci, Coll Resources & Environm, Beijing 100049, Peoples R China
[3] Shandong Inst Geol Sci, Jinan 250013, Peoples R China
关键词
Topographic complexity; Heavy metals; Spatial prediction; Machine learning; Feature selection; POLLUTION; ISFAHAN; REGION; CHINA; RIVER; IRON;
D O I
10.1016/j.jhazmat.2025.137994
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Accurate prediction of heavy metals (HMs) spatial distribution in mining areas is crucial for pollution management. However, predicting the spatial distribution of HMs remains a significant challenge in mining areas with complex terrain and variable contaminant transport pathways. This study aims to optimize the spatial prediction of arsenic (As) distribution in the Shimen realgar mining area, the largest in Asia, by integrating machine learning models with kriging interpolation and feature selection techniques. The results show that the Random Forest (RF) model achieved the best performance in predicting soil As concentration, with an R2 of 0.84 for the test data. Incorporating environmental variables improved the spatial prediction accuracy, with RF (R2 = 0.76, RMSE = 24.68 mg/kg) and Random Forest Regression Kriging (RFRK) (R2 = 0.78, RMSE = 23.46 mg/kg) outperforming ordinary kriging and geographically weighted regression kriging. Importance analysis and recursive feature elimination further optimized the model, leading to a 5 % increase in R2 and a reduction of RMSE by 8 %-12.4 %. The optimized RFRK model accurately captured the spatial distribution of As in the mining area, revealing the outward diffusion pattern of As from the smelting plant. The findings highlight the critical role of feature selection in improving prediction accuracy in highly polluted and complex terrain regions, an aspect that has often been overlooked in previous studies. This study provides a practical framework for spatial prediction of contaminants in similar areas, enhancing the understanding of pollution distribution.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Spatial prediction of soil micronutrients using machine learning algorithms integrated with multiple digital covariates
    Keshavarzi, Ali
    Kaya, Fuat
    Basayigit, Levent
    Gyasi-Agyei, Yeboah
    Rodrigo-Comino, Jesus
    Caballero-Calvo, Andres
    NUTRIENT CYCLING IN AGROECOSYSTEMS, 2023, 127 (01) : 137 - 153
  • [32] An improved estimation model for soil heavy metal(loid) concentration retrieval in mining areas using reflectance spectroscopy
    Kun Tan
    Huimin Wang
    Qianqian Zhang
    Xiuping Jia
    Journal of Soils and Sediments, 2018, 18 : 2008 - 2022
  • [33] An improved estimation model for soil heavy metal(loid) concentration retrieval in mining areas using reflectance spectroscopy
    Tan, Kun
    Wang, Huimin
    Zhang, Qianqian
    Jia, Xiuping
    JOURNAL OF SOILS AND SEDIMENTS, 2018, 18 (05) : 2008 - 2022
  • [34] Using geostatistics and machine learning models to analyze the influence of soil nutrients and terrain attributes on lead prediction in forest soils
    Ahado, Samuel Kudjo
    Agyeman, Prince Chapman
    Boruvka, Lubos
    Kanianska, Radoslava
    Nwaogu, Chukwudi
    MODELING EARTH SYSTEMS AND ENVIRONMENT, 2024, 10 (02) : 2099 - 2112
  • [35] Shear stress distribution prediction in symmetric compound channels using data mining and machine learning models
    Zohreh Sheikh Khozani
    Khabat Khosravi
    Mohammadamin Torabi
    Amir Mosavi
    Bahram Rezaei
    Timon Rabczuk
    Frontiers of Structural and Civil Engineering, 2020, 14 : 1097 - 1109
  • [36] Mining Campus Big Data: Prediction of Career Choice Using Interpretable Machine Learning Method
    Wang, Yuan
    Yang, Liping
    Wu, Jun
    Song, Zisheng
    Shi, Li
    MATHEMATICS, 2022, 10 (08)
  • [37] Shear stress distribution prediction in symmetric compound channels using data mining and machine learning models
    Khozani, Zohreh Sheikh
    Khosravi, Khabat
    Torabi, Mohammadamin
    Mosavi, Amir
    Rezaei, Bahram
    Rabczuk, Timon
    FRONTIERS OF STRUCTURAL AND CIVIL ENGINEERING, 2020, 14 (05) : 1097 - 1109
  • [38] Machine Learning Methods for Statistical Prediction of PM2.5 in Urban Agglomerations with Complex Terrain, Using Grenoble As an Example
    A. I. Suslov
    M. A. Krinitskiy
    C. Staquet
    E. Le Boudec
    Moscow University Physics Bulletin, 2024, 79 (Suppl 2) : S774 - S783
  • [39] Delineating and identifying risk zones of soil heavy metal pollution in an industrialized region using machine learning
    Chen, Di
    Wang, Xiahui
    Luo, Ximing
    Huang, Guoxin
    Tian, Zi
    Li, Weiyu
    Liu, Fei
    ENVIRONMENTAL POLLUTION, 2023, 318
  • [40] Sources apportionment and spatial prediction of soil heavy metal pollution using UNMIX model and multivariate statistical simulation
    Yang Q.
    Wang L.
    Li P.
    Lyu L.
    Fan Y.
    Zhu G.
    Wang Y.
    Nongye Gongcheng Xuebao/Transactions of the Chinese Society of Agricultural Engineering, 2024, 40 (04): : 224 - 234