Accurate prediction of spatial distribution of soil heavy metal in complex mining terrain using an improved machine learning method

Authors
Han, Zhaoyang [1, 2]
Wang, Jingyun [3]
Liao, Xiaoyong [1, 2]
Yang, Jun [1, 2]
Affiliations
[1] Chinese Acad Sci, Inst Geog Sci & Nat Resources Res, Beijing 100101, Peoples R China
[2] Univ Chinese Acad Sci, Coll Resources & Environm, Beijing 100049, Peoples R China
[3] Shandong Inst Geol Sci, Jinan 250013, Peoples R China
Keywords
Topographic complexity; Heavy metals; Spatial prediction; Machine learning; Feature selection; POLLUTION; ISFAHAN; REGION; CHINA; RIVER; IRON;
DOI
10.1016/j.jhazmat.2025.137994
Chinese Library Classification (CLC) number
X [Environmental science, safety science]
Discipline classification code
08; 0830
Abstract
Accurate prediction of the spatial distribution of heavy metals (HMs) in mining areas is crucial for pollution management. However, such prediction remains a significant challenge where the terrain is complex and contaminant transport pathways are variable. This study aims to optimize the spatial prediction of arsenic (As) in the Shimen realgar mining area, the largest in Asia, by integrating machine learning models with kriging interpolation and feature selection techniques. The results show that the Random Forest (RF) model achieved the best performance in predicting soil As concentration, with an R² of 0.84 on the test data. Incorporating environmental variables improved the spatial prediction accuracy, with RF (R² = 0.76, RMSE = 24.68 mg/kg) and Random Forest Regression Kriging (RFRK) (R² = 0.78, RMSE = 23.46 mg/kg) outperforming ordinary kriging and geographically weighted regression kriging. Importance analysis and recursive feature elimination further optimized the model, leading to a 5% increase in R² and a reduction in RMSE of 8% to 12.4%. The optimized RFRK model accurately captured the spatial distribution of As in the mining area, revealing the outward diffusion of As from the smelting plant. The findings highlight the critical role of feature selection in improving prediction accuracy in highly polluted regions with complex terrain, an aspect that has often been overlooked in previous studies. This study provides a practical framework for the spatial prediction of contaminants in similar areas and improves the understanding of pollution distribution.
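The workflow named in the abstract (recursive feature elimination, a random forest trend model, then spatial interpolation of the residuals) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration, not the authors' implementation: the data are synthetic, the covariates and parameter values are hypothetical, and a Gaussian process regressor from scikit-learn stands in for the kriging step on the residuals.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.feature_selection import RFE
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF, WhiteKernel
from sklearn.metrics import mean_squared_error, r2_score
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)

# Synthetic stand-in data (NOT the study's data): 300 sample points with
# x/y coordinates and 8 hypothetical environmental covariates.
n = 300
coords = rng.uniform(0, 10, size=(n, 2))
covars = rng.normal(size=(n, 8))
# The target depends on two covariates, a smooth spatial signal, and noise.
target = (
    30 * covars[:, 0]
    - 20 * covars[:, 1]
    + 15 * np.sin(coords[:, 0]) * np.cos(coords[:, 1])
    + rng.normal(scale=5, size=n)
)

idx_train, idx_test = train_test_split(np.arange(n), test_size=0.3, random_state=0)

# Step 1: recursive feature elimination wrapped around a random forest.
selector = RFE(
    RandomForestRegressor(n_estimators=100, random_state=0),
    n_features_to_select=4,  # hypothetical choice
)
selector.fit(covars[idx_train], target[idx_train])
X_train = selector.transform(covars[idx_train])
X_test = selector.transform(covars[idx_test])

# Step 2: random forest trend model on the selected covariates.
rf = RandomForestRegressor(n_estimators=200, oob_score=True, random_state=0)
rf.fit(X_train, target[idx_train])
pred_rf = rf.predict(X_test)

# Step 3: interpolate the residuals in space. Out-of-bag predictions are used
# because in-sample random forest residuals are artificially small.
residuals = target[idx_train] - rf.oob_prediction_
gp = GaussianProcessRegressor(
    kernel=RBF(length_scale=1.0) + WhiteKernel(noise_level=1.0),
    normalize_y=True,
    random_state=0,
)
gp.fit(coords[idx_train], residuals)

# Step 4: regression-kriging style prediction = trend + interpolated residual.
pred_rfrk = pred_rf + gp.predict(coords[idx_test])


def rmse(y_true, y_pred):
    """Root mean squared error."""
    return float(np.sqrt(mean_squared_error(y_true, y_pred)))


for name, pred in [("RF", pred_rf), ("RF + residual GP", pred_rfrk)]:
    print(name, "R2 =", round(r2_score(target[idx_test], pred), 3),
          "RMSE =", round(rmse(target[idx_test], pred), 3))
```

With real data, the residual step would typically use a fitted variogram and ordinary kriging rather than a Gaussian process, and the number of retained features would be chosen by cross-validation rather than fixed in advance.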
Pages: 12