Accurate prediction of spatial distribution of soil heavy metal in complex mining terrain using an improved machine learning method

被引:0
|
作者
Han, Zhaoyang [1 ,2 ]
Wang, Jingyun [3 ]
Liao, Xiaoyong [1 ,2 ]
Yang, Jun [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Geog Sci & Nat Resources Res, Beijing 100101, Peoples R China
[2] Univ Chinese Acad Sci, Coll Resources & Environm, Beijing 100049, Peoples R China
[3] Shandong Inst Geol Sci, Jinan 250013, Peoples R China
关键词
Topographic complexity; Heavy metals; Spatial prediction; Machine learning; Feature selection; POLLUTION; ISFAHAN; REGION; CHINA; RIVER; IRON;
D O I
10.1016/j.jhazmat.2025.137994
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Accurate prediction of heavy metals (HMs) spatial distribution in mining areas is crucial for pollution management. However, predicting the spatial distribution of HMs remains a significant challenge in mining areas with complex terrain and variable contaminant transport pathways. This study aims to optimize the spatial prediction of arsenic (As) distribution in the Shimen realgar mining area, the largest in Asia, by integrating machine learning models with kriging interpolation and feature selection techniques. The results show that the Random Forest (RF) model achieved the best performance in predicting soil As concentration, with an R2 of 0.84 for the test data. Incorporating environmental variables improved the spatial prediction accuracy, with RF (R2 = 0.76, RMSE = 24.68 mg/kg) and Random Forest Regression Kriging (RFRK) (R2 = 0.78, RMSE = 23.46 mg/kg) outperforming ordinary kriging and geographically weighted regression kriging. Importance analysis and recursive feature elimination further optimized the model, leading to a 5 % increase in R2 and a reduction of RMSE by 8 %-12.4 %. The optimized RFRK model accurately captured the spatial distribution of As in the mining area, revealing the outward diffusion pattern of As from the smelting plant. The findings highlight the critical role of feature selection in improving prediction accuracy in highly polluted and complex terrain regions, an aspect that has often been overlooked in previous studies. This study provides a practical framework for spatial prediction of contaminants in similar areas, enhancing the understanding of pollution distribution.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Prediction on daily spatial distribution of chlorophyll-a in coastal seas using a synthetic method of remote sensing, machine learning and numerical modeling
    Li, Hai
    Li, Xiuren
    Song, Dehai
    Nie, Jie
    Liang, Shengkang
    SCIENCE OF THE TOTAL ENVIRONMENT, 2024, 910
  • [42] Spatial Prediction of Soil Organic Matter Using a Hybrid Geostatistical Model of an Extreme Learning Machine and Ordinary Kriging
    Song, Ying-Qiang
    Yang, Lian-An
    Li, Bo
    Hu, Yue-Ming
    Wang, An-Le
    Zhou, Wu
    Cui, Xue-Sen
    Liu, Yi-Lun
    SUSTAINABILITY, 2017, 9 (05)
  • [43] USING MACHINE LEARNING FOR LOSS PREDICTION IN A HYBRID MEANLINE MODELING METHOD TO DELIVER IMPROVED RADIAL TURBINE PERFORMANCE PREDICTION
    Ren, Pangbo
    Inomata, Ryosuke
    Stuart, Charles
    Kobayashi, Takayuki
    Spence, Stephen
    Morita, Isao
    PROCEEDINGS OF ASME TURBO EXPO 2022: TURBOMACHINERY TECHNICAL CONFERENCE AND EXPOSITION, GT2022, VOL 10B, 2022,
  • [44] Using Machine Learning for Loss Prediction in a Hybrid Meanline Modeling Method to Deliver Improved Radial Turbine Performance Prediction
    Ren, Pangbo
    Stuart, Charles
    Spence, Stephen
    Inomata, Ryosuke
    Kobayashi, Takayuki
    Morita, Isao
    JOURNAL OF TURBOMACHINERY-TRANSACTIONS OF THE ASME, 2023, 145 (07):
  • [45] Prediction on the spatial distribution of the seropositive rate of schistosomiasis in Hunan Province, China: a machine learning model integrated with the Kriging method
    Xu, Ning
    Cai, Yu
    Tong, Yixin
    Tang, Ling
    Zhou, Yu
    Gong, Yanfeng
    Huang, Junhui
    Wang, Jiamin
    Chen, Yue
    Jiang, Qingwu
    Zheng, Mao
    Zhou, Yibiao
    PARASITOLOGY RESEARCH, 2024, 123 (09)
  • [46] Simple Optimal Sampling Algorithm to Strengthen Digital Soil Mapping Using the Spatial Distribution of Machine Learning Predictive Uncertainty: A Case Study for Field Capacity Prediction
    Yang, Hyunje
    Lim, Honggeun
    Moon, Haewon
    Li, Qiwen
    Nam, Sooyoun
    Kim, Jaehoon
    Choi, Hyung Tae
    LAND, 2022, 11 (11)
  • [47] Improved prediction of clay soil expansion using machine learning algorithms and meta-heuristic dichotomous ensemble classifiers
    Eyo, E. U.
    Abbey, S. J.
    Lawrence, T. T.
    Tetteh, F. K.
    GEOSCIENCE FRONTIERS, 2022, 13 (01)
  • [48] HPO-empowered machine learning with multiple environment variables enables spatial prediction of soil heavy metals in coastal delta farmland of China
    Song, Yingqiang
    Zhan, Dexi
    He, Zhenxin
    Li, Wenhui
    Duan, Wenxu
    Yang, Zhongkang
    Lu, Miao
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2023, 213
  • [49] A synthesis framework using machine learning and spatial bivariate analysis to identify drivers and hotspots of heavy metal pollution of agricultural soils
    Yang, Shiyan
    Taylor, David
    Yang, Dong
    He, Mingjiang
    Liu, Xingmei
    Xu, Jianming
    ENVIRONMENTAL POLLUTION, 2021, 287
  • [50] Using Machine Learning Algorithms Based on GF-6 and Google Earth Engine to Predict and Map the Spatial Distribution of Soil Organic Matter Content
    Ye, Zhishan
    Sheng, Ziheng
    Liu, Xiaoyan
    Ma, Youhua
    Wang, Ruochen
    Ding, Shiwei
    Liu, Mengqian
    Li, Zijie
    Wang, Qiang
    SUSTAINABILITY, 2021, 13 (24)