Interpretable Machine Learning Based Quantification of the Impact of Water Quality Indicators on Groundwater Under Multiple Pollution Sources

被引:0
|
作者
Zhang, Tianyi [1 ]
Wu, Jin [2 ]
Chu, Haibo [1 ]
Liu, Jing [1 ]
Wang, Guoqiang [2 ]
机构
[1] Beijing Univ Technol, Fac Architecture Civil & Transportat Engn, Beijing 100124, Peoples R China
[2] Beijing Normal Univ, Adv Interdisciplinary Inst Satellite Applicat, Beijing 100875, Peoples R China
基金
中国国家自然科学基金;
关键词
groundwater; water quality assessment; human health risk; positive matrix factorization; INDEX; BASIN;
D O I
10.3390/w17060905
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Accurate evaluation of groundwater quality and identification of key characteristics are essential for maintaining groundwater resources. The purpose of this study is to strengthen water quality evaluation through the SHAP and XGBoost algorithms, analyze the key indicators affecting water quality in depth, and quantify their impact on groundwater quality through interpretable tools. The XGBoost algorithm shows that zinc (0.183), nitrate (0.159), and chloride (0.136) are the three indicators with the highest weight. The SHAP algorithm shows that zinc (34.62%), nitrate (17.65%), and chloride (16.98%) have higher contribution values, which explains the output results of XGBoost. According to the calculation scores and classification standards of the water quality model, 49% of the groundwater samples in the study area have excellent water quality, 33% of the samples are better, and 18% of the samples are polluted. The results of positive matrix factorization (PMF) show that natural conditions, metal processing, metal smelting and mining, and agricultural activities all cause pollution to groundwater. Zinc, chloride, nitrate, and manganese were the key variables determined by the SHAP algorithm to explain the vast majority of human health risk sources. These findings indicate that interpretable machine learning not only improves the correlation of water quality assessment but also quantifies the judgment basis of each sample and helps to track key pollution indicators.
引用
收藏
页数:26
相关论文
共 50 条
  • [41] Insights into enhanced machine learning techniques for surface water quantity and quality prediction based on data pre-processing algorithms
    Panahi, Javad
    Mastouri, Reza
    Shabanlou, Saeid
    JOURNAL OF HYDROINFORMATICS, 2022, : 875 - 897
  • [42] Assessing long-term climate change impact on spatiotemporal changes of groundwater level using autoregressive-based and ensemble machine learning models
    Nourani, Vahid
    Tapeh, Ali Hasanpour Ghareh
    Khodkar, Kasra
    Huang, Jinhui Jeanne
    JOURNAL OF ENVIRONMENTAL MANAGEMENT, 2023, 336
  • [43] Carcinogenic health risks and water quality assessment of groundwater around lead–zinc mining areas of Ebonyi state Nigeria: a data-driven machine learning approach
    Obinna Chigoziem Akakuru
    Moses Oghenenyoreme Eyankware
    Godwin O. Aigbadon
    Ayatu Ojonugwa Usman
    Alexander Iheanyi Opara
    Kizito Ojochenemi Musa
    Micheal Akaninyene Okon
    Okechukwu Pius Aghamelu
    Gabriel Ehriga Odesa
    Ifeyinwa Juliana Ofoh
    Annabel U. Obinna-Akakuru
    Discover Civil Engineering, 1 (1):
  • [44] Environmental assessment based surface water quality prediction using hyper-parameter optimized machine learning models based on consistent big data
    Shah, Muhammad Izhar
    Javed, Muhammad Faisal
    Alqahtani, Abdulaziz
    Aldrees, Ali
    PROCESS SAFETY AND ENVIRONMENTAL PROTECTION, 2021, 151 : 324 - 340
  • [45] Comprehensive water quality evaluation based on kernel extreme learning machine optimized with the sparrow search algorithm in Luoyang River Basin, China
    Song, Chenguang
    Yao, Leihua
    Hua, Chengya
    Ni, Qihang
    ENVIRONMENTAL EARTH SCIENCES, 2021, 80 (16)
  • [46] Evaluation of soil quality of cultivated lands with classification and regression-based machine learning algorithms optimization under humid environmental condition
    Dengiz, Orhan
    Alaboz, Pelin
    In, Fikret Sayg
    Adem, Kemal
    Yuksek, Emre
    ADVANCES IN SPACE RESEARCH, 2024, 74 (11) : 5514 - 5529
  • [47] Solving water scarcity challenges in arid regions: A novel approach employing human-based meta-heuristics and machine learning algorithm for groundwater potential mapping
    Razavi-Termeh, Seyed Vahid
    Sadeghi-Niaraki, Abolghasem
    Farhangi, Farbod
    Khiadani, Mehdi
    Pirasteh, Saied
    Choi, Soo-Mi
    Chemosphere, 2024, 363
  • [48] GIS mapping-based impact assessment of groundwater contamination by arsenic and other heavy metal contaminants in the Brahmaputra River valley: A water quality assessment study
    Nath, B. K.
    Chaliha, C.
    Bhuyan, B.
    Kalita, E.
    Baruah, D. C.
    Bhagabati, A. K.
    JOURNAL OF CLEANER PRODUCTION, 2018, 201 : 1001 - 1011
  • [49] Surface water quality prediction in the lower Thoubal river watershed, India: A hyper-tuned machine learning approach and DNN-based sensitivity analysis
    Rahaman, Md Hibjur
    Sajjad, Haroon
    Hussain, Shabina
    Roshani
    Masroor, Md
    Sharma, Aastha
    JOURNAL OF ENVIRONMENTAL CHEMICAL ENGINEERING, 2024, 12 (03):
  • [50] Efficacy of GIS-based AHP and data-driven intelligent machine learning algorithms for irrigation water quality prediction in an agricultural-mine district within the Lower Benue Trough, Nigeria
    Omeka, Michael E.
    Igwe, Ogbonnaya
    Onwuka, Obialo S.
    Nwodo, Ogechukwu M.
    Ugar, Samuel I.
    Undiandeye, Peter A.
    Anyanwu, Ifeanyi E.
    ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH, 2023, 31 (41) : 54204 - 54233