Interpretable Machine Learning Based Quantification of the Impact of Water Quality Indicators on Groundwater Under Multiple Pollution Sources

被引:0
|
作者
Zhang, Tianyi [1 ]
Wu, Jin [2 ]
Chu, Haibo [1 ]
Liu, Jing [1 ]
Wang, Guoqiang [2 ]
机构
[1] Beijing Univ Technol, Fac Architecture Civil & Transportat Engn, Beijing 100124, Peoples R China
[2] Beijing Normal Univ, Adv Interdisciplinary Inst Satellite Applicat, Beijing 100875, Peoples R China
基金
中国国家自然科学基金;
关键词
groundwater; water quality assessment; human health risk; positive matrix factorization; INDEX; BASIN;
D O I
10.3390/w17060905
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Accurate evaluation of groundwater quality and identification of key characteristics are essential for maintaining groundwater resources. The purpose of this study is to strengthen water quality evaluation through the SHAP and XGBoost algorithms, analyze the key indicators affecting water quality in depth, and quantify their impact on groundwater quality through interpretable tools. The XGBoost algorithm shows that zinc (0.183), nitrate (0.159), and chloride (0.136) are the three indicators with the highest weight. The SHAP algorithm shows that zinc (34.62%), nitrate (17.65%), and chloride (16.98%) have higher contribution values, which explains the output results of XGBoost. According to the calculation scores and classification standards of the water quality model, 49% of the groundwater samples in the study area have excellent water quality, 33% of the samples are better, and 18% of the samples are polluted. The results of positive matrix factorization (PMF) show that natural conditions, metal processing, metal smelting and mining, and agricultural activities all cause pollution to groundwater. Zinc, chloride, nitrate, and manganese were the key variables determined by the SHAP algorithm to explain the vast majority of human health risk sources. These findings indicate that interpretable machine learning not only improves the correlation of water quality assessment but also quantifies the judgment basis of each sample and helps to track key pollution indicators.
引用
收藏
页数:26
相关论文
共 50 条
  • [21] Integrated machine learning based groundwater quality prediction through groundwater quality index for drinking purposes in a semi-arid river basin of south India
    Karunanidhi, D.
    Raj, M. Rhishi Hari
    Roy, Priyadarsi D.
    Subramani, T.
    ENVIRONMENTAL GEOCHEMISTRY AND HEALTH, 2025, 47 (04)
  • [22] Water quality prediction using machine learning models based on grid search method
    Shams, Mahmoud Y.
    Elshewey, Ahmed M.
    El-kenawy, El-Sayed M.
    Ibrahim, Abdelhameed
    Talaat, Fatma M.
    Tarek, Zahraa
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (12) : 35307 - 35334
  • [23] Machine learning model for IoT-Edge device based Water Quality Monitoring
    Kumar, Yogendra
    Udgata, Siba K.
    IEEE INFOCOM 2022 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (INFOCOM WKSHPS), 2022,
  • [24] Evaluation and Prediction of Groundwater Quality for Irrigation Using an Integrated Water Quality Indices, Machine Learning Models and GIS Approaches: A Representative Case Study
    Ibrahim, Hekmat
    Yaseen, Zaher Mundher
    Scholz, Miklas
    Ali, Mumtaz
    Gad, Mohamed
    Elsayed, Salah
    Khadr, Mosaad
    Hussein, Hend
    Ibrahim, Hazem H.
    Eid, Mohamed Hamdy
    Kovacs, Attila
    Peter, Szucs
    Khalifa, Moataz M.
    WATER, 2023, 15 (04)
  • [25] Prediction of groundwater level fluctuations under climate change based on machine learning algorithms in the Mashhad Aquifer, Iran
    Panahi, Ghasem
    Hassanzadeh Eskafi, Mahya
    Faridhosseini, Alireza
    Khodashenas, Saeed Reza
    Rohani, Abbas
    JOURNAL OF WATER AND CLIMATE CHANGE, 2023, 14 (03) : 1039 - 1059
  • [26] Machine Learning Approaches for Assessing Groundwater Quality and Its Implications for Water Conservation in the Sub-tropical Capital Region of India
    Kushwaha, Nand Lal
    Sahoo, Madhumita
    Biwalkar, Nilesh
    WATER CONSERVATION SCIENCE AND ENGINEERING, 2025, 10 (01)
  • [27] Investigating the Impact of Anthropogenic and Natural Sources of Pollution on Quality of Water in Upper Indus Basin (UIB) by Using Multivariate Statistical Analysis
    Baluch, Mansoor A.
    Hashmi, Hashim Nisar
    JOURNAL OF CHEMISTRY, 2019, 2019
  • [28] Evaluation of Water Quality Based on a Machine Learning Algorithm and Water Quality Index for Mid Gangetic Region (South Bihar plain), India
    Gupta, Amar Nath
    Kumar, Deepak
    Singh, Anshuman
    JOURNAL OF THE GEOLOGICAL SOCIETY OF INDIA, 2021, 97 (09) : 1063 - 1072
  • [29] Multiple Linear Regression and Machine Learning for Predicting the Drinking Water Quality Index in Al-Seine Lake
    Jafar, Raed
    Awad, Adel
    Hatem, Iyad
    Jafar, Kamel
    Awad, Edmond
    Shahrour, Isam
    SMART CITIES, 2023, 6 (05): : 2807 - 2827
  • [30] Groundwater suitability assessment for irrigation and drinking purposes by integrating spatial analysis, machine learning, water quality index, and health risk model
    Yan Y.
    Zhang Y.
    Yao R.
    Wei C.
    Luo M.
    Yang C.
    Chen S.
    Huang X.
    Environmental Science and Pollution Research, 2024, 31 (27) : 39155 - 39176