Comprehensive assessment of E. coli dynamics in river water using advanced machine learning and explainable AI

被引:0
|
作者
Mallik, Santanu [1 ,2 ]
Saha, Bodhipriya [2 ]
Podder, Krishanu [3 ]
Muthuraj, Muthusivaramapandian [4 ]
Mishra, Umesh [2 ]
Deb, Sharbari [5 ]
机构
[1] Poornima Coll Engn, Dept Civil Engn, Jaipur 302022, Rajasthan, India
[2] Natl Inst Technol Agartala, Dept Civil Engn, Jirania 799046, Tripura, India
[3] Govt Tripura, Dept Elementary Educ, Agartala, India
[4] Natl Inst Technol Agartala, Dept Bioengn, Jirania 799046, Tripura, India
[5] Poornima Univ, Dept Elect & Comp Engn, Jaipur 303905, Rajasthan, India
关键词
E; coli; Land use; QMRA; Automatic machine learning algorithm; Explainable artificial intelligence; RISK-ASSESSMENT; LAND-USE; QUALITY;
D O I
10.1016/j.psep.2025.106816
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The discharge of untreated municipal wastewater has resulted in faecal contamination of river water, posing severe public health risks, and has challenged safe irrigation. Therefore, the present study quantified the Escherichia coli (E. coli) contamination in three rivers of the Tripura region and assessed the impact of land use (LU) patterns on E. coli dynamics using spatial distribution maps. Further, the Quantitative Microbial Risk Assessment (QMRA) model is utilized to evaluate microbial risks associated with farmers using contaminated river water for irrigation. Finally, this study is the first of its kind to use and compare three hyper-tuning frameworks, which included Bayesian optimization, Tree-based Pipeline Optimization Tool, and Optuna, to predict E. coli concentration. This work also utilizes the Explainable AI (XAI) based Shapley Additive Explanations (SHAP) and Local Interpretable Model-Agnostic Explanations (LIME) for global and local site-specific sensitivity analyses, providing interpretable and actionable insights. The findings show that water quality in all three rivers is unsuitable for drinking primarily due to elevated E. coli levels. Stable pH levels and favorable temperatures support E. coli growth, intensifying the contamination risk. The QMRA model further indicates a 0.01- 0.57 probability of significant health risks for farmers using contaminated water. Additionally, the machine learning approaches, along with statistical metrics and cumulative density function plots, reveal the superior performance of the Optuna-optimized extreme gradient-boosting (XGBoost) model over the random forest (RF) and gradient-boosting machine models (GBM). XAI recognized electrical conductivity and total dissolved solids as the most influential factors affecting the E. coli concentrations. Overall, this framework can predict regions impacted by faecal contamination, supporting the sustainable development goals for clean water and health.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Photocatalytic inactivation of E. coli in surface water using immobilised nanoparticle TiO2 films
    Alrousan, Dheaya M. A.
    Dunlop, Patrick S. M.
    McMurray, Trudy A.
    Byrne, J. Anthony
    WATER RESEARCH, 2009, 43 (01) : 47 - 54
  • [22] Development of an advanced electrochemical biosensing platform for E. coli using hybrid metal-organic framework/polyaniline composite
    Gupta, Arushi
    Bhardwaj, Sanjeev K.
    Sharma, Amit L.
    Kim, Ki-Hyun
    Deep, Akash
    ENVIRONMENTAL RESEARCH, 2019, 171 : 395 - 402
  • [23] Comprehensive insight on multidrug resistance and virulence genes of ESBL-producing E. coli from different surface water sources in Bangladesh
    Mou, Taslin Jahan
    Sumon, Sazzad Hossain
    Nupur, Nasrin Akter
    Sharif, Nadim
    Islam, Md. Fokhrul
    Dey, Shuvra Kanti
    Parvez, Md Anowar Khasru
    JOURNAL OF WATER AND HEALTH, 2024, 22 (10) : 1808 - 1825
  • [24] A Comprehensive framework for Parkinson's disease diagnosis using explainable artificial intelligence empowered machine learning techniques
    Priyadharshini, S.
    Ramkumar, K.
    Vairavasundaram, Subramaniyaswamy
    Narasimhan, K.
    Venkatesh, S.
    Amirtharajan, Rengarajan
    Kotecha, Ketan
    ALEXANDRIA ENGINEERING JOURNAL, 2024, 107 : 568 - 582
  • [25] Total coliform and E. coli in public water systems using undisinfected ground water in the United States
    Messner, Michael J.
    Berger, Philip
    Javier, Julie
    INTERNATIONAL JOURNAL OF HYGIENE AND ENVIRONMENTAL HEALTH, 2017, 220 (04) : 736 - 743
  • [26] Dynamics and future prediction of LULC on Pare River basin of Arunachal Pradesh using machine learning techniques
    Mandal, Sameer
    Bandyopadhyay, Arnab
    Bhadra, Aditi
    ENVIRONMENTAL MONITORING AND ASSESSMENT, 2023, 195 (06)
  • [27] Quantifying Temporal Dynamics of E. coli Concentration and Quantitative Microbial Risk Assessment of Pathogen in a Karst Basin
    Sarker, Shishir K.
    Dapkus, Ryan T.
    Byrne, Diana M.
    Fryar, Alan E.
    Hutchison, Justin M.
    WATER, 2025, 17 (05)
  • [28] Data-driven models for predicting microbial water quality in the drinking water source using E. coli monitoring and hydrometeorological data
    Sokolova, Ekaterina
    Ivarsson, Oscar
    Lilliestrom, Ann
    Speicher, Nora K.
    Rydberg, Henrik
    Bondelind, Mia
    SCIENCE OF THE TOTAL ENVIRONMENT, 2022, 802
  • [29] Exploring forest fire susceptibility and management strategies in Western Himalaya: Integrating ensemble machine learning and explainable AI for accurate prediction and comprehensive analysis
    Hang, Hoang Thi
    Mallick, Javed
    Alqadhi, Saeed
    Bindajam, Ahmed Ali
    Abdo, Hazem Ghassan
    ENVIRONMENTAL TECHNOLOGY & INNOVATION, 2024, 35
  • [30] Development of a novel fluorescence spectroscopy based method using layered double hydroxides to study degradation of E. coli in water
    Fatima, Noor
    Hassan, Syed Mujtaba ul
    Fakhar-e-Alam, M.
    Asif, Muhammad
    Imtiaz, Sana
    Anwar, Shahzad
    Arooj, Hurriyat
    Imran, Muhammad
    JOURNAL OF MOLECULAR STRUCTURE, 2024, 1310