Comprehensive assessment of E. coli dynamics in river water using advanced machine learning and explainable AI

被引:0
|
作者
Mallik, Santanu [1 ,2 ]
Saha, Bodhipriya [2 ]
Podder, Krishanu [3 ]
Muthuraj, Muthusivaramapandian [4 ]
Mishra, Umesh [2 ]
Deb, Sharbari [5 ]
机构
[1] Poornima Coll Engn, Dept Civil Engn, Jaipur 302022, Rajasthan, India
[2] Natl Inst Technol Agartala, Dept Civil Engn, Jirania 799046, Tripura, India
[3] Govt Tripura, Dept Elementary Educ, Agartala, India
[4] Natl Inst Technol Agartala, Dept Bioengn, Jirania 799046, Tripura, India
[5] Poornima Univ, Dept Elect & Comp Engn, Jaipur 303905, Rajasthan, India
关键词
E; coli; Land use; QMRA; Automatic machine learning algorithm; Explainable artificial intelligence; RISK-ASSESSMENT; LAND-USE; QUALITY;
D O I
10.1016/j.psep.2025.106816
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The discharge of untreated municipal wastewater has resulted in faecal contamination of river water, posing severe public health risks, and has challenged safe irrigation. Therefore, the present study quantified the Escherichia coli (E. coli) contamination in three rivers of the Tripura region and assessed the impact of land use (LU) patterns on E. coli dynamics using spatial distribution maps. Further, the Quantitative Microbial Risk Assessment (QMRA) model is utilized to evaluate microbial risks associated with farmers using contaminated river water for irrigation. Finally, this study is the first of its kind to use and compare three hyper-tuning frameworks, which included Bayesian optimization, Tree-based Pipeline Optimization Tool, and Optuna, to predict E. coli concentration. This work also utilizes the Explainable AI (XAI) based Shapley Additive Explanations (SHAP) and Local Interpretable Model-Agnostic Explanations (LIME) for global and local site-specific sensitivity analyses, providing interpretable and actionable insights. The findings show that water quality in all three rivers is unsuitable for drinking primarily due to elevated E. coli levels. Stable pH levels and favorable temperatures support E. coli growth, intensifying the contamination risk. The QMRA model further indicates a 0.01- 0.57 probability of significant health risks for farmers using contaminated water. Additionally, the machine learning approaches, along with statistical metrics and cumulative density function plots, reveal the superior performance of the Optuna-optimized extreme gradient-boosting (XGBoost) model over the random forest (RF) and gradient-boosting machine models (GBM). XAI recognized electrical conductivity and total dissolved solids as the most influential factors affecting the E. coli concentrations. Overall, this framework can predict regions impacted by faecal contamination, supporting the sustainable development goals for clean water and health.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Evaluation of the soil and water assessment tool (SWAT) for simulating E. coli concentrations at the watershed-scale
    Sowah, Robert A.
    Bradshaw, Kenneth
    Snyder, Blake
    Spidle, David
    Molilna, Marirosa
    SCIENCE OF THE TOTAL ENVIRONMENT, 2020, 746 (746)
  • [32] Spatial and temporal variations in river water quality of the Middle Ganga Basin using unsupervised machine learning techniques
    Krishnaraj, Ashwitha
    Deka, Paresh Chandra
    ENVIRONMENTAL MONITORING AND ASSESSMENT, 2020, 192 (12)
  • [33] Assessment of Forest Cover Dynamics in Dedo District, Oromia Region, Ethiopia, Using Machine Learning Algorithm
    Engda, Esubalew Mulugeta
    Bojer, Amanuel Kumsa
    Mekonnen, Ziyen Achamyeleh
    Tegegn, Wubalem Woreket
    JOURNAL OF SENSORS, 2024, 2024
  • [34] Quantitative assessment of human exposure to extended spectrum and AmpC β-lactamases bearing E. coli in lettuce attributable to irrigation water and subsequent horizontal gene transfer
    Njage, P. M. K.
    Buys, E. M.
    INTERNATIONAL JOURNAL OF FOOD MICROBIOLOGY, 2017, 240 : 141 - 151
  • [35] Assessment of forest cover dynamics for the detection of deforestation in the Hindu Kush Himalayan region using geospatial and machine learning approaches
    Gupta, Laxmi
    Dixit, Jagabandhu
    Pandey, Prem Chandra
    Pandey, Manish
    EARTH SCIENCE INFORMATICS, 2025, 18 (01)
  • [36] Validity assessment of Michigan's proposed qPCR threshold value for rapid water-quality monitoring of E. coli contamination
    McNair, James N.
    Lane, Molly J.
    Hart, John J.
    Porter, Alexis M.
    Briggs, Shannon
    Southwell, Benjamin
    Sivy, Tami
    Szlag, David C.
    Scull, Brian T.
    Pike, Schuyler
    Dreelin, Erin
    Vernier, Chris
    Carter, Bonnie
    Sharp, Josh
    Nowlin, Penny
    Rediske, Richard R.
    WATER RESEARCH, 2022, 226
  • [37] Assessment of groundwater suitability for sustainable irrigation: A comprehensive study using indexical, statistical, and machine learning approaches
    Singh, Gobinder
    Singh, Jagdeep
    Wani, Owais Ali
    Egbueri, Johnbosco C.
    Agbasi, Johnson C.
    GROUNDWATER FOR SUSTAINABLE DEVELOPMENT, 2024, 24
  • [38] Cropland classification and water stress vulnerability assessment in arid environment of Churu district, India using machine learning approach
    Islam, Zubairul
    Siddiqui, Azizur Rahman
    Singh, Sudhir Kumar
    Singh, Jaspal
    Bajpai, Rajesh
    Ahirwar, Saroj
    JOURNAL OF ATMOSPHERIC AND SOLAR-TERRESTRIAL PHYSICS, 2025, 269
  • [39] Detection of E. coli concentration levels using CSI-D plus handheld with UV-C fluorescence imaging and deep learning on leaf surfaces
    Yadav, Pappu Kumar
    Burks, Thomas
    Vaddi, Snehit
    Qin, Jianwei
    Kim, Moon
    Ritenour, Mark A.
    Vasefi, Fartash
    SENSING FOR AGRICULTURE AND FOOD QUALITY AND SAFETY XVI, 2024, 13060
  • [40] A novel sensing platform using silicon nanowires/reduced graphene oxide to detect pathogenic E. coli (MTCC4430) and its application in water samples
    Nair, Sneha
    Gautam, Varsha
    Kumar, Ramesh
    Verma, Abhishek
    Jain, Vinod Kumar
    Nagpal, Suman
    TOXICOLOGY AND ENVIRONMENTAL HEALTH SCIENCES, 2022, 14 (03) : 253 - 260