Advancing water quality assessment and prediction using machine learning models, coupled with explainable artificial intelligence (XAI) techniques like Shapley additive explanations (SHAP) for interpreting the black-box nature

Cited by: 44
Authors
Makumbura, Randika K. [1]
Mampitiya, Lakindu [1]
Rathnayake, Namal [2]
Meddage, D. P. P. [3]
Henna, Shagufta [4]
Dang, Tuan Linh [5]
Hoshino, Yukinobu [6]
Rathnayake, Upaka [7]
Affiliations
[1] Water Resources Management & Soft Comp Res Lab, Millennium City 10150, Athurugiriya, Sri Lanka
[2] Univ Tokyo, Fac Engn, Dept Civil Engn, 1 Chome 1-1 Yayoi, Bunkyo City, Tokyo 1138656, Japan
[3] Univ New South Wales, Sch Engn & Informat Technol, Canberra, ACT, Australia
[4] Atlantic Technol Univ, Dept Comp, Letterkenny F92 FC93, Ireland
[5] Hanoi Univ Sci & Technol, Sch Informat & Commun Technol, 1 Dai Co Viet Rd, Hanoi 100000, Vietnam
[6] Kochi Univ Technol, Sch Syst Engn, 185 Miyanokuchi, Kami, Kochi 7828502, Japan
[7] Atlantic Technol Univ, Fac Engn & Design, Dept Civil Engn & Construct, Sligo F91 YW50, Ireland
Keywords
Water quality assessment; Machine learning; Explainable artificial intelligence; Shapley additive explanations; Prediction models; Scatter plot
DOI
10.1016/j.rineng.2024.102831
Chinese Library Classification number
T [Industrial Technology]
Discipline classification code
08
Abstract
Water quality assessment and prediction play crucial roles in ensuring the sustainability and safety of freshwater resources. This study aims to enhance water quality assessment and prediction by integrating advanced machine learning models with XAI techniques. Traditional methods, such as the water quality index, often require extensive data collection and laboratory analysis, making them resource-intensive. The weighted arithmetic water quality index is employed alongside three machine learning models, random forest (RF), LightGBM, and XGBoost, to predict water quality. Model performance was evaluated using the mean absolute error (MAE), root mean square error (RMSE), coefficient of determination (R²), and correlation coefficient (R). The results demonstrated high predictive accuracy, with XGBoost performing best (R² = 0.992, R = 0.996, MAE = 0.825, and RMSE = 1.381). Additionally, SHAP was used to interpret the model's predictions, revealing that chemical oxygen demand (COD) and biochemical oxygen demand (BOD) are the most influential factors in determining water quality, while electrical conductivity, chloride, and nitrate had minimal impact. High dissolved oxygen levels were associated with lower water quality index values, which indicate excellent water quality, while pH consistently influenced predictions. The findings suggest that the proposed approach offers a reliable and interpretable method for water quality prediction, which can significantly benefit water specialists and decision-makers.
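The four evaluation metrics named in the abstract can be illustrated with a minimal sketch. This is not the authors' code: the function name `regression_metrics` and the sample values are hypothetical, and the formulas are the standard textbook definitions, assumed here to match what the study reports.

```python
import math


def regression_metrics(y_true, y_pred):
    """Return MAE, RMSE, R2 and Pearson R for two equal-length sequences."""
    if len(y_true) != len(y_pred) or len(y_true) < 2:
        raise ValueError("need two sequences of equal length (at least 2 values)")

    n = len(y_true)
    residuals = [t - p for t, p in zip(y_true, y_pred)]

    # Mean absolute error and root mean square error of the residuals.
    mae = sum(abs(e) for e in residuals) / n
    rmse = math.sqrt(sum(e * e for e in residuals) / n)

    mean_t = sum(y_true) / n
    mean_p = sum(y_pred) / n

    # Coefficient of determination: 1 - (residual SS / total SS).
    ss_res = sum(e * e for e in residuals)
    ss_tot = sum((t - mean_t) ** 2 for t in y_true)
    r2 = 1.0 - ss_res / ss_tot

    # Pearson correlation between observed and predicted values.
    cov = sum((t - mean_t) * (p - mean_p) for t, p in zip(y_true, y_pred))
    ss_pred = sum((p - mean_p) ** 2 for p in y_pred)
    r = cov / math.sqrt(ss_tot * ss_pred)

    return {"MAE": mae, "RMSE": rmse, "R2": r2, "R": r}


# Hypothetical observed and predicted water quality index values,
# made up for illustration only (not data from the study).
observed = [42.0, 55.5, 61.2, 78.9, 90.3]
predicted = [43.1, 54.0, 62.0, 77.5, 91.0]
print(regression_metrics(observed, predicted))
```

MAE and RMSE are in the same units as the index itself, so they read as typical prediction errors in index points, whereas R² and R are unitless measures of fit.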
Pages: 14