A robust and explainable deep learning model based on an LSTM-CNN framework for reliable FDOM prediction in water quality monitoring: Incorporating SHAP analysis for enhanced interpretability

被引:0
作者
Alizamir, Meysam [1 ,2 ]
Heddam, Salim [3 ]
Kim, Sungwon [4 ]
机构
[1] Duy Tan Univ, Inst Res & Dev, Da Nang, Vietnam
[2] Duy Tan Univ, Sch Engn & Technol, Da Nang, Vietnam
[3] Univ 20 Aout 1955 Skikda, Fac Sci, Agron Dept, Hydraul Div, Skikda, Algeria
[4] Dongyang Univ, Dept Railroad Construct & Safety Engn, Yeongju, South Korea
关键词
Water quality; FDOM; LSTM; CNN; SHAP; CLIMATE-CHANGE; NEURAL-NETWORKS; WEB TOOL; RIVER; MANAGEMENT; POLLUTION; IMPACTS; DECOMPOSITION; DYNAMICS;
D O I
10.1016/j.psep.2025.107594
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The quality and availability of water play a vital role in sustaining human life, driving economic growth, and maintaining public health and environmental balance. Moreover, the traditional methods of assessing water quality rely on costly and lengthy laboratory testing and statistical evaluations. Given the serious risks posed by poor water quality, there is an urgent need for faster and more economical assessment techniques. Therefore, precise forecasting of water quality parameters stands as a critical tool for enhancing both water resource management and efforts to combat contamination. To bridge these gaps, this study suggested multiple deep learning algorithms (LSTM, CNN, GRU, BiLSTM, BiGRU, and LSTM-CNN) to predict daily fluorescent dissolved organic matter (FDOM) concentrations. For a more accurate assessment, the results of the suggested models were compared with baseline models, including CART and MLR. The analysis incorporated using nine different scenarios from seven water quality parameters including discharge (Q), water temperature (Tw), specific conductivity (SC), dissolved oxygen (DO), pH, turbidity (TU), chlorophyll-a (Chl-a), and also, YY (year), MM (month), and DD (day), from two USGS monitoring stations (14211720 and 14203500) in Oregon, USA. In this study, the models' performance was evaluated using four metrics: root mean squared error (RMSE), Nash-Sutcliffe efficiency (NSE), correlation coefficient (R), and mean absolute error (MAE). Analyzing results from both stations, the hybrid LSTM-CNN model demonstrated superior FDOM prediction accuracy compared to standalone architectures (LSTM, CNN, GRU, BiLSTM, BiGRU). At USGS 14203500, LSTM-CNN achieved RMSE of 2.867 ppb QSE, MAE of 1.641 ppb QSE, and R of 0.965. For USGS 14211720, it yielded RMSE of 1.022 ppb QSE, MAE of 0.631 ppb QSE, and R of 0.989. Based on SHAP results, DO, pH, TU, and Chl-a were identified as the most important parameters for predicting FDOM in each model's performance. Finally, this study demonstrates that the hybrid LSTM-CNN approach effectively predicts FDOM concentrations, making it an efficient tool for water quality monitoring.
引用
收藏
页数:29
相关论文
共 137 条
[1]   Various Natural and Anthropogenic Factors Responsible for Water Quality Degradation: A Review [J].
Akhtar, Naseem ;
Syakir Ishak, Muhammad Izzuddin ;
Bhawani, Showkat Ahmad ;
Umar, Khalid .
WATER, 2021, 13 (19)
[2]   Explainable AI for Retinoblastoma Diagnosis: Interpreting Deep Learning Models with LIME and SHAP [J].
Aldughayfiq, Bader ;
Ashfaq, Farzeen ;
Jhanjhi, N. Z. ;
Humayun, Mamoona .
DIAGNOSTICS, 2023, 13 (11)
[3]   An efficient computational investigation on accurate daily soil temperature prediction using boosting ensemble methods explanation based on SHAP importance analysis [J].
Alizamir, Meysam ;
Wang, Mo ;
Ikram, Rana Muhammad Adnan ;
Ahmed, Kaywan Othman ;
Heddam, Salim ;
Kim, Sungwon .
RESULTS IN ENGINEERING, 2024, 24
[4]   Developing an efficient explainable artificial intelligence approach for accurate reverse osmosis desalination plant performance prediction: application of SHAP analysis [J].
Alizamir, Meysam ;
Wang, Mo ;
Ikram, Rana Muhammad Adnan ;
Kim, Sungwon ;
Ahmed, Kaywan Othman ;
Heddam, Salim .
ENGINEERING APPLICATIONS OF COMPUTATIONAL FLUID MECHANICS, 2024, 18 (01)
[5]   An efficient data fusion model based on Bayesian model averaging for robust water quality prediction using deep learning strategies [J].
Alizamir, Meysam ;
Moradveisi, Kayhan ;
Ahmed, Kaywan Othman ;
Bahrami, Jamil ;
Kim, Sungwon ;
Heddam, Salim .
EXPERT SYSTEMS WITH APPLICATIONS, 2025, 261
[6]   Development of a robust daily soil temperature estimation in semi-arid continental climate using meteorological predictors based on computational intelligent paradigms [J].
Alizamir, Meysam ;
Ahmed, Kaywan Othman ;
Kim, Sungwon ;
Heddam, Salim ;
Gorgij, AliReza Docheshmeh ;
Chang, Sun Woo .
PLOS ONE, 2023, 18 (12)
[7]   Accurately predicting the performance of MOF-based mixed matrix membranes for CO2 removal using a novel optimized extreme learning machine by BAT algorithm [J].
Alizamir, Meysam ;
Keshavarz, Ahmad ;
Abdollahi, Farideh ;
Khosravi, Arash ;
Karagoz, Seckin .
SEPARATION AND PURIFICATION TECHNOLOGY, 2023, 325
[8]   A New Insight for Daily Solar Radiation Prediction by Meteorological Data Using an Advanced Artificial Intelligence Algorithm: Deep Extreme Learning Machine Integrated with Variational Mode Decomposition Technique [J].
Alizamir, Meysam ;
Othman Ahmed, Kaywan ;
Shiri, Jalal ;
Fakheri Fard, Ahmad ;
Kim, Sungwon ;
Heddam, Salim ;
Kisi, Ozgur .
SUSTAINABILITY, 2023, 15 (14)
[9]   Improving the accuracy of daily solar radiation prediction by climatic data using an efficient hybrid deep learning model: Long short-term memory (LSTM) network coupled with wavelet transform [J].
Alizamir, Meysam ;
Shiri, Jalal ;
Fard, Ahmad Fakheri ;
Kim, Sungwon ;
Gorgij, AliReza Docheshmeh ;
Heddam, Salim ;
Singh, Vijay P. .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 123
[10]   On the implementation of a novel data-intelligence model based on extreme learning machine optimized by bat algorithm for estimating daily chlorophyll-a concentration: Case studies of river and lake in USA [J].
Alizamir, Meysam ;
Heddam, Salim ;
Kim, Sungwon ;
Mehr, Ali Danandeh .
JOURNAL OF CLEANER PRODUCTION, 2021, 285