New Graph-Based and Transformer Deep Learning Models for River Dissolved Oxygen Forecasting

被引:4
作者
Rocha, Paulo Alexandre Costa [1 ,2 ]
Santos, Victor Oliveira [1 ]
The, Jesse Van Griensven [1 ,3 ]
Gharabaghi, Bahram [1 ]
机构
[1] Univ Guelph, Sch Engn, 50 Stone Rd E, Guelph, ON N1G 2W1, Canada
[2] Univ Fed Ceara, Technol Ctr, Mech Engn Dept, BR-60020181 Fortaleza, CE, Brazil
[3] Lakes Environm Res Inc, 170 Columbia St W, Waterloo, ON N2L 3L3, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
pollution; dissolved oxygen; Credit River; machine learning; graph neural networks; SHAP analysis; BLACK-BOX; NEURAL-NETWORKS; WATER-QUALITY;
D O I
10.3390/environments10120217
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Dissolved oxygen (DO) is a key indicator of water quality and the health of an aquatic ecosystem. Aspiring to reach a more accurate forecasting approach for DO levels of natural streams, the present work proposes new graph-based and transformer-based deep learning models. The models were trained and validated using a network of real-time hydrometric and water quality monitoring stations for the Credit River Watershed, Ontario, Canada, and the results were compared with both benchmarking and state-of-the-art approaches. The proposed new Graph Neural Network Sample and Aggregate (GNN-SAGE) model was the best-performing approach, reaching coefficient of determination (R2) and root mean squared error (RMSE) values of 97% and 0.34 mg/L, respectively, when compared with benchmarking models. The findings from the Shapley additive explanations (SHAP) indicated that the GNN-SAGE benefited from spatiotemporal information from the surrounding stations, improving the model's results. Furthermore, temperature has been found to be a major input attribute for determining future DO levels. The results established that the proposed GNN-SAGE model outperforms the accuracy of existing models for DO forecasting, with great potential for real-time water quality management in urban watersheds.
引用
收藏
页数:24
相关论文
共 95 条
[1]   A Multi-Step Approach for Optically Active and Inactive Water Quality Parameter Estimation Using Deep Learning and Remote Sensing [J].
Ahmed, Mehreen ;
Mumtaz, Rafia ;
Anwar, Zahid ;
Shaukat, Arslan ;
Arif, Omar ;
Shafait, Faisal .
WATER, 2022, 14 (13)
[2]   Historical changes in the fish communities of the Credit River watershed [J].
Allen, Brett ;
Mandrak, Nicholas E. .
AQUATIC ECOSYSTEM HEALTH & MANAGEMENT, 2019, 22 (03) :316-328
[3]  
[Anonymous], 1999, Canadian Environmental Quality Guidelines
[4]   Leveraging data from nearby stations to improve short-term wind speed forecasts [J].
Baile, Rachel ;
Muzy, Jean-Francois .
ENERGY, 2023, 263
[5]   Distribution, sources and consequences of nutrients, persistent organic pollutants, metals and microplastics in South American estuaries [J].
Barletta, Mario ;
Lima, Andre R. A. ;
Costa, Monica F. .
SCIENCE OF THE TOTAL ENVIRONMENT, 2019, 651 :1199-1218
[6]   Short-term water quality variable prediction using a hybrid CNN-LSTM deep learning model [J].
Barzegar, Rahim ;
Aalami, Mohammad Taghi ;
Adamowski, Jan .
STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT, 2020, 34 (02) :415-433
[7]  
Benesty Jacob, 2009, The SAGE Encyclopedia of Educational Research, Measurement, and Evaluation, P1, DOI DOI 10.4135/9781506326139
[8]   A comparative analysis of gradient boosting algorithms [J].
Bentejac, Candice ;
Csorgo, Anna ;
Martinez-Munoz, Gonzalo .
ARTIFICIAL INTELLIGENCE REVIEW, 2021, 54 (03) :1937-1967
[9]   Comparison of machine learning algorithms to predict dissolved oxygen in an urban stream [J].
Bolick, Madeleine M. ;
Post, Christopher J. ;
Naser, Mohannad-Zeyad ;
Mikhailova, Elena A. .
ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH, 2023, 30 (32) :78075-78096
[10]  
Bondi A. B., 2000, Proceedings Second International Workshop on Software and Performance. WOSP2000, P195, DOI 10.1145/350391.350432