Multi-Model Machine Learning Approach Accurately Predicts Lake Dissolved Oxygen With Multiple Environmental Inputs

被引:1
|
作者
Lin, Shuqi [1 ,2 ,3 ]
Pierson, Donald C. [2 ,3 ]
Ladwig, Robert [4 ]
Kraemer, Benjamin M. [5 ]
Hu, Fenjuan R. S. [6 ]
机构
[1] Canada Ctr Inland Waters, Environm & Climate Change Canada, Burlington, ON, Canada
[2] Uppsala Univ, Erken Lab, Uppsala, Sweden
[3] Uppsala Univ, Limnol Dept, Uppsala, Sweden
[4] Univ Wisconsin Madison, Ctr Limnol, Madison, WI USA
[5] IGB Leibniz Inst Freshwater Ecol & Inland Fisherie, Berlin, Germany
[6] VIA Univ Coll, Res Ctr Bldg Energy Water & Climate, Horsens, Denmark
关键词
LONG-TERM CHANGES; CLIMATE-CHANGE; CENTRAL BASIN; NORTH SHORE; HYPOXIA; PHOSPHORUS; DEPLETION; MODEL; VARIABILITY; DYNAMICS;
D O I
10.1029/2023EA003473
中图分类号
P1 [天文学];
学科分类号
0704 ;
摘要
As a key water quality parameter, dissolved oxygen (DO) concentration, and particularly changes in bottom water DO is fundamental for understanding the biogeochemical processes in lake ecosystems. Based on two machine learning (ML) models, Gradient Boost Regressor (GBR) and long-short-term-memory (LSTM) network, this study developed three ML model approaches: direct GBR; direct LSTM; and a 2-step mixed ML model workflow combining both GBR and LSTM. They were used to simulate multi-year surface and bottom DO concentrations in five lakes. All approaches were trained with readily available environmental data as predictors. Indices of lake thermal structure and mixing provided by a one-dimensional (1-D) hydrodynamic model were also included as predictors in the ML models. The advantages of each ML approach were not consistent for all the tested lakes, but the best one of them was defined that can estimate DO concentration with coefficient of determination (R2) up to 0.6-0.7 in each lake. All three approaches have normalized mean absolute error (NMAE) under 0.15. In a polymictic lake, the 2-step mixed model workflow showed better representation of bottom DO concentrations, with a highest true positive rate (TPR) of hypolimnetic hypoxia detection of over 90%, while the other workflows resulted in, TPRs are around 50%. In most of the tested lakes, the predicted surface DO concentrations and variables indicating stratified conditions (i.e., Wedderburn number and the temperature difference between surface and bottom water) are essential for simulating bottom DO. The ML approaches showed promising results and could be used to support short- and long-term water management plans. Dissolved oxygen (DO) concentrations is the essential water quality parameter in lake systems. Nowadays, with the development of data-driven machine learning (ML) models, prediction of DO concentrations can be achieved via these models in lakes with long-term DO concentration observations. This study developed three ML model approaches with one mixed two kind of ML models, and test them in five lakes. Readily available environmental data and the derived hydrodynamic data from process-based hydrodynamic model were used as predictors. All three ML approaches showed promising results, and the mixed ML approach show better skill in the lake stratifying and mixing irregularly. To predict hypoxia in the bottom of the lake, the surface DO concentrations and variables indicating water column stratification are important. A 2-step machine learning workflow combining Gradient Boost Regressor and long-short-term-memory was applied to simulate dissolved oxygen A one-dimensional process-based hydrodynamic model provides ML models with indices of lake thermal structure and mixing In a polymictic lake, the 2-step mixed machine learning workflow showed over 90% true positive rate of hypolimnetic hypoxia detection
引用
收藏
页数:16
相关论文
共 28 条
  • [1] A multi-model ensemble approach for reservoir dissolved oxygen forecasting based on feature screening and machine learning
    Zhang, Peng
    Liu, Xinyang
    Dai, Huancheng
    Shi, Chengchun
    Xie, Rongrong
    Song, Gangfu
    Tang, Lei
    ECOLOGICAL INDICATORS, 2024, 166
  • [2] A multi-model approach to evaluating target phosphorus loads for Lake Erie
    Scavia, Donald
    DePinto, Joseph V.
    Bertani, Isabella
    JOURNAL OF GREAT LAKES RESEARCH, 2016, 42 (06) : 1139 - 1150
  • [3] An Interpretable Multi-Model Machine Learning Approach for Spatial Mapping of Deep-Sea Polymetallic Nodule Occurrences
    Gazis, Iason-Zois
    Charlet, Francois
    Greinert, Jens
    NATURAL RESOURCES RESEARCH, 2024, 33 (06) : 2473 - 2501
  • [4] The Geometry of Flow: Advancing Predictions of River Geometry With Multi-Model Machine Learning
    Chang, Shuyu Y.
    Ghahremani, Zahra
    Manuel, Laura
    Erfani, Seyed Mohammad Hassan
    Shen, Chaopeng
    Cohen, Sagy
    Van Meter, Kimberly J.
    Pierce, Jennifer L.
    Meselhe, Ehab A.
    Goharian, Erfan
    WATER RESOURCES RESEARCH, 2024, 60 (10)
  • [5] Advancing ocean subsurface thermal structure estimation in the Pacific Ocean: A multi-model ensemble machine learning approach
    Qi, Jifeng
    Zhang, Linlin
    Yin, Baoshu
    Li, Delei
    Xie, Bowen
    Sun, Guimin
    DYNAMICS OF ATMOSPHERES AND OCEANS, 2023, 104
  • [6] Assessing physical and biological lake oxygen indicators using simulated environmental variables and machine learning algorithms
    Chang, Feng
    Vlahos, P.
    Astitha, M.
    ENVIRONMENTAL MODELLING & SOFTWARE, 2024, 176
  • [7] An assessment of the predictability of column minimum dissolved oxygen concentrations in Chesapeake Bay using a machine learning model
    Ross, Andrew C.
    Stock, Charles A.
    ESTUARINE COASTAL AND SHELF SCIENCE, 2019, 221 : 53 - 65
  • [8] Email-Based Cyberstalking Detection On Textual Data Using Multi-Model Soft Voting Technique Of Machine Learning Approach
    Gautam, Arvind Kumar
    Bansal, Abhishek
    JOURNAL OF COMPUTER INFORMATION SYSTEMS, 2023, 63 (06) : 1362 - 1381
  • [9] Multi-model inference in comparative phylogeography: an integrative approach based on multiple lines of evidence
    Collevatti, Rosane G.
    Terribile, Levi C.
    Diniz-Filho, Jose A. F.
    Lima-Ribeiro, Matheus S.
    FRONTIERS IN GENETICS, 2015, 6
  • [10] Multi-Model Ensemble Machine Learning Approaches to Project Climatic Scenarios in a River Basin in the Pyrenees
    Bilbao-Barrenetxea, Nerea
    Martinez-Espana, Raquel
    Jimeno-Saez, Patricia
    Faria, Sergio Henrique
    Senent-Aparicio, Javier
    EARTH SYSTEMS AND ENVIRONMENT, 2024, 8 (04) : 1159 - 1177