Combining physical-based model and machine learning to forecast chlorophyll-a concentration in freshwater lakes

被引:12
作者
Chen, Cheng [1 ,2 ,3 ]
Chen, Qiuwen [1 ,3 ,4 ,5 ]
Yao, Siyang [3 ]
He, Mengnan [3 ]
Zhang, Jianyun [1 ,4 ]
Li, Gang [3 ]
Lin, Yuqing [1 ,3 ]
机构
[1] Nanjing Hydraul Res Inst, Natl Key Lab Water Disaster Prevent, Nanjing 210029, Peoples R China
[2] Hohai Univ, Coll Water Conservancy & Hydroelect Power, Nanjing 210098, Peoples R China
[3] Nanjing Hydraul Res Inst, Ctr Ecoenvironm Res, Nanjing 210029, Peoples R China
[4] Yangtze Inst Conservat & Green Dev, Nanjing 210029, Peoples R China
[5] Hujuguan 34, Nanjing 210029, Peoples R China
基金
中国国家自然科学基金;
关键词
Chlorophyll-a concentration; Algal blooms; Freshwater lake; Short-term ensemble forecast; Machine learning; Bayesian model averaging; PREDICTION; RESERVOIRS; QUALITY;
D O I
10.1016/j.scitotenv.2023.168097
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Increasing algal blooms in freshwater lakes have become a serious challenge facing the world. Short-term forecast of chlorophyll-a concentration (Chla) is essential for providing early warnings and taking action to mitigate the risks of algal blooms in freshwater lakes. At present, a variety of data-driven models and physicalbased models have been developed for Chla forecast, yet how to effectively combine multiple models for improving the forecast accuracy remains largely unknown. Here we developed an effective model by combining a physical-based model and machine learning algorithms (long short-term memory, LSTM; random forest, RF; support vector machine, SVM) to forecast the Chla in a freshwater lake, and a Bayesian model averaging (BMA) ensemble forecasting method was further proposed to improve the accuracy and reliability of the forecast results. We found that, with the increase of time steps of advance forecast from 1-day to 7-day, the forecast accuracy as measured by R2 of the machine learning algorithms is decreased from 0.95 to 0.68. The combination of physicalbased modeling with LSTM had great capability in short-term forecast of Chla, owing to the fact that the physicalbased model can provide high-frequency Chla data and LSTM is skilled at forecasting in the sequence. This is also evidenced by the weights in the BMA method. The proposed BMA short-term ensemble forecasting results had the robust performance when compared to each individual machine learning forecast model for the 7-day advance forecast, with the largest R2 (0.834) and the smallest RMSE (0.267 mu g/L). In particular, the uncertainty of a single machine learning model can be effectively reduced by the BMA method.
引用
收藏
页数:11
相关论文
共 46 条
  • [1] An integrated hydrologic Bayesian multimodel combination framework: Confronting input, parameter, and model structural uncertainty in hydrologic prediction
    Ajami, Newsha K.
    Duan, Qingyun
    Sorooshian, Soroosh
    [J]. WATER RESOURCES RESEARCH, 2007, 43 (01)
  • [2] Prediction of Water Level and Water Quality Using a CNN-LSTM Combined Deep Learning Approach
    Baek, Sang-Soo
    Pyo, Jongcheol
    Chun, Jong Ahn
    [J]. WATER, 2020, 12 (12)
  • [3] PROBLEMS OF APPLICATION OF THE ECOLOGICAL MODEL SALMO TO LAKES AND RESERVOIRS HAVING VARIOUS TROPHIC STATES
    BENNDORF, J
    RECKNAGEL, F
    [J]. ECOLOGICAL MODELLING, 1982, 17 (02) : 129 - 145
  • [4] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [5] LIBSVM: A Library for Support Vector Machines
    Chang, Chih-Chung
    Lin, Chih-Jen
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
  • [6] Long-Term Changes and Influencing Factors of Water Quality in Aquaculture Dominated Lakes Unveiled by Sediment Records and Time Series Remote Sensing Images
    Chen, Cheng
    Lin, Miaoli
    Chen, Qiuwen
    He, Mengnan
    Zhang, Jianyun
    Cui, Zhen
    Zhu, Liujun
    [J]. JOURNAL OF GEOPHYSICAL RESEARCH-BIOGEOSCIENCES, 2022, 127 (11)
  • [7] A novel multi-source data fusion method based on Bayesian inference for accurate estimation of chlorophyll-a concentration over eutrophic lakes
    Chen, Cheng
    Chen, Qiuwen
    Li, Gang
    He, Mengnan
    Dong, Jianwei
    Yan, Hanlu
    Wang, Zhiyuan
    Duan, Zheng
    [J]. ENVIRONMENTAL MODELLING & SOFTWARE, 2021, 141 (141)
  • [8] Assimilating multi-source data into a three-dimensional hydro-ecological dynamics model using Ensemble Kalman Filter
    Chen, Cheng
    Huang, Jiacong
    Chen, Qiuwen
    Zhang, Jianyun
    Li, Zhijie
    Lin, Yuqing
    [J]. ENVIRONMENTAL MODELLING & SOFTWARE, 2019, 117 : 188 - 199
  • [9] CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411
  • [10] HUP-BMA: An Integration of Hydrologic Uncertainty Processor and Bayesian Model Averaging for Streamflow Forecasting
    Darbandsari, Pedram
    Coulibaly, Paulin
    [J]. WATER RESOURCES RESEARCH, 2021, 57 (10)