Short-term Lake Erie algal bloom prediction by classification and regression models

被引:26
作者
Ai, Haiping [1 ]
Zhang, Kai [1 ]
Sun, Jiachun [1 ]
Zhang, Huichun [1 ]
机构
[1] Case Western Reserve Univ, Dept Civil & Environm Engn, Cleveland, OH 44106 USA
基金
美国国家科学基金会;
关键词
Bloom forecast; Feature selection; Long-short term memory; Machine learning; Random forest; Time series modeling; CYANOBACTERIAL BLOOMS; MICROCYSTIS BLOOMS; PHOSPHORUS LOADS; CLIMATE-CHANGE; LONG-TERM; WESTERN; FLOW; NETWORKS; NITROGEN; BIOMASS;
D O I
10.1016/j.watres.2023.119710
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The recent outbreaks of harmful algal blooms in the western Lake Erie Basin (WLEB) have drawn tremendous attention to bloom prediction for better control and management. Many weekly to annual bloom prediction models have been reported, but they only employ small datasets, have limited types of input features, build linear regression or probabilistic models, or require complex process-based computations. To address these limitations, we conducted a comprehensive literature review, complied a large dataset containing chlorophyll-a index (from 2002 to 2019) as the output and a novel combination of riverine (the Maumee & Detroit Rivers) and meteorological (WLEB) features as the input, and built machine learning-based classification and regression models for 10-d scale bloom predictions. By analyzing the feature importance, we identified 8 most important features for the HAB control, including nitrogen loads, time, water levels, soluble reactive phosphorus load, and solar irradiance. Here, both long- and short-term nitrogen loads were for the first time considered in HAB models for Lake Erie. Based on these features, the 2-, 3-, and 4-level random forest classification models achieved an accuracy of 89.6%, 77.0%, and 66.7%, respectively, and the regression model achieved an R2 value of 0.69. In addition, longshort term memory (LSTM) was implemented to predict temporal trends of four short-term features (N, solar irradiance, and two water levels) and achieved the Nash-Sutcliffe efficiency of 0.12-0.97. Feeding the LSTM model predictions for these features into the 2-level classification model reached an accuracy of 86.0% for predicting the HABs in 2017-2018, suggesting that we can provide short-term HAB forecasts even when the feature values are not available.
引用
收藏
页数:10
相关论文
共 67 条
  • [1] Evaluation of the current state of mechanistic aquatic biogeochemical modeling
    Arhonditsis, GB
    Brett, MT
    [J]. MARINE ECOLOGY PROGRESS SERIES, 2004, 271 : 13 - 26
  • [2] Large area hydrologic modeling and assessment - Part 1: Model development
    Arnold, JG
    Srinivasan, R
    Muttiah, RS
    Williams, JR
    [J]. JOURNAL OF THE AMERICAN WATER RESOURCES ASSOCIATION, 1998, 34 (01): : 73 - 89
  • [3] Classification of annual Great Lakes ice cycles: Winters of 1973-2002
    Assel, RA
    [J]. JOURNAL OF CLIMATE, 2005, 18 (22) : 4895 - 4905
  • [4] Probabilistically assessing the role of nutrient loading in harmful algal bloom formation in western Lake Erie
    Bertani, Isabella
    Obenour, Daniel R.
    Steger, Cara E.
    Stow, Craig A.
    Gronewold, Andrew D.
    Scavia, Donald
    [J]. JOURNAL OF GREAT LAKES RESEARCH, 2016, 42 (06) : 1184 - 1192
  • [5] Tracking cyanobacteria blooms: Do different monitoring approaches tell the same story?
    Bertani, Isabella
    Steger, Cara E.
    Obenour, Daniel R.
    Fahnenstiel, Gary L.
    Bridgeman, Thomas B.
    Johengen, Thomas H.
    Sayers, Michael J.
    Shuchman, Robert A.
    Scavia, Donald
    [J]. SCIENCE OF THE TOTAL ENVIRONMENT, 2017, 575 : 294 - 308
  • [6] Bagging predictors
    Breiman, L
    [J]. MACHINE LEARNING, 1996, 24 (02) : 123 - 140
  • [7] A novel method for tracking western Lake Erie Microcystis blooms, 2002-2011
    Bridgeman, Thomas B.
    Chaffin, Justin D.
    Filbrun, Jesse E.
    [J]. JOURNAL OF GREAT LAKES RESEARCH, 2013, 39 (01) : 83 - 89
  • [8] Chaffin J.D., 2013, ADV MICOBIOL, V3, P16, DOI [10.4236/aim.2013.36A003, DOI 10.4236/AIM.2013.36A003]
  • [9] Effectiveness of a fixed-depth sensor deployed from a buoy to estimate water-column cyanobacterial biomass depends on wind speed
    Chaffin, Justin D.
    Kane, Douglas D.
    Johnson, Alex
    [J]. JOURNAL OF ENVIRONMENTAL SCIENCES, 2020, 93 : 23 - 29
  • [10] Summer phytoplankton nutrient limitation in Maumee Bay of Lake Erie during high-flow and low-flow years
    Chaffin, Justin D.
    Bridgeman, Thomas B.
    Bade, Darren L.
    Mobilian, Courtney N.
    [J]. JOURNAL OF GREAT LAKES RESEARCH, 2014, 40 (03) : 524 - 531