Algal bloom forecasting leveraging signal processing: A novel perspective from ensemble learning

被引:0
作者
Xu, Caicai [1 ,2 ,3 ]
Huang, Yuzhou [4 ,5 ]
Xin, Ruoxue [3 ,6 ]
Wu, Na [1 ,2 ]
Liu, Muyuan [7 ]
机构
[1] Inst Zhejiang Univ, 99 Zheda Rd, Quzhou 324000, Peoples R China
[2] Zhejiang Univ, Coll Chem & Biol Engn, Key Lab Biomass Chem Engn, Minist Educ, Hangzhou 310027, Peoples R China
[3] Shandong Key Lab Marine Ecol Environm & Disaster P, Qingdao 266061, Peoples R China
[4] Minist Nat Resources, Inst Oceanog 4, Key Lab Trop Marine Ecosyst & Bioresource, Beihai 536000, Peoples R China
[5] Minist Nat Resources, Inst Oceanog 4, Guangxi Key Lab Beibu Gulf Marine Resources Enviro, Beihai 536015, Peoples R China
[6] North China Sea Marine Forecasting Ctr, State Ocean Adm, Qingdao 266000, Peoples R China
[7] Imperial Coll London, Dept Earth Sci & Engn, London SW7 2AZ, England
关键词
Algal blooms forecasting; Signal processing; CEEMDAN; Machine learning; Ensemble learning; NETWORK; SELECTION; MODELS;
D O I
10.1016/j.watres.2025.123800
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Accurate forecasting of algal blooms is essential for implementing timely control measures. However, given their inherent complex time-frequency characteristics, capturing the dynamics of algal blooms remains an ongoing challenge in standalone models. Targeting this challenge, this study demonstrates an ensemble framework that combines signal processing with machine learning (ML) techniques to collectively forecast algal dynamics. This method utilizes an efficient signal processing algorithm, namely the compete ensemble empirical mode decomposition with adaptive noise (CEEMDAN), to decompose the highly non-stationary patterns of algal dynamics, while leveraging the complementary strengths of four distinct ML models to optimize the learning of the decomposed components. Our results demonstrated that CEEMDAN can largely improve the forecasting performance of standalone ML models (e.g., long short-term memory), achieving an average increase in validation R2 by 63 %. Moreover, by incorporating the ensemble effects that leverage model-specific strengths, this performance gain was further amplified, resulting in an average increase of 75 % in validation R2 compared to standalone ML models. The developed method, termed CEEMDAN-Hybrid-Ensemble (CHES) model, consistently delivered accurate forecasting of algal dynamics across multiple time resolutions (hourly, daily, and biweekly) in both rivers (River Enborne and The Cut) and lakes (Blelham Tarn and Lake Lillinonah), as suggested by high validation R2 values of 0.955, 0.878, 0.824, and 0.957, respectively. In addition, the CHES model achieved stable multi-step forecasting of algal dynamics with gaps ranging from 1 to 7 steps, as indicated by an average validation R2 of 0.72 +/- 0.17 (S.D.) and an average validation root-mean-square-error (RMSE) of 0.32 +/- 0.11 RFU. This study highlighted the ensemble effect achieved by integrating signal processing and ML techniques, presenting a novel perspective that enhances forecasting robustness to support the early warning of algal blooms.
引用
收藏
页数:12
相关论文
共 64 条
[1]  
[Anonymous], 2017, AUTOMATIC DIFFERENTI
[2]  
Bowes M.J., 2015, NERC Environm. Inf. Data Centre, DOI [10.5285/abe4dd7c, DOI 10.5285/ABE4DD7C]
[3]  
Bowes MJ, 2015, EIDC, DOI 10.5285/11D712E0-7456-4EA9-8AF8-FE81A666E91B
[4]   Development and application of a decision group Back-Propagation Neural Network for flood forecasting [J].
Chen, Chang-Shian ;
Chen, Boris Po-Tsang ;
Chou, Frederick Nai-Fang ;
Yang, Chao-Chung .
JOURNAL OF HYDROLOGY, 2010, 385 (1-4) :173-182
[5]   A coupled model to improve river water quality prediction towards addressing non-stationarity and data limitation [J].
Chen, Shengyue ;
Huang, Jinliang ;
Wang, Peng ;
Tang, Xi ;
Zhang, Zhenyu .
WATER RESEARCH, 2024, 248
[6]   XGBoost: A Scalable Tree Boosting System [J].
Chen, Tianqi ;
Guestrin, Carlos .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794
[7]   Improved complete ensemble EMD: A suitable tool for biomedical signal processing [J].
Colominas, Marcelo A. ;
Schlotthauer, Gaston ;
Torres, Maria E. .
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2014, 14 :19-29
[8]   Coastal phytoplankton blooms expand and intensify in the 21st century [J].
Dai, Yanhui ;
Yang, Shangbo ;
Zhao, Dan ;
Hu, Chuanmin ;
Xu, Wang ;
Anderson, Donald M. ;
Li, Yun ;
Song, Xiao-Peng ;
Boyce, Daniel G. ;
Gibson, Luke ;
Zheng, Chunmiao ;
Feng, Lian .
NATURE, 2023, 615 (7951) :280-+
[9]   Deep learning based soft-sensor for continuous chlorophyll estimation on decentralized data [J].
Diaz, Judith Sainz-Pardo ;
Castrillo, Maria ;
Garcia, Alvaro Lopez .
WATER RESEARCH, 2023, 246
[10]   Modelling algal abnormal proliferation in a reservoir using support vector regression: a case study [J].
Diaz Muniz, C. ;
Alonso Fernandez, J. R. ;
Garcia Nieto, P. J. ;
Alvarez Anton, J. C. .
ECOHYDROLOGY, 2015, 8 (06) :1109-1118