Two-stage meta-ensembling machine learning model for enhanced water quality forecasting

Cited by: 12

Authors:

Heydari, Sepideh ^{[1]}

Nikoo, Mohammad Reza ^{[2]}

Mohammadi, Ali ^{[3]}

Barzegar, Rahim ^{[4]}

Affiliations:

[1] Univ Tehran, Fac Environm Engn, Dept Environm Engn, Tehran, Iran

[2] Sultan Qaboos Univ, Dept Civil & Architectural Engn, Muscat, Oman

[3] Sharif Univ Technol, Dept Ind Engn, Tehran, Iran

[4] Univ Quebec Abitibi Temiscamingue UQAT, Res Inst Mines & Environm RIME, Groundwater Res Grp GRES, Amos, PQ, Canada

Source:

JOURNAL OF HYDROLOGY | 2024 / Vol. 641

Keywords:

Water quality forecasting; Machine learning; Multi-objective optimization; Genetic algorithm; Grey Wolf Optimizer; Chlorophyll-a and Dissolved oxygen; PREDICTION; IMPLEMENTATION; DECOMPOSITION; PERFORMANCE; STREAMFLOW; RESOURCES; SYSTEM;

DOI:

10.1016/j.jhydrol.2024.131767

Chinese Library Classification:

TU [Building Science];

Discipline classification code:

0813;

Abstract:

Accurate short-term forecasting of water quality variables (WQVs) such as dissolved oxygen (DO) and chlorophyll-a (Chl-a) is crucial for the effective management of aquatic resources. This study introduces a robust two-stage optimization-ensembling framework that integrates the Grey Wolf Optimizer (GWO) and the Nondominated Sorting Genetic Algorithm II (NSGA-II) to enhance the forecasting capabilities of machine learning (ML) models. Focusing on Small Prespa Lake, Greece, we implemented an array of diverse ML techniques, including eXtreme Gradient Boosting (XGB), Gradient Boosting Regressor (GBR), Light Gradient-Boosting Machine (LightGBM), and Multilayer Perceptron (MLP). These models were fine-tuned using GWO to optimize their performance over critical WQVs predicted six hours in advance. Our methodology employed rigorous data preprocessing techniques, including lag time feature engineering and principal component analysis (PCA), to handle the high dimensionality of the dataset. Lag times ranging from 6 to 24 hours were evaluated, with the 24-hour lag proving to be the most effective in utilizing historical data to enhance forecasting accuracy. The GWO not only facilitated hyperparameter tuning but also demonstrated a notable improvement (7.6%) in the Kling-Gupta Efficiency (KGE) over conventional randomized search methods. Subsequently, the NSGA-II was utilized for multi-objective optimization, constructing powerful model ensembles that outperformed the individual GWO-optimized models by up to 7% in KGE. In comparison to a standard genetic algorithm-based ensemble, the NSGA-II ensemble demonstrated superior effectiveness in balancing solution quality. This innovative approach not only establishes a new benchmark in water quality forecasting but also contributes substantially to proactive environmental monitoring and management strategies.
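As a rough illustration of two ingredients named in the abstract, the sketch below computes the Kling-Gupta Efficiency and builds a lagged design matrix for a fixed forecast horizon. It is not the authors' code. The function names `kge` and `make_lagged` are hypothetical, the KGE form used is the original 2009 definition (the record does not say which variant the paper uses), and the example assumes hourly sampling so that 24 lags and a horizon of 6 correspond to the 24-hour lag and six-hour lead time mentioned above.

```python
import numpy as np


def kge(obs, sim):
    """Kling-Gupta Efficiency, original 2009 form (assumed variant); 1.0 is perfect."""
    obs = np.asarray(obs, dtype=float)
    sim = np.asarray(sim, dtype=float)
    r = np.corrcoef(obs, sim)[0, 1]   # linear correlation
    alpha = sim.std() / obs.std()     # variability ratio
    beta = sim.mean() / obs.mean()    # bias ratio
    return 1.0 - np.sqrt((r - 1) ** 2 + (alpha - 1) ** 2 + (beta - 1) ** 2)


def make_lagged(series, n_lags, horizon):
    """Each row holds n_lags consecutive values; the target lies `horizon` steps after the last one."""
    series = np.asarray(series, dtype=float)
    rows = len(series) - n_lags - horizon + 1
    X = np.stack([series[i:i + n_lags] for i in range(rows)])
    y = series[n_lags + horizon - 1:]
    return X, y


# Hypothetical hourly series: 24 lagged values predict the value 6 hours ahead.
hourly = np.arange(40.0)
X, y = make_lagged(hourly, n_lags=24, horizon=6)
```

In a pipeline like the one described, `X` would typically pass through PCA before model fitting, and `kge` would serve as the score that the hyperparameter search and ensemble weighting try to maximize.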


Pages: 16

50 items in total

[21] A two-stage multiple-point conceptual model to predict river stage-discharge process using machine learning approaches [J].

Alizadeh, Farhad ;

Gharamaleki, Alireza Faregh ;

Jalilzadeh, Rasoul .

JOURNAL OF WATER AND CLIMATE CHANGE, 2021, 12 (01) :278-295

[22] An enhanced machine learning model for urban air quality forecasting under intense human activities [J].

Wang, Yelin ;

Xia, Feiyang ;

Yao, Linlin ;

Zhao, Shunyu ;

Li, Youjie ;

Cai, Yanpeng .

URBAN CLIMATE, 2025, 60

[23] A Two-Stage Approach for Flight Departure Delay Forecasting Using Ensemble Learning [J].

Guan, Feng ;

Hao, Mengyan ;

Guo, Zhen .

CICTP 2020: TRANSPORTATION EVOLUTION IMPACTING FUTURE MOBILITY, 2020, :209-220

[24] Two-Stage Hybrid Extreme Learning Machine for Sequential Imbalanced Data [J].

Mao, Wentao ;

Wang, Jinwan ;

He, Ling ;

Tian, Yangyang .

PROCEEDINGS OF ELM-2015, VOL 1: THEORY, ALGORITHMS AND APPLICATIONS (I), 2016, 6 :423-433

[25] Machine Learning for K-Adaptability in Two-Stage Robust Optimization [J].

Julien, Esther ;

Postek, Krzysztof ;

Birbil, S. Ilker .

INFORMS JOURNAL ON COMPUTING, 2025, 37 (03) :644-665

[26] Predicting Stock Price Using Two-Stage Machine Learning Techniques [J].

Zhang, Jun ;

Li, Lan ;

Chen, Wei .

COMPUTATIONAL ECONOMICS, 2021, 57 (04) :1237-1261

[27] A Two-Stage Machine Learning Approach to Forecast the Lifetime of Movies in a Multiplex [J].

Ragav, Abhijith ;

Venkatesh, Sai Vishwanath ;

Murugappan, Ramanathan ;

Vijayaraghavan, Vineeth .

ADVANCES IN INFORMATION AND COMMUNICATION, VOL 2, 2020, 1130 :480-493

[28] Machine Learning-Enabled Evolutionary Two-Stage Stochastic Programming [J].

Pan, Jeng-Shyang ;

Song, Pei-Cheng ;

Chu, Shu-Chuan ;

Snasel, Vaclav ;

Watada, Junzo .

IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2025,

[29] Forecasting value of agricultural imports using a novel two-stage hybrid model [J].

Lee, Yi-Shian ;

Liu, Wan-Yu .

COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2014, 104 :71-83

[30] Two-stage model in perceptual learning: toward a unified theory [J].

Shibata, Kazuhisa ;

Sagi, Dov ;

Watanabe, Takeo .

YEAR IN COGNITIVE NEUROSCIENCE, 2014, 1316 :18-28
