Real-Time Control of A2O Process in Wastewater Treatment Through Fast Deep Reinforcement Learning Based on Data-Driven Simulation Model

被引：0

作者：

Hu, Fukang ^{[1
]}

Zhang, Xiaodong ^{[2
]}

Lu, Baohong ^{[2
]}

Lin, Yue ^{[2
]}

机构：

[1] Univ New South Wales, Coll Civil Engn, Sydney, NSW 2052, Australia

[2] Hohai Univ, Coll Hydrol & Water Resources, Nanjing 210098, Peoples R China

来源：

WATER | 2024年 / 16卷 / 24期

关键词：

anaerobic-anoxic-oxic; real-time control; deep reinforcement learning; deep learning; ENERGY-CONSUMPTION; TREATMENT PLANTS;

D O I：

10.3390/w16243710

中图分类号：

X [环境科学、安全科学];

学科分类号：

08 ; 0830 ;

摘要：

Real-time control (RTC) can be applied to optimize the operation of the anaerobic-anoxic-oxic (A2O) process in wastewater treatment for energy saving. In recent years, many studies have utilized deep reinforcement learning (DRL) to construct a novel AI-based RTC system for optimizing the A2O process. However, existing DRL methods require the use of A2O process mechanistic models for training. Therefore they require specified data for the construction of mechanistic models, which is often difficult to achieve in many wastewater treatment plants (WWTPs) where data collection facilities are inadequate. Also, the DRL training is time-consuming because it needs multiple simulations of mechanistic model. To address these issues, this study designs a novel data-driven RTC method. The method first creates a simulation model for the A2O process using LSTM and an attention module (LSTM-ATT). This model can be established based on flexible data from the A2O process. The LSTM-ATT model is a simplified version of a large language model (LLM), which has much more powerful ability in analyzing time-sequence data than usual deep learning models, but with a small model architecture that avoids overfitting the A2O dynamic data. Based on this, a new DRL training framework is constructed, leveraging the rapid computational capabilities of LSTM-ATT to accelerate DRL training. The proposed method is applied to a WWTP in Western China. An LSTM-ATT simulation model is built and used to train a DRL RTC model for a reduction in aeration and qualified effluent. For the LSTM-ATT simulation, its mean squared error remains between 0.0039 and 0.0243, while its R-squared values are larger than 0.996. The control strategy provided by DQN effectively reduces the average DO setpoint values from 3.956 mg/L to 3.884 mg/L, with acceptable effluent. This study provides a pure data-driven RTC method for the A2O process in WWTPs based on DRL, which is effective in energy saving and consumption reduction. It also demonstrates that purely data-driven DRL can construct effective RTC methods for the A2O process, providing a decision-support method for management.

引用

页数：13

共 50 条

[21] Data-driven sustainable distributed energy resources? control based on multi-agent deep reinforcement learning
Jendoubi, Imen
Bouffard, Francois
SUSTAINABLE ENERGY GRIDS & NETWORKS, 2022, 32
[22] A Model-Free Deep Reinforcement Learning-Based Approach for Assessment of Real-Time PV Hosting Capacity
Suchithra, Jude
Robinson, Duane A.
Rajabi, Amin
ENERGIES, 2024, 17 (09)
[23] A Deep Reinforcement Learning Real-Time Recommendation Model Based on Long and Short-Term Preference
Yan-e Hou
Wenbo Gu
WeiChuan Dong
Lanxue Dang
International Journal of Computational Intelligence Systems, 16
[24] A Deep Reinforcement Learning Real-Time Recommendation Model Based on Long and Short-Term Preference
Hou, Yan-e
Gu, Wenbo
Dong, WeiChuan
Dang, Lanxue
INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2023, 16 (01)
[25] Real-time monitoring of forward osmosis membrane fouling in wastewater reuse process performed with a deep learning model
Im, Sung Ju
Nguyen Duc Viet
Jang, Am
CHEMOSPHERE, 2021, 275
[26] Data-driven prediction and control of wastewater treatment process through the combination of convolutional neural network and recurrent neural network
Guo, Zhiwei
Du, Boxin
Wang, Jianhui
Shen, Yu
Li, Qiao
Feng, Dong
Gao, Xu
Wang, Heng
RSC ADVANCES, 2020, 10 (23) : 13410 - 13419
[27] Evaluation of uncertain signals' impact on deep reinforcement learning-based real-time control strategy of urban drainage systems
Zhang, Mofan
Xu, Zhiwei
Wang, Yiming
Zeng, Siyu
Dong, Xin
JOURNAL OF ENVIRONMENTAL MANAGEMENT, 2022, 324
[28] Transfer learning for occupancy-based HVAC control: A data-driven approach using unsupervised learning of occupancy profiles and deep reinforcement learning
Esrafilian-Najafabadi, Mohammad
Haghighat, Fariborz
ENERGY AND BUILDINGS, 2023, 300
[29] Real-time control for fuel-optimal Moon landing based on an interactive deep reinforcement learning algorithm
Lin Cheng
Zhenbo Wang
Fanghua Jiang
Astrodynamics, 2019, 3 : 375 - 386
[30] Real-time control for fuel-optimal Moon landing based on an interactive deep reinforcement learning algorithm
Cheng, Lin
Wang, Zhenbo
Jiang, Fanghua
ASTRODYNAMICS, 2019, 3 (04) : 375 - 386

← 1 2 3 4 5 →