Real-Time Control of A2O Process in Wastewater Treatment Through Fast Deep Reinforcement Learning Based on Data-Driven Simulation Model

被引:0
|
作者
Hu, Fukang [1 ]
Zhang, Xiaodong [2 ]
Lu, Baohong [2 ]
Lin, Yue [2 ]
机构
[1] Univ New South Wales, Coll Civil Engn, Sydney, NSW 2052, Australia
[2] Hohai Univ, Coll Hydrol & Water Resources, Nanjing 210098, Peoples R China
关键词
anaerobic-anoxic-oxic; real-time control; deep reinforcement learning; deep learning; ENERGY-CONSUMPTION; TREATMENT PLANTS;
D O I
10.3390/w16243710
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Real-time control (RTC) can be applied to optimize the operation of the anaerobic-anoxic-oxic (A2O) process in wastewater treatment for energy saving. In recent years, many studies have utilized deep reinforcement learning (DRL) to construct a novel AI-based RTC system for optimizing the A2O process. However, existing DRL methods require the use of A2O process mechanistic models for training. Therefore they require specified data for the construction of mechanistic models, which is often difficult to achieve in many wastewater treatment plants (WWTPs) where data collection facilities are inadequate. Also, the DRL training is time-consuming because it needs multiple simulations of mechanistic model. To address these issues, this study designs a novel data-driven RTC method. The method first creates a simulation model for the A2O process using LSTM and an attention module (LSTM-ATT). This model can be established based on flexible data from the A2O process. The LSTM-ATT model is a simplified version of a large language model (LLM), which has much more powerful ability in analyzing time-sequence data than usual deep learning models, but with a small model architecture that avoids overfitting the A2O dynamic data. Based on this, a new DRL training framework is constructed, leveraging the rapid computational capabilities of LSTM-ATT to accelerate DRL training. The proposed method is applied to a WWTP in Western China. An LSTM-ATT simulation model is built and used to train a DRL RTC model for a reduction in aeration and qualified effluent. For the LSTM-ATT simulation, its mean squared error remains between 0.0039 and 0.0243, while its R-squared values are larger than 0.996. The control strategy provided by DQN effectively reduces the average DO setpoint values from 3.956 mg/L to 3.884 mg/L, with acceptable effluent. This study provides a pure data-driven RTC method for the A2O process in WWTPs based on DRL, which is effective in energy saving and consumption reduction. It also demonstrates that purely data-driven DRL can construct effective RTC methods for the A2O process, providing a decision-support method for management.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Data-driven sustainable distributed energy resources? control based on multi-agent deep reinforcement learning
    Jendoubi, Imen
    Bouffard, Francois
    SUSTAINABLE ENERGY GRIDS & NETWORKS, 2022, 32
  • [22] A Model-Free Deep Reinforcement Learning-Based Approach for Assessment of Real-Time PV Hosting Capacity
    Suchithra, Jude
    Robinson, Duane A.
    Rajabi, Amin
    ENERGIES, 2024, 17 (09)
  • [23] A Deep Reinforcement Learning Real-Time Recommendation Model Based on Long and Short-Term Preference
    Yan-e Hou
    Wenbo Gu
    WeiChuan Dong
    Lanxue Dang
    International Journal of Computational Intelligence Systems, 16
  • [24] A Deep Reinforcement Learning Real-Time Recommendation Model Based on Long and Short-Term Preference
    Hou, Yan-e
    Gu, Wenbo
    Dong, WeiChuan
    Dang, Lanxue
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2023, 16 (01)
  • [25] Real-time monitoring of forward osmosis membrane fouling in wastewater reuse process performed with a deep learning model
    Im, Sung Ju
    Nguyen Duc Viet
    Jang, Am
    CHEMOSPHERE, 2021, 275
  • [26] Data-driven prediction and control of wastewater treatment process through the combination of convolutional neural network and recurrent neural network
    Guo, Zhiwei
    Du, Boxin
    Wang, Jianhui
    Shen, Yu
    Li, Qiao
    Feng, Dong
    Gao, Xu
    Wang, Heng
    RSC ADVANCES, 2020, 10 (23) : 13410 - 13419
  • [27] Evaluation of uncertain signals' impact on deep reinforcement learning-based real-time control strategy of urban drainage systems
    Zhang, Mofan
    Xu, Zhiwei
    Wang, Yiming
    Zeng, Siyu
    Dong, Xin
    JOURNAL OF ENVIRONMENTAL MANAGEMENT, 2022, 324
  • [28] Transfer learning for occupancy-based HVAC control: A data-driven approach using unsupervised learning of occupancy profiles and deep reinforcement learning
    Esrafilian-Najafabadi, Mohammad
    Haghighat, Fariborz
    ENERGY AND BUILDINGS, 2023, 300
  • [29] Real-time control for fuel-optimal Moon landing based on an interactive deep reinforcement learning algorithm
    Lin Cheng
    Zhenbo Wang
    Fanghua Jiang
    Astrodynamics, 2019, 3 : 375 - 386
  • [30] Real-time control for fuel-optimal Moon landing based on an interactive deep reinforcement learning algorithm
    Cheng, Lin
    Wang, Zhenbo
    Jiang, Fanghua
    ASTRODYNAMICS, 2019, 3 (04) : 375 - 386