Intelligent Control of Wastewater Treatment Plants Based on Model-Free Deep Reinforcement Learning

被引：11

作者：

Aponte-Rengifo, Oscar ^{[1
]}

Francisco, Mario ^{[1
]}

Vilanova, Ramon ^{[2
]}

Vega, Pastora ^{[1
]}

Revollar, Silvana ^{[1
]}

机构：

[1] Univ Salamanca, Fac Sci, Dept Comp Sci & Automat, Plaza Merced S-N, Salamanca 37008, Spain

[2] Autonomous Univ Barcelona, Dept Automat Syst & Adv Control Res, Barcelona 08193, Spain

来源：

PROCESSES | 2023年 / 11卷 / 08期

关键词：

intelligent control; model-free deep reinforcement learning; reusing policy; waste water treatment plant; DISSOLVED-OXYGEN CONTROL; SIMULATION; OPERATION;

D O I：

10.3390/pr11082269

中图分类号：

TQ [化学工业];

学科分类号：

0817 ;

摘要：

In this work, deep reinforcement learning methodology takes advantage of transfer learning methodology to achieve a reasonable trade-off between environmental impact and operating costs in the activated sludge process of Wastewater treatment plants (WWTPs). WWTPs include complex nonlinear biological processes, high uncertainty, and climatic disturbances, among others. The dynamics of complex real processes are difficult to accurately approximate by mathematical models due to the complexity of the process itself. Consequently, model-based control can fail in practical application due to the mismatch between the mathematical model and the real process. Control based on the model-free reinforcement deep learning (RL) methodology emerges as an advantageous method to arrive at suboptimal solutions without the need for mathematical models of the real process. However, convergence of the RL method to a reasonable control for complex processes is data-intensive and time-consuming. For this reason, the RL method can use the transfer learning approach to cope with this inefficient and slow data-driven learning. In fact, the transfer learning method takes advantage of what has been learned so far so that the learning process to solve a new objective does not require so much data and time. The results demonstrate that cumulatively achieving conflicting objectives can efficiently be used to approach the control of complex real processes without relying on mathematical models.

引用

页数：25

共 39 条

[1]

Rusu AA, 2016, Arxiv, DOI [arXiv:1606.04671, DOI 10.43550/ARXIV:1606.04671, DOI 10.48550/ARXIV.1606.04671]

[2]

Agarwal Alekh, 2020, P MACHINE LEARNING R, V125

[3]

Ahansazan B., 2014, INT J ENV SCI DEV, V5, P81, DOI DOI 10.7763/IJESD.2014.V5.455

[4]

Ammar Haitham Bou, 2012, Adaptive and Learning Agents. International Workshop, ALA 2011 Held at AAMAS 2011. Revised Selected Papers, P21, DOI 10.1007/978-3-642-28499-1_2

[5]

Bertsekas D. P., 2019, algorithm for optimal control with integral reinforcement learn

[6] Optimal control towards sustainable wastewater treatment plants based on multi-agent reinforcement learning [J].

Chen, Kehua ;

Wang, Hongcheng ;

Valverde-Perez, Borja ;

Zhai, Siyuan ;

Vezzaro, Luca ;

Wang, Aijie .

CHEMOSPHERE, 2021, 279 (279)

[7] Transforming data into knowledge for improved wastewater treatment operation: A critical review of techniques [J].

Corominas, Ll. ;

Garrido-Baserba, M. ;

Villez, K. ;

Olsson, G. ;

Cortes, U. ;

Poch, M. .

ENVIRONMENTAL MODELLING & SOFTWARE, 2018, 106 :89-103

[8]

Czarnecki WM, 2019, PR MACH LEARN RES, V89

[9]

Devlin S., 2012, P INT C AUT AG MULT, P433

[10] Online reinforcement learning for a continuous space system with experimental validation [J].

Dogru, Oguzhan ;

Wieczorek, Nathan ;

Velswamy, Kirubakaran ;

Ibrahim, Fadi ;

Huang, Biao .

JOURNAL OF PROCESS CONTROL, 2021, 104 (104) :86-100

← 1 2 3 4 →