A Deep Reinforcement Learning Approach to Improve the Learning Performance in Process Control

被引:45
作者
Bao, Yaoyao [1 ]
Zhu, Yuanming [1 ]
Qian, Feng [1 ]
机构
[1] East China Univ Sci & Technol, Key Lab Smart Mfg Energy Chem Proc, Minist Educ, Shanghai 200237, Peoples R China
基金
中国国家自然科学基金;
关键词
PREDICTIVE CONTROL;
D O I
10.1021/acs.iecr.0c05678
中图分类号
TQ [化学工业];
学科分类号
0817 ;
摘要
Advanced model-based control methods have been widely used in industrial process control, but excellent performance requires regular maintenance of its model. Reinforcement learning can online update its policy through the observed data by interacting with the environment. Since a fast and stable learning process is required to improve the adaptability of the controller, we propose an improved deep deterministic actor critic predictor in this paper, where the immediate reward is separated from the action-value function to provide the actor with reliable gradient information at early stages. Then, an expectation form of policy gradient is developed based on the assumption that the state obeys the normal distribution. Simulation results show that the proposed algorithm achieves a more stable and faster learning procedure than those state-of-art deep reinforcement learning (DRL) algorithms. Meanwhile, the obtained policy achieves a more advantageous performance than the fine-tuned proportional integral and derivative (PID) and linear model predictive controllers, especially for those processes with nonlinearity. These indicate that the improved DRL controller has the potential to become an important tool in practical applications.
引用
收藏
页码:5504 / 5515
页数:12
相关论文
共 38 条
[1]  
Alp E. C., 2019, PLAYING FLAPPY BIRD
[2]  
Andersen R. E., 2019, 29 INT C FLEX AUT IN, V2019
[3]  
[Anonymous], ABS151203385 CORR
[4]  
[Anonymous], 2016, INT C MACH LEARN ICM
[5]  
[Anonymous], 2015, RXIV150902971
[6]   Reinforcement learning for control: Performance, stability, and deep approximators [J].
Busoniu, Lucian ;
de Bruin, Tim ;
Tolic, Domagoj ;
Kober, Jens ;
Palunko, Ivana .
ANNUAL REVIEWS IN CONTROL, 2018, 46 :8-28
[7]   Hierarchical decompositions for MPC of resource constrained control systems: applications to building energy management [J].
Camponogara, Eduardo ;
Scherer, Helton ;
Biegler, Lorenz ;
Grossmann, Ignacio .
OPTIMIZATION AND ENGINEERING, 2021, 22 (01) :187-215
[8]   Status of Flue Gas Desulphurisation (FGD) systems from coal-fired power plants: Overview of the physic-chemical control processes of wet limestone FGDs [J].
Cordoba, Patricia .
FUEL, 2015, 144 :274-286
[9]  
Fujimoto S, 2018, PR MACH LEARN RES, V80
[10]   Anodization control for barrier-oxide thinning and 3D interconnected pores and direct electrodeposition of nanowire networks on native aluminium substrates [J].
Gillette, Eleanor ;
Wittenberg, Stefanie ;
Graham, Lauren ;
Lee, Kwijong ;
Rubloff, Gary ;
Banerjee, Parag ;
Lee, Sang Bok .
PHYSICAL CHEMISTRY CHEMICAL PHYSICS, 2015, 17 (05) :3873-3879