Nonintrusive parameter adaptation of chemical process models with reinforcement learning

被引:6
作者
Alhazmi, Khalid [1 ]
Sarathy, S. Mani [1 ]
机构
[1] King Abdullah Univ Sci & Technol KAUST, Clean Combust Res Ctr CCRC, Phys Sci & Engn Div PSE, Thuwal 239556900, Saudi Arabia
关键词
Nonlinear system identification; Parameter estimation; Reinforcement learning; CLOSED-LOOP IDENTIFICATION; PACKED-BED REACTOR; OPTIMIZATION; BEHAVIOR; HYDROGENATION; ETHENE; ETHYNE; VIEW;
D O I
10.1016/j.jprocont.2023.02.001
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Model-based control is one of the most prevalent techniques for designing and controlling engineering systems. However, many of these systems are complex and characterized by changing dynamics. Hence, online system identification is required to achieve optimum adaptive control performance for such complex systems. This work proposes an algorithm for nonintrusive, online, nonlinear parameter estimation of physical models using deep reinforcement learning (RL). The problem of training a neural network for parameter estimation is formulated as a reinforcement learning problem. The RL-based parameter estimation policy is tested on a simulation of the selective hydrogenation of acetylene, which is a highly nonlinear system. The learned model estimation policy is able to correctly predict the states of the system with a prediction error of less than 1% in various conditions, such as in the presence of measurement noise and structural differences in models.(c) 2023 Elsevier Ltd. All rights reserved.
引用
收藏
页码:87 / 95
页数:9
相关论文
共 31 条
[1]   Model-Based Optimization of an Acetylene Hydrogenation Reactor To Improve Overall Ethylene Plant Economics [J].
Aeowjaroenlap, Hattachai ;
Chotiwiriyakun, Kritsada ;
Tiensai, Nattawat ;
Tanthapanichakoon, Wiwut ;
Spatenka, Stepan ;
Cano, Alejandro .
INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2018, 57 (30) :9943-9951
[2]  
Astrom Karl J, 2013, Adaptive Control
[3]   Closed-loop identification using direct approach and high order ARX/GOBF-ARX models [J].
Badwe, Abhijit S. ;
Patwardhan, Sachin C. ;
Gudi, Ravindra D. .
JOURNAL OF PROCESS CONTROL, 2011, 21 (07) :1056-1071
[4]  
Bertsekas D., 2000, DYNAMIC PROGRAMMING, V1
[5]  
Bertsekas D. P., 2019, REINFORCEMENT LEARNI
[6]   Particle Swarm Optimization for Single Objective Continuous Space Problems: A Review [J].
Bonyadi, Mohammad Reza ;
Michalewicz, Zbigniew .
EVOLUTIONARY COMPUTATION, 2017, 25 (01) :1-54
[7]   Selective hydrogenation of ethyne in ethene-rich streams on palladium catalysts. Part 1. Effect of changes to the catalyst during reaction [J].
Borodzinki, Andrzej .
CATALYSIS REVIEWS-SCIENCE AND ENGINEERING, 2006, 48 (02) :91-144
[8]   A KINETIC-STUDY OF THE HYDROGENATION OF ETHYNE AND ETHENE ON A COMMERCIAL PD/AL2O3 CATALYST [J].
BOS, ANR ;
BOOTSMA, ES ;
FOETH, F ;
SLEYSTER, HWJ ;
WESTERTERP, KR .
CHEMICAL ENGINEERING AND PROCESSING-PROCESS INTENSIFICATION, 1993, 32 (01) :53-63
[9]   BEHAVIOR OF AN ADIABATIC PACKED-BED REACTOR .2. MODELING [J].
BOS, ANR ;
VANDEBELD, L ;
MARTENS, HJ ;
WESTERTERP, KR .
CHEMICAL ENGINEERING COMMUNICATIONS, 1993, 121 :55-80
[10]   BEHAVIOR OF AN ADIABATIC PACKED-BED REACTOR .1. EXPERIMENTAL-STUDY [J].
BOS, ANR ;
VANDEBELD, L ;
OVERKAMP, JB ;
WESTERTERP, KR .
CHEMICAL ENGINEERING COMMUNICATIONS, 1993, 121 :27-53