Model-free output feedback optimal tracking control for two-dimensional batch processes

被引:8
作者
Shi, Huiyuan [1 ,2 ,3 ]
Ma, Jiayue [1 ]
Liu, Qiang [1 ]
Li, Jinna [1 ]
Jiang, Xueying [1 ,2 ]
Li, Ping [1 ,3 ]
机构
[1] Liaoning Petrochem Univ, Sch Informat & Control Engn, Fushun 113001, Peoples R China
[2] Northeastern Univ, State Key Lab Synthet Automat Proc Ind, Shenyang 110819, Peoples R China
[3] Univ Sci & Technol Liaoning, Sch Elect & Informat Engn, Anshan 114051, Peoples R China
基金
中国国家自然科学基金;
关键词
Batch process; Two-dimensional; Model-free; Output feedback; Reinforcement learning; ITERATIVE LEARNING CONTROL; H-INFINITY CONTROL; PREDICTIVE CONTROL; ADAPTIVE-CONTROL; SYSTEMS; DESIGN;
D O I
10.1016/j.engappai.2024.109989
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To address the challenge of linear quadratic tracking for batch processes without taking into account the specifics regarding the system dynamics and the state observer, a two-dimensional off-policy model-free output feedback control method is proposed. Previous solutions have mostly failed to fully explore the historical input and output information of the system along time and batch dimensions, thereby constraining the flexibility and efficiency of the controller. However, the method proposed aims to reconstruct the state space by delving into these historical input and output data of the system. The correlation between the two-dimensional value function and the twodimensional Q-function with output feedback characteristics is found, thereby revealing the corresponding twodimensional Bellman equation. This method, through the use of reinforcement learning algorithms, can effectively learn the optimal control strategy without the need for prior knowledge of system dynamics. To this end, the proposed method not only converges faster but also has smaller errors, making it more suitable for complex and ever-changing industrial production processes in reality. This innovative research provides new ideas and methods for the design and optimization of control systems, which is expected to bring higher efficiency and more stable performance to industrial production. At last, simulation outcomes for the injection velocity process validate the suggested method's effectiveness.
引用
收藏
页数:19
相关论文
共 37 条
[1]   A Combined Reinforcement Learning and Sliding Mode Control Scheme for Grid Integration of a PV System [J].
Bag, Aurobinda ;
Subudhi, Bidyadhar ;
Ray, Pravat Kumar .
CSEE JOURNAL OF POWER AND ENERGY SYSTEMS, 2019, 5 (04) :498-506
[2]   Design and Analysis of Integrated Predictive Iterative Learning Control for Batch Process Based on Two-dimensional System Theory [J].
Chen, Chen ;
Xiong, Zhihua ;
Zhong, Yisheng .
CHINESE JOURNAL OF CHEMICAL ENGINEERING, 2014, 22 (07) :762-768
[3]   Adaptive control of delayed integrating systems: A PVC batch reactor [J].
Huzmezan, M ;
Dumont, GA ;
Gough, WA ;
Kovac, S .
IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2003, 11 (03) :390-398
[4]   Improved model-free H∞ control for batch processes via off-policy 2D game Q-learning [J].
Jiang, Xueying ;
Huang, Min ;
Kuang, Hanbin ;
Shi, Huiyuan ;
Wang, Xingwei ;
Lee, Loo Hay .
INTERNATIONAL JOURNAL OF CONTROL, 2023, 96 (10) :2447-2463
[5]   Optimal Output Regulation of Linear Discrete-Time Systems With Unknown Dynamics Using Reinforcement Learning [J].
Jiang, Yi ;
Kiumarsi, Bahare ;
Fan, Jialu ;
Chai, Tianyou ;
Li, Jinna ;
Lewis, Frank L. .
IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (07) :3147-3156
[6]   Kinodynamic Motion Planning With Continuous-Time Q-Learning: An Online, Model-Free, and Safe Navigation Framework [J].
Kontoudis, George P. ;
Vamvoudakis, Kyriakos G. .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (12) :3803-3817
[7]  
Lakshmanan NM, 1999, INT J CONTROL, V72, P659, DOI 10.1080/002071799220849
[8]   Model predictive control for batch processes: Ensuring validity of predictions [J].
Lauri, D. ;
Lennox, B. ;
Camacho, J. .
JOURNAL OF PROCESS CONTROL, 2014, 24 (01) :239-249
[9]   Model predictive control technique combined with iterative learning for batch processes [J].
Lee, KS ;
Chin, IS ;
Lee, HJ ;
Lee, JH .
AICHE JOURNAL, 1999, 45 (10) :2175-2187
[10]   Model predictive control for nonlinear batch processes with asymptotically perfect tracking [J].
Lee, KS ;
Lee, JH .
COMPUTERS & CHEMICAL ENGINEERING, 1997, 21 :S873-S879