Novel data-driven two-dimensional Q-learning for optimal tracking control of batch process with unknown dynamics

被引:18
|
作者
Wen, Xin [1 ]
Shi, Huiyuan [1 ,2 ,3 ]
Su, Chengli [1 ,4 ,7 ]
Jiang, Xueying [5 ]
Li, Ping [1 ,4 ]
Yu, Jingxian [6 ]
机构
[1] Liaoning Petrochem Univ, Sch Informat & Control Engn, Fushun, Peoples R China
[2] Northwestern Polytech Univ, Sch Automat, Xian, Peoples R China
[3] Northeastern Univ, State Key Lab Synthet Automat Proc Ind, Shenyang, Peoples R China
[4] Univ Sci & Technol Liaoning, Sch Elect & Informat Engn, Anshan, Peoples R China
[5] Northeastern Univ, Sch Informat Sci & Engn, Shenyang, Peoples R China
[6] Liaoning Petrochem Univ, Sch Sci, Fushun, Peoples R China
[7] Liaoning Petrochem Univ, Sch Informat & Control Engn, Fushun 113001, Peoples R China
基金
中国国家自然科学基金;
关键词
Batchprocess; Data-driven; 2Doff-policyQ-learning; Optimaltrackingcontrol; Injectionmolding; MODEL PREDICTIVE CONTROL; FAULT-TOLERANT CONTROL; STATE DELAY; DESIGN; FEEDBACK;
D O I
10.1016/j.isatra.2021.06.007
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In view that the previous control methods usually rely too much on the models of batch process and have difficulty in a practical batch process with unknown dynamics, a novel data-driven twodimensional (2D) off-policy Q-learning approach for optimal tracking control (OTC) is proposed to make the batch process obtain a model-free control law. Firstly, an extended state space equation composing of the state and output error is established for ensuring tracking performance of the designed controller. Secondly, the behavior policy of generating data and the target policy of optimization as well as learning is introduced based on this extended system. Then, the Bellman equation independent of model parameters is given via analyzing the relation between 2D value function and 2D Q-function. The measured data along the batch and time directions of batch process are just taken to carry out the policy iteration, which can figure out the optimal control problem despite lacking systematic dynamic information. The unbiasedness and convergence of the designed 2D off-policy Q-learning algorithm are proved. Finally, a simulation case for injection molding process manifests that control effect and tracking effect gradually become better with the increasing number of batches.(c) 2021 ISA. Published by Elsevier Ltd. All rights reserved.
引用
收藏
页码:10 / 21
页数:12
相关论文
共 50 条
  • [21] Data-Driven Adaptive Tracking Control of Unknown Autonomous Marine Vehicles
    Weng, Yongpeng
    Wang, Ning
    Qin, Hongde
    Karimi, Hamid Reza
    Qi, Wenhai
    IEEE ACCESS, 2018, 6 : 55723 - 55730
  • [22] Two-Dimensional Model-Free Optimal Tracking Control for Batch Processes With Packet Loss
    Shi, Huiyuan
    Wen, Xin
    Jiang, Xueying
    Su, Chengli
    IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2023, 10 (02): : 1032 - 1045
  • [23] Constrained data-driven optimal iterative learning control
    Chi, Ronghu
    Liu, Xiaohe
    Zhang, Ruikun
    Hou, Zhongsheng
    Huang, Biao
    JOURNAL OF PROCESS CONTROL, 2017, 55 : 10 - 29
  • [24] Two-dimensional generalized predictive control (2D-GPC) scheme for the batch processes with two-dimensional (2D) dynamics
    Shi, Jia
    Yang, Bo
    Cao, Zhikai
    Zhou, Hua
    Yang, Yi
    MULTIDIMENSIONAL SYSTEMS AND SIGNAL PROCESSING, 2015, 26 (04) : 941 - 966
  • [25] The Convergence of Data-Driven Optimal Iterative Learning Control for Linear Multi-Phase Batch Processes
    Geng, Yan
    Wang, Shouqin
    Ruan, Xiaoe
    MATHEMATICS, 2022, 10 (13)
  • [26] Data-driven optimal PID type ILC for a class of nonlinear batch process
    Memon, Furqan
    Shao, Cheng
    INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2021, 52 (02) : 263 - 276
  • [27] Data-Driven Approximated Optimal Control of Sulfur Flotation Process
    He, Mingfang
    COMPLEXITY, 2019, 2019
  • [28] Design and Analysis of Integrated Predictive Iterative Learning Control for Batch Process Based on Two-dimensional System Theory
    Chen, Chen
    Xiong, Zhihua
    Zhong, Yisheng
    CHINESE JOURNAL OF CHEMICAL ENGINEERING, 2014, 22 (07) : 762 - 768
  • [29] Optimal Tracking Control of Nonlinear Multiagent Systems Using Internal Reinforce Q-Learning
    Peng, Zhinan
    Luo, Rui
    Hu, Jiangping
    Shi, Kaibo
    Nguang, Sing Kiong
    Ghosh, Bijoy Kumar
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (08) : 4043 - 4055
  • [30] Optimal Tracking Control of Servo Motor Speed Based on Online Supplementary Q-Learning
    Zou X.
    Xiao X.
    He Q.
    Vyacheslav S.
    Diangong Jishu Xuebao/Transactions of China Electrotechnical Society, 2019, 34 (05): : 917 - 923