Iterative Learning Control (ILC) Guided Reinforcement Learning Control (RLC) Scheme for Batch Processes

被引:0
|
作者
Xu, Xinghai [1 ]
Xie, Huimin [1 ]
Shi, Jia [1 ]
机构
[1] Xiamen Univ, Dept Chem & Biochem Engn, Sch Chem & Chem Engn, Xiamen 361005, Fujian, Peoples R China
来源
PROCEEDINGS OF 2020 IEEE 9TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE (DDCLS'20) | 2020年
关键词
Iterative learning control (ILC); Deep reinforcement learning; Batch/repetitive processes;
D O I
10.1109/ddcls49620.2020.9275065
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Iterative learning control (ILC) is a kind of effective learning control scheme which is mainly designed to solve the problems in controlling a batch or repetitive process. Although the control performances of ILC systems can improved from batch to batch, it still strongly depends on the repeatability of the process and control target. Reinforcement learning (RL) is another learning based optimization algorithm which can be applied to many complicated decision-making scenarios. Data-driven based RL algorithms have good robustness due to the generalization of the policy neural network, however, it is low-data efficiency in network training. In this paper, for batch process control we propose a new reinforcement learning control (RLC) scheme which is guided by classical iterative leaning control. On the one hand, this RLC scheme has capability to optimize the policy network faster than RL algorithm without guidance, on the other hand, the generalization of deep policy network improves the robustness of the control system. Based on the numerical simulations, the effectiveness of the proposed control scheme is demonstrated by comparing with the conventional reinforcement learning algorithm and the P-type iterative learning control scheme. This paper provides a new way for the application of reinforcement learning algorithm to batch process control.
引用
收藏
页码:241 / 246
页数:6
相关论文
共 50 条
  • [41] A Cascade Iterative Learning Control Scheme for a Class of Repetitive Cascaded Processes
    Shi, Jia
    Zhou, Hua
    Cao, Zhikai
    Jiang, Qingyin
    2014 13TH INTERNATIONAL CONFERENCE ON CONTROL AUTOMATION ROBOTICS & VISION (ICARCV), 2014, : 1449 - 1454
  • [42] A robust integrated model predictive iterative learning control strategy for batch processes
    Liuming ZHOU
    Li JIA
    Yu-Long WANG
    Science China(Information Sciences), 2019, 62 (11) : 212 - 214
  • [43] Iterative learning model predictive control for multi-phase batch processes
    Wang, Youqing
    Zhou, Donghua
    Gao, Furong
    JOURNAL OF PROCESS CONTROL, 2008, 18 (06) : 543 - 557
  • [44] Guaranteed Cost Iterative Learning Control for Multi-Phase Batch Processes
    王立敏
    王润泽
    熊玉婷
    王浩森
    朱琳
    张可
    高福荣
    Journal of Shanghai Jiaotong University(Science), 2018, 23 (06) : 811 - 819
  • [45] A robust integrated model predictive iterative learning control strategy for batch processes
    Liuming Zhou
    Li Jia
    Yu-Long Wang
    Science China Information Sciences, 2019, 62
  • [46] Policy Poisoning in Batch Reinforcement Learning and Control
    Ma, Yuzhe
    Zhang, Xuezhou
    Sun, Wen
    Zhu, Xiaojin
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [47] Probability Based Stochastic Iterative Learning Control for Batch Processes With Actuator Faults
    Wang, Limin
    Li, Bingyun
    IEEE ACCESS, 2019, 7 : 141466 - 141475
  • [48] A robust integrated model predictive iterative learning control strategy for batch processes
    Zhou, Liuming
    Jia, Li
    Wang, Yu-Long
    SCIENCE CHINA-INFORMATION SCIENCES, 2019, 62 (11)
  • [49] An Anticipatory Terminal Iterative Learning Control Approach with Applications to Constrained Batch Processes
    Chi Ronghu
    Zhang Dexia
    Liu Ximei
    Hou Zhongsheng
    Jin Shangtai
    CHINESE JOURNAL OF CHEMICAL ENGINEERING, 2013, 21 (03) : 271 - 275
  • [50] Iterative Learning Control of Batch Processes Using Adaptive Differential Evolution Algorithm
    Zhang, Guohua
    Yuan, Kai
    Jia, Li
    SYSTEMS SIMULATION AND SCIENTIFIC COMPUTING, PT I, 2012, 326 : 297 - 303