Batch process control based on reinforcement learning with segmented prioritized experience replay

被引:2
|
作者
Xu, Chen [1 ]
Ma, Junwei [1 ]
Tao, Hongfeng [1 ]
机构
[1] Jiangnan Univ, Key Lab Adv Proc Control Light Ind, Minist Educ, Wuxi 214122, Peoples R China
基金
中国国家自然科学基金;
关键词
reinforcement learning; batch process; soft actor-critic; priority experience replay; maximum entropy framework;
D O I
10.1088/1361-6501/ad21cf
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Batch process is difficult to control accurately due to their complex nonlinear dynamics and unstable operating conditions. The traditional methods such as model predictive control, will seriously affect control performance when process model is inaccurate. In contrast, reinforcement learning (RL) provides an viable alternative by interacting directly with the environment to learn optimal strategy. This paper proposes a batch process controller based on the segmented prioritized experience replay (SPER) soft actor-critic (SAC). SAC combines off-policy updates and maximum entropy RL with an actor-critic formulation, which can obtain a more robust control strategy than other RL methods. To improve the efficiency of the experience replay mechanism in tasks with long episodes and multiple phases, a new method of sampling experience called SPER is designed in SAC. In addition, a novel reward function is set for the SPER-SAC based controller to deal with the sparse reward. Finally, the effectiveness of the SPER-SAC based controller for batch process examples is demonstrated by comparing with the conventional RL-based control methods.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Exploring a Reinforcement Learning Agent with Improved Prioritized Experience Replay for a Confrontation Game
    Zhao, Tian
    2022 INTERNATIONAL CONFERENCE ON BIG DATA, INFORMATION AND COMPUTER NETWORK (BDICN 2022), 2022, : 373 - 381
  • [2] Prioritized experience replay based on dynamics priority
    Li, Hu
    Qian, Xuezhong
    Song, Wei
    SCIENTIFIC REPORTS, 2024, 14 (01)
  • [3] Traffic navigation via reinforcement learning with episodic-guided prioritized experience replay
    Hassani, Hossein
    Nikan, Soodeh
    Shami, Abdallah
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 137
  • [4] A transferable deep reinforcement learning high-speed railway rescheduling method based on prioritized experience replay
    Dai X.-W.
    Wu Y.
    Shi Q.
    Cui D.-L.
    Yu S.-P.
    Kongzhi yu Juece/Control and Decision, 2023, 38 (08): : 2375 - 2388
  • [5] Reinforcement learning for batch process control: Review and perspectives
    Yoo, Haeun
    Byun, Ha Eun
    Han, Dongho
    Lee, Jay H.
    ANNUAL REVIEWS IN CONTROL, 2021, 52 : 108 - 119
  • [6] Autonomous reinforcement learning with experience replay
    Wawrzynski, Pawel
    Tanwani, Ajay Kumar
    NEURAL NETWORKS, 2013, 41 : 156 - 167
  • [7] An Experience Replay Method Based on Tree Structure for Reinforcement Learning
    Jiang, Wei-Cheng
    Hwang, Kao-Shing
    Lin, Jin-Ling
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2021, 9 (02) : 972 - 982
  • [8] Q-value-based experience replay in reinforcement learning
    Zhang, Zihong
    Li, Ruijia
    KNOWLEDGE-BASED SYSTEMS, 2025, 315
  • [9] Reinforcement learning based control of batch polymerisation processes
    Singh, Vikas
    Kodamana, Hariprasad
    IFAC PAPERSONLINE, 2020, 53 (01): : 667 - 672
  • [10] Trial and Error Experience Replay Based Deep Reinforcement Learning
    Zhang, Cheng
    Ma, Liang
    4TH IEEE INTERNATIONAL CONFERENCE ON SMART CLOUD (SMARTCLOUD 2019) / 3RD INTERNATIONAL SYMPOSIUM ON REINFORCEMENT LEARNING (ISRL 2019), 2019, : 221 - 226