Batch process control based on reinforcement learning with segmented prioritized experience replay

被引:2
|
作者
Xu, Chen [1 ]
Ma, Junwei [1 ]
Tao, Hongfeng [1 ]
机构
[1] Jiangnan Univ, Key Lab Adv Proc Control Light Ind, Minist Educ, Wuxi 214122, Peoples R China
基金
中国国家自然科学基金;
关键词
reinforcement learning; batch process; soft actor-critic; priority experience replay; maximum entropy framework;
D O I
10.1088/1361-6501/ad21cf
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Batch process is difficult to control accurately due to their complex nonlinear dynamics and unstable operating conditions. The traditional methods such as model predictive control, will seriously affect control performance when process model is inaccurate. In contrast, reinforcement learning (RL) provides an viable alternative by interacting directly with the environment to learn optimal strategy. This paper proposes a batch process controller based on the segmented prioritized experience replay (SPER) soft actor-critic (SAC). SAC combines off-policy updates and maximum entropy RL with an actor-critic formulation, which can obtain a more robust control strategy than other RL methods. To improve the efficiency of the experience replay mechanism in tasks with long episodes and multiple phases, a new method of sampling experience called SPER is designed in SAC. In addition, a novel reward function is set for the SPER-SAC based controller to deal with the sparse reward. Finally, the effectiveness of the SPER-SAC based controller for batch process examples is demonstrated by comparing with the conventional RL-based control methods.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Robust experience replay sampling for multi-agent reinforcement learning
    Nicholaus, Isack Thomas
    Kang, Dae-Ki
    PATTERN RECOGNITION LETTERS, 2022, 155 : 135 - 142
  • [32] Intrusion Detection Based on Adaptive Sample Distribution Dual-Experience Replay Reinforcement Learning
    Tan, Haonan
    Wang, Le
    Zhu, Dong
    Deng, Jianyu
    MATHEMATICS, 2024, 12 (07)
  • [33] Invariant Transform Experience Replay: Data Augmentation for Deep Reinforcement Learning
    Lin, Yijiong
    Huang, Jiancong
    Zimmer, Matthieu
    Guan, Yisheng
    Rojas, Juan
    Weng, Paul
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (04) : 6615 - 6622
  • [34] Enhanced Off-Policy Reinforcement Learning With Focused Experience Replay
    Kong, Seung-Hyun
    Nahrendra, I. Made Aswin
    Paek, Dong-Hee
    IEEE ACCESS, 2021, 9 (09): : 93152 - 93164
  • [35] AI-based optimal control of fed-batch biopharmaceutical process leveraging deep reinforcement learning
    Li, Haoran
    Qiu, Tong
    You, Fengqi
    CHEMICAL ENGINEERING SCIENCE, 2024, 292
  • [36] Multi-objective reinforcement learning for fed-batch fermentation process control
    Li, Dazi
    Zhu, Fuqiang
    Wang, Xiao
    Jin, Qibing
    JOURNAL OF PROCESS CONTROL, 2022, 115 : 89 - 99
  • [37] Control of A Polyol Process Using Reinforcement Learning
    Zhu, Wenbo
    Rendall, Ricardo
    Castillo, Ivan
    Wang, Zhenyu
    Chiang, Leo H.
    Hayot, Philippe
    Romagnoli, Jose A.
    IFAC PAPERSONLINE, 2021, 54 (03): : 498 - 503
  • [38] Batch process modeling for optimization using reinforcement learning
    Martinez, EC
    COMPUTERS & CHEMICAL ENGINEERING, 2000, 24 (2-7) : 1187 - 1193
  • [39] Map-based experience replay: a memory-efficient solution to catastrophic forgetting in reinforcement learning
    Hafez, Muhammad Burhan
    Immisch, Tilman
    Weber, Tom
    Wermter, Stefan
    FRONTIERS IN NEUROROBOTICS, 2023, 17
  • [40] Active and Reactive Power Coordinated Control of Active Distribution Networks Based on Prioritized Reinforcement Learning
    Wang, Xinming
    Liu, Haotian
    Cao, Xin
    Wu, Wenchuan
    Li, Shihui
    Jia, Xiaobu
    2021 POWER SYSTEM AND GREEN ENERGY CONFERENCE (PSGEC), 2021, : 91 - 96