Batch process control based on reinforcement learning with segmented prioritized experience replay

被引：2

作者：

Xu, Chen ^{[1
]}

Ma, Junwei ^{[1
]}

Tao, Hongfeng ^{[1
]}

机构：

[1] Jiangnan Univ, Key Lab Adv Proc Control Light Ind, Minist Educ, Wuxi 214122, Peoples R China

来源：

MEASUREMENT SCIENCE AND TECHNOLOGY | 2024年 / 35卷 / 05期

基金：

中国国家自然科学基金;

关键词：

reinforcement learning; batch process; soft actor-critic; priority experience replay; maximum entropy framework;

D O I：

10.1088/1361-6501/ad21cf

中图分类号：

T [工业技术];

学科分类号：

08 ;

摘要：

Batch process is difficult to control accurately due to their complex nonlinear dynamics and unstable operating conditions. The traditional methods such as model predictive control, will seriously affect control performance when process model is inaccurate. In contrast, reinforcement learning (RL) provides an viable alternative by interacting directly with the environment to learn optimal strategy. This paper proposes a batch process controller based on the segmented prioritized experience replay (SPER) soft actor-critic (SAC). SAC combines off-policy updates and maximum entropy RL with an actor-critic formulation, which can obtain a more robust control strategy than other RL methods. To improve the efficiency of the experience replay mechanism in tasks with long episodes and multiple phases, a new method of sampling experience called SPER is designed in SAC. In addition, a novel reward function is set for the SPER-SAC based controller to deal with the sparse reward. Finally, the effectiveness of the SPER-SAC based controller for batch process examples is demonstrated by comparing with the conventional RL-based control methods.

引用

页数：12

共 50 条

[31] Robust experience replay sampling for multi-agent reinforcement learning
Nicholaus, Isack Thomas
Kang, Dae-Ki
PATTERN RECOGNITION LETTERS, 2022, 155 : 135 - 142
[32] Intrusion Detection Based on Adaptive Sample Distribution Dual-Experience Replay Reinforcement Learning
Tan, Haonan
Wang, Le
Zhu, Dong
Deng, Jianyu
MATHEMATICS, 2024, 12 (07)
[33] Invariant Transform Experience Replay: Data Augmentation for Deep Reinforcement Learning
Lin, Yijiong
Huang, Jiancong
Zimmer, Matthieu
Guan, Yisheng
Rojas, Juan
Weng, Paul
IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (04) : 6615 - 6622
[34] Enhanced Off-Policy Reinforcement Learning With Focused Experience Replay
Kong, Seung-Hyun
Nahrendra, I. Made Aswin
Paek, Dong-Hee
IEEE ACCESS, 2021, 9 (09): : 93152 - 93164
[35] AI-based optimal control of fed-batch biopharmaceutical process leveraging deep reinforcement learning
Li, Haoran
Qiu, Tong
You, Fengqi
CHEMICAL ENGINEERING SCIENCE, 2024, 292
[36] Multi-objective reinforcement learning for fed-batch fermentation process control
Li, Dazi
Zhu, Fuqiang
Wang, Xiao
Jin, Qibing
JOURNAL OF PROCESS CONTROL, 2022, 115 : 89 - 99
[37] Control of A Polyol Process Using Reinforcement Learning
Zhu, Wenbo
Rendall, Ricardo
Castillo, Ivan
Wang, Zhenyu
Chiang, Leo H.
Hayot, Philippe
Romagnoli, Jose A.
IFAC PAPERSONLINE, 2021, 54 (03): : 498 - 503
[38] Batch process modeling for optimization using reinforcement learning
Martinez, EC
COMPUTERS & CHEMICAL ENGINEERING, 2000, 24 (2-7) : 1187 - 1193
[39] Map-based experience replay: a memory-efficient solution to catastrophic forgetting in reinforcement learning
Hafez, Muhammad Burhan
Immisch, Tilman
Weber, Tom
Wermter, Stefan
FRONTIERS IN NEUROROBOTICS, 2023, 17
[40] Active and Reactive Power Coordinated Control of Active Distribution Networks Based on Prioritized Reinforcement Learning
Wang, Xinming
Liu, Haotian
Cao, Xin
Wu, Wenchuan
Li, Shihui
Jia, Xiaobu
2021 POWER SYSTEM AND GREEN ENERGY CONFERENCE (PSGEC), 2021, : 91 - 96

← 1 2 3 4 5 →