An improved reinforcement learning control strategy for batch processes

被引：0

作者：

Zhang, Peng ^{[1
]}

Zhang, Jie ^{[1
]}

Long, Yang ^{[2
]}

Hu, Bingzhang ^{[3
]}

机构：

[1] Newcastle Univ, Sch Engn, Newcastle Upon Tyne NE1 7RU, Tyne & Wear, England

[2] Univ Durham, Sch Comp, Durham, England

[3] Newcastle Univ, Sch Comp, Newcastle Upon Tyne NE7 7DN, Tyne & Wear, England

来源：

2019 24TH INTERNATIONAL CONFERENCE ON METHODS AND MODELS IN AUTOMATION AND ROBOTICS (MMAR) | 2019年

关键词：

Batch process; optimal control; reinforcement learning;

D O I：

10.1109/mmar.2019.8864632

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Batch processes are significant and essential manufacturing route for the agile manufacturing of high value added products and they are typically difficult to control because of unknown disturbances, model plant mismatches, and highly non-linear characteristic. Traditional one-step reinforcement learning and neural network have been applied to optimize and control batch processes. However, traditional one-step reinforcement learning and the neural network lack accuracy and robustness leading to unsatisfactory performance. To overcome these issues and difficulties, a modified multi-step action Q-learning algorithm (MMSA) based on multiple step action Q-learning (MSA) is proposed in this paper. For MSA, the action space is divided into some periods of same time steps and the same action is explored with fixed greedy policy being applied continuously during a period. Compared with MSA, the modification of MMSA is that the exploration and selection of action will follow an improved and various greedy policy in the whole system time which can improve the flexibility and speed of the learning algorithm. The proposed algorithm is applied to a highly nonlinear batch process and it is shown giving better control performance than the traditional one-step reinforcement learning and MSA.

引用

页码：360 / 365

页数：6

共 50 条

[41] An integrated iterative learning control strategy with model identification and dynamic R-parameter for batch processes
Jia, Li
Yang, Tian
Chiu, Minsen
JOURNAL OF PROCESS CONTROL, 2013, 23 (09) : 1332 - 1341
[42] Tracking control for batch processes through integrating batch-to-batch iterative learning control and within-batch on-line control
Xiong, ZH
Zhang, J
Wang, X
Xu, YM
INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2005, 44 (11) : 3983 - 3992
[43] A active vibration control strategy based on reinforcement learning
Zhou J.
Dong L.
Meng C.
Sun H.
Dong, Longlei, 1600, Chinese Vibration Engineering Society (40): : 281 - 286
[44] A Reinforcement Learning Approach to Health Aware Control Strategy
Jha, Mayank S.
Weber, Philippe
Theilliol, Didier
Ponsart, Jean-Christophe
Maquin, Didier
2019 27TH MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION (MED), 2019, : 171 - 176
[45] Control of Gene Regulatory Networks Basin of Attractions with Batch Reinforcement Learning
Hayama Nishida, Cyntia Eico
Reali Costa, Anna Helena
da Costa Bianchi, Reinaldo Augusto
2018 7TH BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), 2018, : 127 - 132
[46] Parameterized Batch Reinforcement Learning for Longitudinal Control of Autonomous Land Vehicles
Huang, Zhenhua
Xu, Xin
He, Haibo
Tan, Jun
Sun, Zhenping
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2019, 49 (04): : 730 - 741
[47] Control-Informed Reinforcement Learning for Chemical Processes
Bloor, Maximilian
Ahmed, Akhil
Kotecha, Niki
Mercangoz, Mehmet
Tsay, Calvin
del Rio-Chanona, Ehecatl Antonio
INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2025, 64 (09) : 4966 - 4978
[48] yBypass chromatography - design and analysis of an improved strategy for operating batch chromatography processes
Siitonen, Jani
Sainio, Tuorno
Rajendran, Arvind
JOURNAL OF CHROMATOGRAPHY A, 2012, 1230 : 77 - 92
[49] An Improved Statistical Modeling Strategy by Spectroscopy for Online Monitoring and Diagnosis of Batch Processes
Zhao, Chunhui
Gao, Furong
Liu, Tao
Wang, Fuli
ASCC: 2009 7TH ASIAN CONTROL CONFERENCE, VOLS 1-3, 2009, : 893 - 898
[50] A frequency cooperative control strategy for multimicrogrids with EVs based on improved evolutionary-deep reinforcement learning
Fan, Peixiao
Ke, Song
Yang, Jun
Wen, Yuxin
Xie, Lilong
Li, Yonghui
Kamel, Salah
INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2024, 159

← 1 2 3 4 5 →