An improved reinforcement learning control strategy for batch processes

被引:0
|
作者
Zhang, Peng [1 ]
Zhang, Jie [1 ]
Long, Yang [2 ]
Hu, Bingzhang [3 ]
机构
[1] Newcastle Univ, Sch Engn, Newcastle Upon Tyne NE1 7RU, Tyne & Wear, England
[2] Univ Durham, Sch Comp, Durham, England
[3] Newcastle Univ, Sch Comp, Newcastle Upon Tyne NE7 7DN, Tyne & Wear, England
来源
2019 24TH INTERNATIONAL CONFERENCE ON METHODS AND MODELS IN AUTOMATION AND ROBOTICS (MMAR) | 2019年
关键词
Batch process; optimal control; reinforcement learning;
D O I
10.1109/mmar.2019.8864632
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Batch processes are significant and essential manufacturing route for the agile manufacturing of high value added products and they are typically difficult to control because of unknown disturbances, model plant mismatches, and highly non-linear characteristic. Traditional one-step reinforcement learning and neural network have been applied to optimize and control batch processes. However, traditional one-step reinforcement learning and the neural network lack accuracy and robustness leading to unsatisfactory performance. To overcome these issues and difficulties, a modified multi-step action Q-learning algorithm (MMSA) based on multiple step action Q-learning (MSA) is proposed in this paper. For MSA, the action space is divided into some periods of same time steps and the same action is explored with fixed greedy policy being applied continuously during a period. Compared with MSA, the modification of MMSA is that the exploration and selection of action will follow an improved and various greedy policy in the whole system time which can improve the flexibility and speed of the learning algorithm. The proposed algorithm is applied to a highly nonlinear batch process and it is shown giving better control performance than the traditional one-step reinforcement learning and MSA.
引用
收藏
页码:360 / 365
页数:6
相关论文
共 50 条
  • [41] An integrated iterative learning control strategy with model identification and dynamic R-parameter for batch processes
    Jia, Li
    Yang, Tian
    Chiu, Minsen
    JOURNAL OF PROCESS CONTROL, 2013, 23 (09) : 1332 - 1341
  • [42] Tracking control for batch processes through integrating batch-to-batch iterative learning control and within-batch on-line control
    Xiong, ZH
    Zhang, J
    Wang, X
    Xu, YM
    INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2005, 44 (11) : 3983 - 3992
  • [43] A active vibration control strategy based on reinforcement learning
    Zhou J.
    Dong L.
    Meng C.
    Sun H.
    Dong, Longlei, 1600, Chinese Vibration Engineering Society (40): : 281 - 286
  • [44] A Reinforcement Learning Approach to Health Aware Control Strategy
    Jha, Mayank S.
    Weber, Philippe
    Theilliol, Didier
    Ponsart, Jean-Christophe
    Maquin, Didier
    2019 27TH MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION (MED), 2019, : 171 - 176
  • [45] Control of Gene Regulatory Networks Basin of Attractions with Batch Reinforcement Learning
    Hayama Nishida, Cyntia Eico
    Reali Costa, Anna Helena
    da Costa Bianchi, Reinaldo Augusto
    2018 7TH BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), 2018, : 127 - 132
  • [46] Parameterized Batch Reinforcement Learning for Longitudinal Control of Autonomous Land Vehicles
    Huang, Zhenhua
    Xu, Xin
    He, Haibo
    Tan, Jun
    Sun, Zhenping
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2019, 49 (04): : 730 - 741
  • [47] Control-Informed Reinforcement Learning for Chemical Processes
    Bloor, Maximilian
    Ahmed, Akhil
    Kotecha, Niki
    Mercangoz, Mehmet
    Tsay, Calvin
    del Rio-Chanona, Ehecatl Antonio
    INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2025, 64 (09) : 4966 - 4978
  • [48] yBypass chromatography - design and analysis of an improved strategy for operating batch chromatography processes
    Siitonen, Jani
    Sainio, Tuorno
    Rajendran, Arvind
    JOURNAL OF CHROMATOGRAPHY A, 2012, 1230 : 77 - 92
  • [49] An Improved Statistical Modeling Strategy by Spectroscopy for Online Monitoring and Diagnosis of Batch Processes
    Zhao, Chunhui
    Gao, Furong
    Liu, Tao
    Wang, Fuli
    ASCC: 2009 7TH ASIAN CONTROL CONFERENCE, VOLS 1-3, 2009, : 893 - 898
  • [50] A frequency cooperative control strategy for multimicrogrids with EVs based on improved evolutionary-deep reinforcement learning
    Fan, Peixiao
    Ke, Song
    Yang, Jun
    Wen, Yuxin
    Xie, Lilong
    Li, Yonghui
    Kamel, Salah
    INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2024, 159