Integration of reinforcement learning and model predictive control to optimize semi-batch bioreactor

被引:32
作者
Oh, Tae Hoon [1 ]
Park, Hyun Min [1 ]
Kim, Jong Woo [2 ]
Lee, Jong Min [1 ]
机构
[1] Seoul Natl Univ, Sch Chem & Biol Engn, Inst Chem Proc, 1 Gwanak Ro, Seoul 08826, South Korea
[2] Tech Univ Berlin, Bioproc Engn, Berlin, Germany
基金
新加坡国家研究基金会;
关键词
bioprocess; deep neural network; model predictive control; optimal control; reinforcement learning; FED-BATCH FERMENTATION; PENICILLIN PRODUCTION; STRUCTURED MODEL; BIG DATA; STABILITY;
D O I
10.1002/aic.17658
中图分类号
TQ [化学工业];
学科分类号
0817 ;
摘要
As the digital transformation of the bioprocess is progressing, several studies propose to apply data-based methods to obtain a substrate feeding strategy that minimizes the operating cost of a semi-batch bioreactor. However, the negligent application of model-free reinforcement learning (RL) has a high chance to fail on improving the existing control policy because the available amount of data is limited. In this article, we propose an integrated algorithm of double-deep Q-network and model predictive control. The proposed method learns the action-value function in an off-policy fashion and solves the model-based optimal control problem where the terminal cost is assigned by the action-value function. For simulation study, the proposed method, model-based method, and model-free methods are applied to the industrial scale penicillin process. The results show that the proposed method outperforms other methods, and it can learn with fewer data than model-free RL algorithms.
引用
收藏
页数:16
相关论文
共 65 条
[11]  
Duan Y, 2016, PR MACH LEARN RES, V48
[12]   Predictive control of an activated sludge process for long term operation [J].
Foscoliano, Chiara ;
Del Vigo, Stefania ;
Mulas, Michela ;
Tronci, Stefania .
CHEMICAL ENGINEERING JOURNAL, 2016, 304 :1031-1044
[13]  
Fujimoto S, 2018, PR MACH LEARN RES, V80
[14]  
García J, 2015, J MACH LEARN RES, V16, P1437
[15]   The development of an industrial-scale fed-batch fermentation simulation [J].
Goldrick, Stephen ;
Stefan, Andrei ;
Lovett, David ;
Montague, Gary ;
Lennox, Barry .
JOURNAL OF BIOTECHNOLOGY, 2015, 193 :70-82
[16]   Data-Driven Economic NMPC Using Reinforcement Learning [J].
Gros, Sebastien ;
Zanon, Mario .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2020, 65 (02) :636-648
[17]  
Haarnoja T, 2018, PR MACH LEARN RES, V80
[18]   A deep reinforcement learning based multi-criteria decision support system for optimizing textile chemical process [J].
He, Zhenglei ;
Tran, Kim-Phuc ;
Thomassey, Sebastien ;
Zeng, Xianyi ;
Xu, Jie ;
Yi, Changhai .
COMPUTERS IN INDUSTRY, 2021, 125
[19]  
Henderson P, 2018, AAAI CONF ARTIF INTE, P3207
[20]   Learning-Based Model Predictive Control: Toward Safe Learning in Control [J].
Hewing, Lukas ;
Wabersich, Kim P. ;
Menner, Marcel ;
Zeilinger, Melanie N. .
ANNUAL REVIEW OF CONTROL, ROBOTICS, AND AUTONOMOUS SYSTEMS, VOL 3, 2020, 2020, 3 :269-296