Integration of reinforcement learning and model predictive control to optimize semi-batch bioreactor

被引：32

作者：

Oh, Tae Hoon ^{[1
]}

Park, Hyun Min ^{[1
]}

Kim, Jong Woo ^{[2
]}

Lee, Jong Min ^{[1
]}

机构：

[1] Seoul Natl Univ, Sch Chem & Biol Engn, Inst Chem Proc, 1 Gwanak Ro, Seoul 08826, South Korea

[2] Tech Univ Berlin, Bioproc Engn, Berlin, Germany

来源：

AICHE JOURNAL | 2022年 / 68卷 / 06期

基金：

新加坡国家研究基金会;

关键词：

bioprocess; deep neural network; model predictive control; optimal control; reinforcement learning; FED-BATCH FERMENTATION; PENICILLIN PRODUCTION; STRUCTURED MODEL; BIG DATA; STABILITY;

D O I：

10.1002/aic.17658

中图分类号：

TQ [化学工业];

学科分类号：

0817 ;

摘要：

As the digital transformation of the bioprocess is progressing, several studies propose to apply data-based methods to obtain a substrate feeding strategy that minimizes the operating cost of a semi-batch bioreactor. However, the negligent application of model-free reinforcement learning (RL) has a high chance to fail on improving the existing control policy because the available amount of data is limited. In this article, we propose an integrated algorithm of double-deep Q-network and model predictive control. The proposed method learns the action-value function in an off-policy fashion and solves the model-based optimal control problem where the terminal cost is assigned by the action-value function. For simulation study, the proposed method, model-based method, and model-free methods are applied to the industrial scale penicillin process. The results show that the proposed method outperforms other methods, and it can learn with fewer data than model-free RL algorithms.

引用

页数：16

共 65 条

[11]

Duan Y, 2016, PR MACH LEARN RES, V48

[12] Predictive control of an activated sludge process for long term operation [J].

Foscoliano, Chiara ;

Del Vigo, Stefania ;

Mulas, Michela ;

Tronci, Stefania .

CHEMICAL ENGINEERING JOURNAL, 2016, 304 :1031-1044

[13]

Fujimoto S, 2018, PR MACH LEARN RES, V80

[14]

García J, 2015, J MACH LEARN RES, V16, P1437

[15] The development of an industrial-scale fed-batch fermentation simulation [J].

Goldrick, Stephen ;

Stefan, Andrei ;

Lovett, David ;

Montague, Gary ;

Lennox, Barry .

JOURNAL OF BIOTECHNOLOGY, 2015, 193 :70-82

[16] Data-Driven Economic NMPC Using Reinforcement Learning [J].

Gros, Sebastien ;

Zanon, Mario .

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2020, 65 (02) :636-648

[17]

Haarnoja T, 2018, PR MACH LEARN RES, V80

[18] A deep reinforcement learning based multi-criteria decision support system for optimizing textile chemical process [J].

He, Zhenglei ;

Tran, Kim-Phuc ;

Thomassey, Sebastien ;

Zeng, Xianyi ;

Xu, Jie ;

Yi, Changhai .

COMPUTERS IN INDUSTRY, 2021, 125

[19]

Henderson P, 2018, AAAI CONF ARTIF INTE, P3207

[20] Learning-Based Model Predictive Control: Toward Safe Learning in Control [J].

Hewing, Lukas ;

Wabersich, Kim P. ;

Menner, Marcel ;

Zeilinger, Melanie N. .

ANNUAL REVIEW OF CONTROL, ROBOTICS, AND AUTONOMOUS SYSTEMS, VOL 3, 2020, 2020, 3 :269-296

← 1 2 3 4 5 6 7 →