A deep reinforcement learning approach for chemical production scheduling

Cited by: 125

Authors:

Hubbs, Christian D. [1]

Li, Can [1]

Sahinidis, Nikolaos V. [1]

Grossmann, Ignacio E. [1]

Wassick, John M. [2]

Affiliations:

[1] Carnegie Mellon Univ, Dept Chem Engn, Pittsburgh, PA 15123 USA

[2] Digital Fulfillment Ctr, Dow Chem, Midland, MI 48667 USA

Source:

COMPUTERS & CHEMICAL ENGINEERING | 2020 / Vol. 141

Keywords:

Machine learning; Reinforcement learning; Optimization; Scheduling; Stochastic programming; OPTIMIZATION APPROACH; PROCESS SYSTEMS; UNCERTAINTY; MANAGEMENT; GAME; GO;

DOI:

10.1016/j.compchemeng.2020.106982

Chinese Library Classification (CLC) number:

TP39 [Computer applications];

Subject classification codes:

081203 ; 0835 ;

Abstract:

This work examines applying deep reinforcement learning to a chemical production scheduling process to account for uncertainty and achieve online, dynamic scheduling, and benchmarks the results with a mixed-integer linear programming (MILP) model that schedules each time interval on a receding horizon basis. An industrial example is used as a case study for comparing the differing approaches. Results show that the reinforcement learning method outperforms the naive MILP approaches and is competitive with a shrinking horizon MILP approach in terms of profitability, inventory levels, and customer service. The speed and flexibility of the reinforcement learning system is promising for achieving real-time optimization of a scheduling system, but there is reason to pursue integration of data-driven deep reinforcement learning methods and model-based mathematical optimization approaches. (C) 2020 The Authors. Published by Elsevier Ltd.

Pages: 22
