A deep reinforcement learning approach for chemical production scheduling

被引:125
作者
Hubbs, Christian D. [1 ]
Li, Can [1 ]
Sahinidis, Nikolaos, V [1 ]
Grossmann, Ignacio E. [1 ]
Wassick, John M. [2 ]
机构
[1] Carnegie Mellon Univ, Dept Chem Engn, Pittsburgh, PA 15123 USA
[2] Digital Fulfillment Ctr, Dow Chem, Midland, MI 48667 USA
关键词
Machine learning; Reinforcement learning; Optimization; Scheduling; Stochastic programming; OPTIMIZATION APPROACH; PROCESS SYSTEMS; UNCERTAINTY; MANAGEMENT; GAME; GO;
D O I
10.1016/j.compchemeng.2020.106982
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This work examines applying deep reinforcement learning to a chemical production scheduling process to account for uncertainty and achieve online, dynamic scheduling, and benchmarks the results with a mixed-integer linear programming (MILP) model that schedules each time interval on a receding horizon basis. An industrial example is used as a case study for comparing the differing approaches. Results show that the reinforcement learning method outperforms the naive MILP approaches and is competitive with a shrinking horizon MILP approach in terms of profitability, inventory levels, and customer service. The speed and flexibility of the reinforcement learning system is promising for achieving real-time optimization of a scheduling system, but there is reason to pursue integration of data-driven deep reinforcement learning methods and model-based mathematical optimization approaches. (C) 2020 The Authors. Published by Elsevier Ltd.
引用
收藏
页数:22
相关论文
共 46 条
[1]  
[Anonymous], 1997, Introduction to Stochastic Programming
[2]  
[Anonymous], 2018, AAAI
[3]  
[Anonymous], 2016, HOTNETS, DOI DOI 10.1145/3005745.3005750
[4]  
Badgwell T.A., 2018, COMPUTER AIDED CHEM, P71, DOI 10.1016/B978-0-
[5]   Approximation to multistage stochastic optimization in multiperiod batch plant scheduling under demand uncertainty [J].
Balasubramanian, J ;
Grossmann, IE .
INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2004, 43 (14) :3695-3713
[6]   Scheduling optimization under uncertainty - an alternative approach [J].
Balasubramanian, J ;
Grossmann, IE .
COMPUTERS & CHEMICAL ENGINEERING, 2003, 27 (04) :469-490
[7]  
Bellman R., 1957, MARKOVIAN DECISION P, DOI [10.1007/BF02935461, DOI 10.1007/BF02935461]
[8]   Theory and Applications of Robust Optimization [J].
Bertsimas, Dimitris ;
Brown, David B. ;
Caramanis, Constantine .
SIAM REVIEW, 2011, 53 (03) :464-501
[9]  
Bishop CM., 2006, Springer Google Schola, V2, P1122, DOI [10.5555/1162264, DOI 10.18637/JSS.V017.B05]
[10]  
C3D Labs LLC, 2020, C3D TOOLKIT DEVELOPE