Self-adapting WIP parameter setting using deep reinforcement learning

Cited by: 4
Authors
De Andrade e Silva, Manuel Tome [1]
Azevedo, Americo [1, 2]
Affiliations
[1] Univ Porto, Fac Engn, Porto, Portugal
[2] Inst Syst & Comp Engn, Technol & Sci, Porto, Portugal
Keywords
WIP reduction; CONWIP; Deep reinforcement learning; WORKLOAD CONTROL; SYSTEMS; NUMBER; KANBANS; MULTIPRODUCT; THROUGHPUT; ALGORITHM; TIMES
DOI
10.1016/j.cor.2022.105854
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification codes
081203; 0835
Abstract
This study investigates the potential of dynamically adjusting WIP cap levels to maximize throughput (TH) performance and minimize work in process (WIP), according to the real-time system state arising from the process variability associated with low-volume, high-variety production systems. Using an innovative approach based on state-of-the-art deep reinforcement learning (the proximal policy optimization algorithm), we attain WIP reductions of up to 50% and 30%, with practically no losses in throughput, against pure-push systems and the statistical throughput control (STC) method, respectively. An exploratory study based on simulation experiments was performed to support our research. The reinforcement learning agent's performance was shown to be robust to variability changes within the production systems.
Pages: 14
Related papers
52 in total
[41]   Ordering alternatives in JIT production systems [J].
Takahashi, K ;
Nakamura, N .
PRODUCTION PLANNING & CONTROL, 1998, 9 (08) :784-794
[42]   An adaptive approach to controlling kanban systems [J].
Tardif, V ;
Maaseidvaag, L .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2001, 132 (02) :411-424
[43]   Workload Control and Order Release: A Lean Solution for Make-to-Order Companies [J].
Thuerer, Matthias ;
Stevenson, Mark ;
Silva, Cristovao ;
Land, Martin J. ;
Fredendall, Lawrence D. .
PRODUCTION AND OPERATIONS MANAGEMENT, 2012, 21 (05) :939-953
[44]   Optimising workload norms: the influence of shop floor characteristics on setting workload norms for the workload control concept [J].
Thuerer, Matthias ;
Silva, Cristovao ;
Stevenson, Mark .
INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2011, 49 (04) :1151-1171
[45]   Material Flow Control in High-Variety Make-to-Order Shops: Combining COBACABANA and POLCA [J].
Thurer, Matthias ;
Fernandes, Nuno O. ;
Stevenson, Mark .
PRODUCTION AND OPERATIONS MANAGEMENT, 2020, 29 (09) :2138-2152
[46]   OPTIMUM NUMBER OF KANBANS BETWEEN 2 ADJACENT WORKSTATIONS IN A JIT SYSTEM [J].
WANG, HL ;
WANG, HP .
INTERNATIONAL JOURNAL OF PRODUCTION ECONOMICS, 1991, 22 (03) :179-188
[47]   Parallel algorithm for setting WIP levels for multi-product CONWIP systems [J].
Wang, L. ;
Prabhu, V. .
INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2006, 44 (21) :4681-4693
[48]  
Wang Z., 2016, Sample efficient actor-critic with experience replay
[49]  
Wu Y., 2017, Advances in Neural Information Processing Systems, P5280
[50]   Reinforcement learning-based adaptive production control of pull manufacturing systems [J].
Xanthopoulos, A. S. ;
Chnitidis, G. ;
Koulouriotis, D. E. .
JOURNAL OF INDUSTRIAL AND PRODUCTION ENGINEERING, 2019, 36 (05) :313-323