Self-adapting WIP parameter setting using deep reinforcement learning

被引：4

作者：

De Andrade e Silva, Manuel Tome ^{[1
]}

Azevedo, Americo ^{[1
,2
]}

机构：

[1] Univ Porto, Fac Engn, Porto, Portugal

[2] Inst Syst & Comp Engn, Technol & Sci, Porto, Portugal

来源：

COMPUTERS & OPERATIONS RESEARCH | 2022年 / 144卷

关键词：

WIP reduction; CONWIP; Deep reinforcement learning; WORKLOAD CONTROL; SYSTEMS; CONWIP; NUMBER; KANBANS; MULTIPRODUCT; THROUGHPUT; ALGORITHM; TIMES;

D O I：

10.1016/j.cor.2022.105854

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

This study investigates the potential of dynamically adjusting WIP cap levels to maximize the throughput (TH) performance and minimize work in process (WIP), according to real-time system state arising from process variability associated with low volume and high-variety production systems. Using an innovative approach based on state-of-the-art deep reinforcement learning (proximal policy optimization algorithm), we attain WIP reductions of up to 50% and 30%, with practically no losses in throughput, against pure-push systems and the statistical throughput control method (STC), respectively. An exploratory study based on simulation experiments was performed to provide support to our research. The reinforcement learning agent's performance was shown to be robust to variability changes within the production systems.

引用

页数：14

共 52 条

[41] Ordering alternatives in JIT production systems [J].