Reinforcement learning for an intelligent and autonomous production control of complex job-shops under time constraints

被引:51
作者
Altenmueller, Thomas [1 ]
Stueker, Tillmann [1 ]
Waschneck, Bernd [1 ]
Kuhnle, Andreas [2 ]
Lanza, Gisela [2 ]
机构
[1] Infineon Technol AG, Campeon 1-12, D-85579 Neubiberg, Germany
[2] KIT, Wbk Inst Prod Sci, Kaiserstr 12, D-76131 Karlsruhe, Germany
来源
PRODUCTION ENGINEERING-RESEARCH AND DEVELOPMENT | 2020年 / 14卷 / 03期
关键词
Complex job shop; Production planning and control; Reinforcement learning; Time constraints; DESIGN;
D O I
10.1007/s11740-020-00967-8
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Reinforcement learning (RL) offers promising opportunities to handle the ever-increasing complexity in managing modern production systems. We apply a Q-learning algorithm in combination with a process-based discrete-event simulation in order to train a self-learning, intelligent, and autonomous agent for the decision problem of order dispatching in a complex job shop with strict time constraints. For the first time, we combine RL in production control with strict time constraints. The simulation represents the characteristics of complex job shops typically found in semiconductor manufacturing. A real-world use case from a wafer fab is addressed with a developed and implemented framework. The performance of an RL approach and benchmark heuristics are compared. It is shown that RL can be successfully applied to manage order dispatching in a complex environment including time constraints. An RL-agent with a gain function rewarding the selection of the least critical order with respect to time-constraints beats heuristic rules strictly by picking the most critical lot first. Hence, this work demonstrates that a self-learning agent can successfully manage time constraints with the agent performing better than the traditional benchmark, a time-constraint heuristic combining due date deviations and a classical first-in-first-out approach.
引用
收藏
页码:319 / 328
页数:10
相关论文
共 23 条
[1]  
Bauernhansel T., 2014, Automatisierung Und Logistik
[2]  
EVERSHEIM W, 2000, WORTERBUCH PPS DICT
[3]   Matrix structures for high volumes and flexibility in production systems [J].
Greschke, P. ;
Schoenemann, M. ;
Thiede, S. ;
Herrmann, C. .
VARIETY MANAGEMENT IN MANUFACTURING: PROCEEDINGS OF THE 47TH CIRP CONFERENCE ON MANUFACTURING SYSTEMS, 2014, 17 :160-165
[4]  
KIENER S, 2017, PRODUKTIONSMANAGEMEN
[5]  
Klemmt Andreas, 2012, WINTER SIMULATION C
[6]  
KNOPP S, 2016, THESIS
[7]   Design, Implementation and Evaluation of Reinforcement Learning for an Adaptive Order Dispatching in Job Shop Manufacturing Systems [J].
Kuhnle, Andreas ;
Schaefer, Louis ;
Stricker, Nicole ;
Lanza, Gisela .
52ND CIRP CONFERENCE ON MANUFACTURING SYSTEMS (CMS), 2019, 81 :234-239
[8]   Autonomous order dispatching in the semiconductor industry using reinforcement learning [J].
Kuhnle, Andreas ;
Roehrig, Nicole ;
Lanza, Gisela .
12TH CIRP CONFERENCE ON INTELLIGENT COMPUTATION IN MANUFACTURING ENGINEERING, 2019, 79 :391-396
[9]   Reinforcement learning for opportunistic maintenance optimization [J].
Kuhnle, Andreas ;
Jakubik, Johannes ;
Lanza, Gisela .
PRODUCTION ENGINEERING-RESEARCH AND DEVELOPMENT, 2019, 13 (01) :33-41
[10]   Global production networks: Design and operation [J].
Lanza, Gisela ;
Ferdows, Kasra ;
Kara, Sami ;
Mourtzis, Dimitris ;
Schuh, Guenther ;
Vancza, Jozsef ;
Wang, Lihui ;
Wiendahl, Hans-Peter .
CIRP ANNALS-MANUFACTURING TECHNOLOGY, 2019, 68 (02) :823-841