A deep reinforcement learning based hyper-heuristic for modular production control

Cited by: 11

Authors:

Panzer, Marcel [1, 2]

Bender, Benedict [1]

Gronau, Norbert [1]

Affiliations:

[1] Univ Potsdam, Chair Business Informat Proc & Syst, Potsdam, Germany

[2] Univ Potsdam, Chair Business Informat Proc & Syst, Karl Marx St 67, D-14482 Potsdam, Germany

Source:

INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH | 2024 / Vol. 62 / Issue 08

Keywords:

Production control; modular production; multi-agent system; deep reinforcement learning; deep learning; multi-objective optimisation; DISPATCHING RULES; FRAMEWORK; SIMULATION; SYSTEMS;

DOI:

10.1080/00207543.2023.2233641

Chinese Library Classification (CLC) number:

T [Industrial Technology];

Discipline classification code:

08;

Abstract:

In today's production environments, fluctuating demand, shortening product life cycles, and highly configurable products require an adaptive and robust control approach to maintain competitiveness. This approach must not only optimise desired production objectives but also cope with unforeseen machine failures, rush orders, and changes in short-term demand. Previous control approaches were often implemented using a single operations layer and a standalone deep learning approach, which may not adequately address the complex organisational demands of modern manufacturing systems. To address this challenge, we propose a hyper-heuristic control model within a semi-heterarchical production system, in which multiple manufacturing and distribution agents are spread across pre-defined modules. The agents employ a deep reinforcement learning algorithm to learn a policy for selecting low-level heuristics in a situation-specific manner, thereby improving system performance and adaptability. We tested our approach in simulation and transferred it to a hybrid production environment. In doing so, we demonstrated its multi-objective optimisation capabilities relative to conventional approaches in terms of mean throughput time, tardiness, and processing of prioritised orders in a multi-layered production system. The modular design shows promise for reducing overall system complexity and facilitates quick and seamless integration into other scenarios.
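
Illustrative sketch (not the authors' implementation): the abstract describes agents that learn a policy for selecting low-level heuristics depending on the situation. The snippet below shows that general idea in miniature. Everything in it is an assumption made for illustration: the three dispatching rules (FIFO, SPT, EDD), the `Order` fields, the coarse state from `observe`, and the tabular epsilon-greedy Q-learner, which stands in for the deep reinforcement learning algorithm used in the paper.

```python
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class Order:
    # Hypothetical order attributes, chosen only for this sketch.
    processing_time: float
    due_date: float
    arrival: int


# Low-level heuristics: each picks one order from a non-empty queue.
def fifo(queue):
    return min(queue, key=lambda o: o.arrival)          # first in, first out


def spt(queue):
    return min(queue, key=lambda o: o.processing_time)  # shortest processing time


def edd(queue):
    return min(queue, key=lambda o: o.due_date)         # earliest due date


HEURISTICS = [fifo, spt, edd]


def observe(queue, now):
    """Coarse state: capped queue length and whether any order is already late."""
    return (min(len(queue), 3), any(o.due_date < now for o in queue))


class HeuristicSelector:
    """Tabular epsilon-greedy Q-learning, a simple stand-in for a deep RL policy."""

    def __init__(self, n_actions, epsilon=0.1, alpha=0.5, gamma=0.9, seed=0):
        self.n_actions = n_actions
        self.epsilon, self.alpha, self.gamma = epsilon, alpha, gamma
        self.rng = random.Random(seed)
        self.q = {}  # state -> list of action values

    def select(self, state):
        # Explore with probability epsilon, otherwise pick the best-known heuristic.
        if self.rng.random() < self.epsilon:
            return self.rng.randrange(self.n_actions)
        values = self.q.get(state, [0.0] * self.n_actions)
        return max(range(self.n_actions), key=values.__getitem__)

    def update(self, state, action, reward, next_state):
        # Standard one-step Q-learning update.
        values = self.q.setdefault(state, [0.0] * self.n_actions)
        next_best = max(self.q.get(next_state, [0.0] * self.n_actions))
        values[action] += self.alpha * (reward + self.gamma * next_best - values[action])


queue = [Order(5.0, 20.0, 0), Order(2.0, 30.0, 1), Order(4.0, 10.0, 2)]
selector = HeuristicSelector(n_actions=len(HEURISTICS))
state = observe(queue, now=15)
chosen_order = HEURISTICS[selector.select(state)](queue)
```

In a full control loop, the reward passed to `update` would encode the production objectives named in the abstract (for example throughput time or tardiness); how the paper defines its reward is not stated in this record.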

Pages: 2747-2768

Page count: 22

