Cooperative multi-agent system for production control using reinforcement learning

被引：35

作者：

Dittrich, Marc-Andre ^{[1
]}

Fohlmeister, Silas ^{[1
]}

机构：

[1] Leibniz Univ Hannover, Inst Prod Engn & Machine Tools IFW, Hannover, Germany

来源：

CIRP ANNALS-MANUFACTURING TECHNOLOGY | 2020年 / 69卷 / 01期

关键词：

Production planning; Machine learning; Multi-agent system;

D O I：

10.1016/j.cirp.2020.04.005

中图分类号：

T [工业技术];

学科分类号：

08 ;

摘要：

Multi-agent systems can limit the control problem in complex production systems and solve them more efficiently. However, they often show local optimization tendencies. This paper presents a novel approach for a cooperative multi-agent system, which uses reinforcement learning and considers global key performance indicators. For this purpose, a central deep q-learning module transfers its knowledge to the cooperative order agents. The order agent's experience is stored in a replay memory for subsequent reinforcement learning. Interdependencies between the characteristics of nonlinear production systems and learning parameters are investigated and the performance is evaluated in comparison to conventional methods of production control. (C) 2020 CIRP. Published by Elsevier Ltd. All rights reserved.

引用

页码：389 / 392

页数：4

共 16 条

[1] [Anonymous], 2002, An Introduction to MultiAgent Systems
[2] [Anonymous], 2013, HDB MANUFACTURING CO
[3] [Anonymous], 1998, INTRO REINFORCEMENT
[4] Product variety management
ElMaraghy, H.
Schuh, G.
ElMaraghy, W.
Piller, F.
Schoensleben, P.
Tseng, M.
Bernard, A.
[J]. CIRP ANNALS-MANUFACTURING TECHNOLOGY, 2013, 62 (02) : 629 - 652
[5] Automatic design of scheduling rules for complex manufacturing systems by multi-objective simulation-based optimization
Freitag, Michael
Hildebrandt, Torsten
[J]. CIRP ANNALS-MANUFACTURING TECHNOLOGY, 2016, 65 (01) : 433 - 436
[6] Glorot X., 2011, P 14 INT C ART INT S
[7] APPROXIMATION CAPABILITIES OF MULTILAYER FEEDFORWARD NETWORKS
HORNIK, K
[J]. NEURAL NETWORKS, 1991, 4 (02) : 251 - 257
[8] Agent-based distributed manufacturing control: A state-of-the-art survey
Leitao, Paulo
[J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2009, 22 (07) : 979 - 991
[9] Human-level control through deep reinforcement learning
Mnih, Volodymyr
Kavukcuoglu, Koray
Silver, David
Rusu, Andrei A.
Veness, Joel
Bellemare, Marc G.
Graves, Alex
Riedmiller, Martin
Fidjeland, Andreas K.
Ostrovski, Georg
Petersen, Stig
Beattie, Charles
Sadik, Amir
Antonoglou, Ioannis
King, Helen
Kumaran, Dharshan
Wierstra, Daan
Legg, Shane
Hassabis, Demis
[J]. NATURE, 2015, 518 (7540) : 529 - 533
[10] Agent-based systems for manufacturing
Monostori, L.
Vancza, J.
Kumara, S. R. T.
[J]. CIRP ANNALS-MANUFACTURING TECHNOLOGY, 2006, 55 (02) : 697 - 720

← 1 2 →