Cooperative multi-agent system for production control using reinforcement learning

被引:35
作者
Dittrich, Marc-Andre [1 ]
Fohlmeister, Silas [1 ]
机构
[1] Leibniz Univ Hannover, Inst Prod Engn & Machine Tools IFW, Hannover, Germany
关键词
Production planning; Machine learning; Multi-agent system;
D O I
10.1016/j.cirp.2020.04.005
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Multi-agent systems can limit the control problem in complex production systems and solve them more efficiently. However, they often show local optimization tendencies. This paper presents a novel approach for a cooperative multi-agent system, which uses reinforcement learning and considers global key performance indicators. For this purpose, a central deep q-learning module transfers its knowledge to the cooperative order agents. The order agent's experience is stored in a replay memory for subsequent reinforcement learning. Interdependencies between the characteristics of nonlinear production systems and learning parameters are investigated and the performance is evaluated in comparison to conventional methods of production control. (C) 2020 CIRP. Published by Elsevier Ltd. All rights reserved.
引用
收藏
页码:389 / 392
页数:4
相关论文
共 16 条
  • [1] [Anonymous], 2002, An Introduction to MultiAgent Systems
  • [2] [Anonymous], 2013, HDB MANUFACTURING CO
  • [3] [Anonymous], 1998, INTRO REINFORCEMENT
  • [4] Product variety management
    ElMaraghy, H.
    Schuh, G.
    ElMaraghy, W.
    Piller, F.
    Schoensleben, P.
    Tseng, M.
    Bernard, A.
    [J]. CIRP ANNALS-MANUFACTURING TECHNOLOGY, 2013, 62 (02) : 629 - 652
  • [5] Automatic design of scheduling rules for complex manufacturing systems by multi-objective simulation-based optimization
    Freitag, Michael
    Hildebrandt, Torsten
    [J]. CIRP ANNALS-MANUFACTURING TECHNOLOGY, 2016, 65 (01) : 433 - 436
  • [6] Glorot X., 2011, P 14 INT C ART INT S
  • [7] APPROXIMATION CAPABILITIES OF MULTILAYER FEEDFORWARD NETWORKS
    HORNIK, K
    [J]. NEURAL NETWORKS, 1991, 4 (02) : 251 - 257
  • [8] Agent-based distributed manufacturing control: A state-of-the-art survey
    Leitao, Paulo
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2009, 22 (07) : 979 - 991
  • [9] Human-level control through deep reinforcement learning
    Mnih, Volodymyr
    Kavukcuoglu, Koray
    Silver, David
    Rusu, Andrei A.
    Veness, Joel
    Bellemare, Marc G.
    Graves, Alex
    Riedmiller, Martin
    Fidjeland, Andreas K.
    Ostrovski, Georg
    Petersen, Stig
    Beattie, Charles
    Sadik, Amir
    Antonoglou, Ioannis
    King, Helen
    Kumaran, Dharshan
    Wierstra, Daan
    Legg, Shane
    Hassabis, Demis
    [J]. NATURE, 2015, 518 (7540) : 529 - 533
  • [10] Agent-based systems for manufacturing
    Monostori, L.
    Vancza, J.
    Kumara, S. R. T.
    [J]. CIRP ANNALS-MANUFACTURING TECHNOLOGY, 2006, 55 (02) : 697 - 720