Dynamic production scheduling towards self-organizing mass personalization: A multi-agent dueling deep reinforcement learning approach

Cited by: 29
Authors
Qin, Zhaojun [1]
Johnson, Dazzle [1]
Lu, Yuqian [1]
Affiliations
[1] Univ Auckland, Dept Mech & Mechatron Engn, Auckland, New Zealand
Keywords
Mass personalization; Self-organizing manufacturing network; Dynamic flexible job shop scheduling problem; Multi-agent production scheduling; Reinforcement learning; OF-THE-ART; MANUFACTURING SYSTEMS; MACHINE BREAKDOWNS; GENETIC ALGORITHMS; WORKLOAD CONTROL; BOND GRAPHS; SHOP; AGENT; ARCHITECTURE; OPTIMIZATION;
DOI
10.1016/j.jmsy.2023.03.003
Chinese Library Classification (CLC) number
T [Industrial Technology];
Discipline classification code
08;
Abstract
Mass personalization is rapidly approaching. In response, manufacturing systems should be capable of autonomously changing production plans, configurations and schedules under dynamic manufacturing environments for producing personalized products. Self-organizing manufacturing network is a promising paradigm for mass personalization. The backbone of a self-organizing manufacturing network is an adaptive production scheduling method to dynamically allocate and sequence manufacturing jobs under dynamic settings, such as stochastic processing time or unplanned machine breakdown. However, existing production scheduling methods (i.e., heuristic rules, meta-heuristic algorithms, and existing reinforcement learning models) fail to automatically optimize production schedules while maintaining stable manufacturing performance under dynamic settings. In this paper, we designed a reinforcement learning-based static-training-dynamic-execution approach for dynamic job shop scheduling problems. The scheduling policies are learned from static scheduling instances by a multi-agent dueling deep reinforcement learning approach. Under this approach, we proposed new representations of observation, action, reward, and cooperation mechanisms between agents. The learned scheduling policies are then deployed to a dynamic scheduling system where stochastic processing time and unplanned machine breakdown randomly occur. Extensive simulation experiments demonstrated that our approach outperforms heuristic rules on makespan under two dynamic manufacturing settings.
Pages: 242-257
Number of pages: 16