Dynamic production scheduling towards self-organizing mass personalization: A multi-agent dueling deep reinforcement learning approach

Cited by: 40

Authors:

Qin, Zhaojun [1]

Johnson, Dazzle [1]

Lu, Yuqian [1]

Affiliations:

[1] Univ Auckland, Dept Mech & Mechatron Engn, Auckland, New Zealand

Source:

JOURNAL OF MANUFACTURING SYSTEMS | 2023 / Vol. 68

Keywords:

Mass personalization; Self-organizing manufacturing network; Dynamic flexible job shop scheduling problem; Multi-agent production scheduling; Reinforcement learning; OF-THE-ART; MANUFACTURING SYSTEMS; MACHINE BREAKDOWNS; GENETIC ALGORITHMS; WORKLOAD CONTROL; BOND GRAPHS; SHOP; AGENT; ARCHITECTURE; OPTIMIZATION

DOI:

10.1016/j.jmsy.2023.03.003

Chinese Library Classification (CLC):

T [Industrial Technology]

Discipline classification code:

08

Abstract:

Mass personalization is rapidly approaching. In response, manufacturing systems should be capable of autonomously changing production plans, configurations and schedules under dynamic manufacturing environments for producing personalized products. Self-organizing manufacturing network is a promising paradigm for mass personalization. The backbone of a self-organizing manufacturing network is an adaptive production scheduling method to dynamically allocate and sequence manufacturing jobs under dynamic settings, such as stochastic processing time or unplanned machine breakdown. However, existing production scheduling methods (i.e., heuristic rules, meta-heuristic algorithms, and existing reinforcement learning models) fail to automatically optimize production schedules while maintaining stable manufacturing performance under dynamic settings. In this paper, we designed a reinforcement learning-based static-training-dynamic-execution approach for dynamic job shop scheduling problems. The scheduling policies are learned from static scheduling instances by a multi-agent dueling deep reinforcement learning approach. Under this approach, we proposed new representations of observation, action, reward, and cooperation mechanisms between agents. The learned scheduling policies are then deployed to a dynamic scheduling system where stochastic processing time and unplanned machine breakdown randomly occur. Extensive simulation experiments demonstrated that our approach outperforms heuristic rules on makespan under two dynamic manufacturing settings.
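The abstract names a dueling deep reinforcement learning approach but does not give the network design. As a rough illustration only, the sketch below shows the mean-subtracted aggregation commonly used in dueling architectures, Q(s, a) = V(s) + A(s, a) - mean over a' of A(s, a'). The function names, the three candidate actions, and the numeric values are hypothetical and are not taken from the paper; the paper's actual observation, action, and reward representations are not reproduced here.

```python
from typing import List


def dueling_q_values(state_value: float, advantages: List[float]) -> List[float]:
    """Combine a scalar state value V(s) with per-action advantages A(s, a).

    Standard dueling aggregation (an assumption about the general technique,
    not the paper's exact model):
        Q(s, a) = V(s) + A(s, a) - mean_a' A(s, a')
    """
    mean_adv = sum(advantages) / len(advantages)
    return [state_value + a - mean_adv for a in advantages]


def greedy_action(q_values: List[float]) -> int:
    """Return the index of the action with the highest Q-value."""
    return max(range(len(q_values)), key=q_values.__getitem__)


# Hypothetical example: one scheduling agent choosing among three
# candidate actions (e.g. three jobs waiting at a machine).
q = dueling_q_values(state_value=2.0, advantages=[0.5, -0.5, 1.5])
best = greedy_action(q)
```

Subtracting the mean advantage keeps the average of the Q-values equal to V(s), which makes the value and advantage streams identifiable. In a multi-agent setting, each agent would presumably evaluate such Q-values over its own local actions.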


Pages: 242-257

Page count: 16

References: 77 in total

