Deep reinforcement learning for dynamic flexible job shop scheduling problem considering variable processing times

被引:64
作者
Zhang, Lu [1 ]
Feng, Yi [1 ]
Xiao, Qinge [2 ]
Xu, Yunlang [3 ]
Li, Di [4 ]
Yang, Dongsheng [5 ]
Yang, Zhile [2 ]
机构
[1] Dalian Univ Technol, Sch Control Sci & Engn, Dalian, Peoples R China
[2] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen, Peoples R China
[3] Fudan Univ, Sch Microelect, State Key Lab ASIC & Syst, Shanghai, Peoples R China
[4] South China Univ Technol, Sch Mech & Automot Engn, Guangzhou, Peoples R China
[5] Northeastern Univ, Intelligent Elect Sci & Technol Res Inst, Shenyang, Peoples R China
关键词
Dynamic scheduling flexible job shop problem; Deep reinforcement learning; Variable processing times; Proximal policy optimization; Makespan; TABU SEARCH; OPTIMIZATION; ALGORITHM;
D O I
10.1016/j.jmsy.2023.09.009
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In recent years, the uncertainties and complexity in the production process, due to the boosted customized requirements, has dramatically increased the difficulties of Dynamic Flexible Job Shop Scheduling (DFJSP). This paper investigates a new DFJSP model taking into account the minimum completion time under the condition of machine processing time uncertainty, e.t. VPT-FJSP problem. In the formulated VPT-FJSP process, each workpiece needs to be processed by required machine at a certain time slot where Markov decision process (MDP) and reinforcement learning methods are adopted to solve VPT-FJSP. The agent designed in this paper employs the Proximal Policy Optimization(PPO) algorithm in deep reinforcement learning, which includes the Actor-Critic network. The input of the network is to extract the processing information matrix and to embed some advanced states in the workshop by graph neural network, which enables the agent to learn the complete state of the environment. Finally, we train and test the proposed framework on the canonical FJSP benchmark, and the experimental results show that our framework can make agent better than genetic algorithm and ant colony optimization in most cases, 94.29% of static scheduling. It is also shown superiority compared to the scheduling rules in dynamic environment and has demonstrated strong robustness in solving VPT-FJSP. Furthermore, this study conducted tests to assess the generalization capability of the agent on VPT-FJSP at different scales. In terms of exploring Makespan minimization, the agent outperformed four priority scheduling rules. These results indicate that the proposed dynamic scheduling framework and PPO algorithm are more effective in achieving superior solutions.
引用
收藏
页码:257 / 273
页数:17
相关论文
共 47 条
[1]   Robust and stable flexible job shop scheduling with random machine breakdowns using a hybrid genetic algorithm [J].
Al-Hinai, Nasr ;
ElMekkawy, T. Y. .
INTERNATIONAL JOURNAL OF PRODUCTION ECONOMICS, 2011, 132 (02) :279-291
[2]   Dynamic job-shop scheduling using reinforcement learning agents [J].
Aydin, ME ;
Öztemel, E .
ROBOTICS AND AUTONOMOUS SYSTEMS, 2000, 33 (2-3) :169-178
[3]   Executing production schedules in the face of uncertainties: A review and some future directions [J].
Aytug, H ;
Lawley, MA ;
McKay, K ;
Mohan, S ;
Uzsoy, R .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2005, 161 (01) :86-110
[5]   Greedy randomized adaptive search for dynamic flexible job-shop scheduling [J].
Baykasoglu, Adil ;
Madenoglu, Fatma S. ;
Hamzadayi, Alper .
JOURNAL OF MANUFACTURING SYSTEMS, 2020, 56 :425-451
[6]  
Brandimarte P., 1993, Annals of Operations Research, V41, P157, DOI 10.1007/BF02023073
[7]   Two<bold>-</bold>stage teaching-learning<bold>-</bold>based optimization method for flexible job-shop scheduling under machine breakdown [J].
Buddala, Raviteja ;
Mahapatra, Siba Sankar .
INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2019, 100 (5-8) :1419-1432
[8]  
Chen G, 2018, Arxiv, DOI arXiv:1804.06461
[9]   Flexible job shop scheduling problem under Industry 5.0: A survey on human reintegration, environmental consideration and resilience improvement [J].
Destouet, Candice ;
Tlahig, Houda ;
Bettayeb, Belgacem ;
Mazari, Belahcene .
JOURNAL OF MANUFACTURING SYSTEMS, 2023, 67 :155-173
[10]   A survey of dispatching rules for the dynamic unrelated machines environment [J].
Durasevic, Marko ;
Jakobovic, Domagoj .
EXPERT SYSTEMS WITH APPLICATIONS, 2018, 113 :555-569