Deep reinforcement learning driven trajectory-based meta-heuristic for distributed heterogeneous flexible job shop scheduling problem

Cited by: 1

Authors:

Zhang, Qichen [1]

Shao, Weishi [1,3,4]

Shao, Zhongshi [2]

Pi, Dechang [4]

Gao, Jiaquan [1,3]

Affiliations:

[1] Nanjing Normal Univ, Sch Comp & Elect Informat, Sch Artificial Intelligence, Nanjing, Peoples R China

[2] Shaanxi Normal Univ, Sch Comp Sci, Xian, Peoples R China

[3] Minist Educ, Key Lab Numer Simulat Large Scale Complex Syst, Beijing, Peoples R China

[4] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing, Peoples R China

Source:

SWARM AND EVOLUTIONARY COMPUTATION | 2024 / Volume 91

Funding:

China Postdoctoral Science Foundation;

Keywords:

Distributed heterogeneous flexible job shop scheduling problem; Deep Q network; Variable neighborhood search; Makespan; Critical path; ALGORITHM; SEARCH;

DOI:

10.1016/j.swevo.2024.101753

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

As the production environment evolves, distributed manufacturing exhibits heterogeneous characteristics, including diverse machines, workers, and production processes. This paper examines a distributed heterogeneous flexible job shop scheduling problem (DHFJSP) with varying processing times. A mixed integer linear programming (MILP) model of the DHFJSP is formulated with the objective of minimizing the makespan. To solve the DHFJSP, we propose a deep Q network-aided automatic design of a variable neighborhood search algorithm (DQN-VNS). By analyzing schedules, sixty-one types of scheduling features are extracted. These features serve as the states, and six shaking strategies serve as the actions. A DHFJSP environment simulator is developed to train the deep Q network. The well-trained DQN then generates the shaking procedure for VNS. Additionally, a greedy initialization method is proposed to enhance the quality of the initial solution. Seven efficient critical path-based neighborhood structures, built on a three-vector encoding scheme, are introduced to improve local search. Numerical experiments on instances of various scales validate the effectiveness of the MILP model and the DQN-VNS algorithm. The results show that the DQN-VNS algorithm achieves an average relative percentage deviation (ARPD) of 3.2%, an approximately 88.45% reduction relative to the 27.7% ARPD of the best-performing of the six compared algorithms. This significant reduction in ARPD highlights the superior stability and performance of the proposed DQN-VNS algorithm.
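
The ARPD figures in the abstract can be checked with a few lines of arithmetic. The sketch below is illustrative only: the abstract does not define RPD, so the function `rpd` assumes the definition commonly used in the scheduling literature (deviation of a makespan from the best-known makespan, as a percentage). The helper names and the toy makespans are hypothetical and do not come from the paper; only the values 3.2 and 27.7 are taken from the abstract.

```python
from statistics import mean


def rpd(makespan: float, best_known: float) -> float:
    """Relative percentage deviation of one run from the best-known makespan.

    Assumed definition (not stated in the abstract):
        RPD = 100 * (C - C_best) / C_best
    """
    if best_known <= 0:
        raise ValueError("best_known must be positive")
    return 100.0 * (makespan - best_known) / best_known


def arpd(makespans, best_known: float) -> float:
    """Average RPD over several independent runs on one instance."""
    return mean(rpd(m, best_known) for m in makespans)


def relative_reduction(baseline: float, proposed: float) -> float:
    """Percentage by which `proposed` is smaller than `baseline`."""
    return 100.0 * (baseline - proposed) / baseline


# Toy makespans (hypothetical): three runs against a best-known value of 100.
toy_arpd = arpd([103, 105, 110], best_known=100)

# ARPD values reported in the abstract: 27.7 (best competitor), 3.2 (DQN-VNS).
reduction = relative_reduction(baseline=27.7, proposed=3.2)
print(f"toy ARPD = {toy_arpd:.1f}%, reduction = {reduction:.2f}%")
```

With the reported values, (27.7 - 3.2) / 27.7 rounds to 88.45%, which matches the reduction stated in the abstract.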

Pages: 23

50 records in total

[21] An Effective Meta-Heuristic Algorithm to Minimize Makespan in Job Shop Scheduling
Nazif, Habibeh
INDUSTRIAL ENGINEERING AND MANAGEMENT SYSTEMS, 2019, 18 (03) : 360 - 368
[22] A heuristic algorithm for solving flexible job shop scheduling problem
Ziaee, Mohsen
INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2014, 71 : 519 - 528
[23] A meta-heuristic approach to solve a JIT scheduling problem in hybrid flow shop
Khalouli, Safa
Ghedjati, Fatima
Hamzaoui, Abdelaziz
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2010, 23 (05) : 765 - 771
[24] A cooperative hierarchical deep reinforcement learning based multi-agent method for distributed job shop scheduling problem with random job arrivals
Huang, Jiang-Ping
Gao, Liang
Li, Xin-Yu
Zhang, Chun-Jiang
COMPUTERS & INDUSTRIAL ENGINEERING, 2023, 185
[25] End-to-End Multitarget Flexible Job Shop Scheduling With Deep Reinforcement Learning
Wang, Rongkai
Jing, Yiyang
Gu, Chaojie
He, Shibo
Chen, Jiming
IEEE INTERNET OF THINGS JOURNAL, 2025, 12 (04) : 4420 - 4434
[26] Dynamic multi-objective scheduling for flexible job shop by deep reinforcement learning
Luo, Shu
Zhang, Linxuan
Fan, Yushun
COMPUTERS & INDUSTRIAL ENGINEERING, 2021, 159
[27] A random flight-follow leader and reinforcement learning approach for flexible job shop scheduling problem
Shao, Changshun
Yu, Zhenglin
Ding, Hongchang
Cao, Guohua
Duan, Jingsong
Zhou, Bin
JOURNAL OF SUPERCOMPUTING, 2025, 81 (03)
[28] A spatial pyramid pooling-based deep reinforcement learning model for dynamic job-shop scheduling problem
Wu, Xinquan
Yan, Xuefeng
COMPUTERS & OPERATIONS RESEARCH, 2023, 160
[29] An effective hybrid meta-heuristic for flexible flow shop scheduling with limited buffers and step-deteriorating jobs
Zheng, Qian-Qian
Zhang, Yu
Tian, Hong-Wei
He, Li-Jun
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 106
[30] Hyper-heuristic for flexible job shop scheduling problem with stochastic job arrivals
Lim, Kelvin Ching Wei
Wong, Li-Pei
Chin, Jeng Feng
MANUFACTURING LETTERS, 2023, 36 : 5 - 8
