Deep reinforcement learning driven trajectory-based meta-heuristic for distributed heterogeneous flexible job shop scheduling problem

被引:1
|
作者
Zhang, Qichen [1 ]
Shao, Weishi [1 ,3 ,4 ]
Shao, Zhongshi [2 ]
Pi, Dechang [4 ]
Gao, Jiaquan [1 ,3 ]
机构
[1] Nanjing Normal Univ, Sch Comp & Elect Informat, Sch Artificial Intelligence, Nanjing, Peoples R China
[2] Shaanxi Normal Univ, Sch Comp Sci, Xian, Peoples R China
[3] Minist Educ, Key Lab Numer Simulat Large Scale Complex Syst, Beijing, Peoples R China
[4] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing, Peoples R China
基金
中国博士后科学基金;
关键词
Distributed heterogeneous flexible job shop; scheduling problem; Deep Q network; Variable neighborhood search; Makespan; Critical path; ALGORITHM; SEARCH;
D O I
10.1016/j.swevo.2024.101753
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As the production environment evolves, distributed manufacturing exhibits heterogeneous characteristics, including diverse machines, workers, and production processes. This paper examines a distributed heterogeneous flexible job shop scheduling problem (DHFJSP) with varying processing times. A mixed integer linear programming (MILP) model of the DHFJSP is formulated with the objective of minimizing the makespan. To solve the DHFJSP, we propose a deep Q network-aided automatic design of a variable neighborhood search algorithm (DQN-VNS). By analyzing schedules, sixty-one types of scheduling features are extracted. These features, along with six shaking strategies, are used as states and actions. A DHFJSP environment simulator is developed to train the deep Q network. The well-trained DQN then generates the shaking procedure for VNS. Additionally, a greedy initialization method is proposed to enhance the quality of the initial solution. Seven efficient critical path-based neighborhood structures with three-vector encoding scheme are introduced to improve local search. Numerical experiments on various scales of instances validate the effectiveness of the MILP model and the DQN-VNS algorithm. The results show that the DQN-VNS algorithm achieves an average relative percentage deviation (ARPD) of 3.2%, which represents an approximately 88.45% reduction compared to the best-performing algorithm among the six compared, with an ARPD of 27.7%. This significant reduction in ARPD highlights the superior stability and performance of the proposed DQN-VNS algorithm.
引用
收藏
页数:23
相关论文
共 50 条
  • [21] An Effective Meta-Heuristic Algorithm to Minimize Makespan in Job Shop Scheduling
    Nazif, Habibeh
    INDUSTRIAL ENGINEERING AND MANAGEMENT SYSTEMS, 2019, 18 (03): : 360 - 368
  • [22] A heuristic algorithm for solving flexible job shop scheduling problem
    Mohsen Ziaee
    The International Journal of Advanced Manufacturing Technology, 2014, 71 : 519 - 528
  • [23] A meta-heuristic approach to solve a JIT scheduling problem in hybrid flow shop
    Khalouli, Safa
    Ghedjati, Fatima
    Hamzaoui, Abdelaziz
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2010, 23 (05) : 765 - 771
  • [24] A cooperative hierarchical deep reinforcement learning based multi-agent method for distributed job shop scheduling problem with random job arrivals
    Huang, Jiang-Ping
    Gao, Liang
    Li, Xin-Yu
    Zhang, Chun-Jiang
    COMPUTERS & INDUSTRIAL ENGINEERING, 2023, 185
  • [25] End-to-End Multitarget Flexible Job Shop Scheduling With Deep Reinforcement Learning
    Wang, Rongkai
    Jing, Yiyang
    Gu, Chaojie
    He, Shibo
    Chen, Jiming
    IEEE INTERNET OF THINGS JOURNAL, 2025, 12 (04): : 4420 - 4434
  • [26] Dynamic multi-objective scheduling for flexible job shop by deep reinforcement learning
    Luo, Shu
    Zhang, Linxuan
    Fan, Yushun
    COMPUTERS & INDUSTRIAL ENGINEERING, 2021, 159
  • [27] A random flight-follow leader and reinforcement learning approach for flexible job shop scheduling problem
    Shao, Changshun
    Yu, Zhenglin
    Ding, Hongchang
    Cao, Guohua
    Duan, Jingsong
    Zhou, Bin
    JOURNAL OF SUPERCOMPUTING, 2025, 81 (03):
  • [28] A spatial pyramid pooling-based deep reinforcement learning model for dynamic job-shop scheduling problem
    Wu, Xinquan
    Yan, Xuefeng
    COMPUTERS & OPERATIONS RESEARCH, 2023, 160
  • [29] An effective hybrid meta-heuristic for flexible flow shop scheduling with limited buffers and step-deteriorating jobs
    Zheng, Qian-Qian
    Zhang, Yu
    Tian, Hong-Wei
    He, Li-Jun
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 106
  • [30] Hyper-heuristic for flexible job shop scheduling problem with stochastic job arrivals
    Lim, Kelvin Ching Wei
    Wong, Li-Pei
    Chin, Jeng Feng
    MANUFACTURING LETTERS, 2023, 36 : 5 - 8