Deep reinforcement learning driven trajectory-based meta-heuristic for distributed heterogeneous flexible job shop scheduling problem

被引:1
|
作者
Zhang, Qichen [1 ]
Shao, Weishi [1 ,3 ,4 ]
Shao, Zhongshi [2 ]
Pi, Dechang [4 ]
Gao, Jiaquan [1 ,3 ]
机构
[1] Nanjing Normal Univ, Sch Comp & Elect Informat, Sch Artificial Intelligence, Nanjing, Peoples R China
[2] Shaanxi Normal Univ, Sch Comp Sci, Xian, Peoples R China
[3] Minist Educ, Key Lab Numer Simulat Large Scale Complex Syst, Beijing, Peoples R China
[4] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing, Peoples R China
基金
中国博士后科学基金;
关键词
Distributed heterogeneous flexible job shop; scheduling problem; Deep Q network; Variable neighborhood search; Makespan; Critical path; ALGORITHM; SEARCH;
D O I
10.1016/j.swevo.2024.101753
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As the production environment evolves, distributed manufacturing exhibits heterogeneous characteristics, including diverse machines, workers, and production processes. This paper examines a distributed heterogeneous flexible job shop scheduling problem (DHFJSP) with varying processing times. A mixed integer linear programming (MILP) model of the DHFJSP is formulated with the objective of minimizing the makespan. To solve the DHFJSP, we propose a deep Q network-aided automatic design of a variable neighborhood search algorithm (DQN-VNS). By analyzing schedules, sixty-one types of scheduling features are extracted. These features, along with six shaking strategies, are used as states and actions. A DHFJSP environment simulator is developed to train the deep Q network. The well-trained DQN then generates the shaking procedure for VNS. Additionally, a greedy initialization method is proposed to enhance the quality of the initial solution. Seven efficient critical path-based neighborhood structures with three-vector encoding scheme are introduced to improve local search. Numerical experiments on various scales of instances validate the effectiveness of the MILP model and the DQN-VNS algorithm. The results show that the DQN-VNS algorithm achieves an average relative percentage deviation (ARPD) of 3.2%, which represents an approximately 88.45% reduction compared to the best-performing algorithm among the six compared, with an ARPD of 27.7%. This significant reduction in ARPD highlights the superior stability and performance of the proposed DQN-VNS algorithm.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] Preference learning based deep reinforcement learning for flexible job shop scheduling problem
    Liu, Xinning
    Han, Li
    Kang, Ling
    Liu, Jiannan
    Miao, Huadong
    COMPLEX & INTELLIGENT SYSTEMS, 2025, 11 (02)
  • [2] A Deep Reinforcement Learning Method Based on a Transformer Model for the Flexible Job Shop Scheduling Problem
    Xu, Shuai
    Li, Yanwu
    Li, Qiuyang
    ELECTRONICS, 2024, 13 (18)
  • [3] An effective hybrid meta-heuristic for a heterogeneous flow shop scheduling problem
    Araujo, Matheus de Freitas
    Arroyo, Jose Elias C.
    Tavares, Ricardo G.
    GECCO'18: PROCEEDINGS OF THE 2018 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2018, : 245 - 252
  • [4] Optimization of job shop scheduling problem based on deep reinforcement learning
    Qiao, Dongping
    Duan, Lvqi
    Li, Honglei
    Xiao, Yanqiu
    EVOLUTIONARY INTELLIGENCE, 2024, 17 (01) : 371 - 383
  • [5] A heuristic algorithm for the distributed and flexible job-shop scheduling problem
    Ziaee, Mohsen
    JOURNAL OF SUPERCOMPUTING, 2014, 67 (01): : 69 - 83
  • [6] A heuristic algorithm for the distributed and flexible job-shop scheduling problem
    Mohsen Ziaee
    The Journal of Supercomputing, 2014, 67 : 69 - 83
  • [7] Solving Flexible Job-Shop Scheduling Problem with Heterogeneous Graph Neural Network Based on Relation and Deep Reinforcement Learning
    Tang, Hengliang
    Dong, Jinda
    MACHINES, 2024, 12 (08)
  • [8] Expert-Guided Deep Reinforcement Learning for Flexible Job Shop Scheduling Problem
    Zhang, Wenqiang
    Geng, Huili
    Bao, Xuan
    Gen, Mitsuo
    Zhang, Guohui
    Deng, Miaolei
    BIO-INSPIRED COMPUTING: THEORIES AND APPLICATIONS, PT 2, BIC-TA 2023, 2024, 2062 : 50 - 60
  • [9] Incorporating learning effect and deterioration for solving a SDST flexible job-shop scheduling problem with a hybrid meta-heuristic approach
    Araghi, M. E. Tayebi
    Jolai, F.
    Rabiee, M.
    INTERNATIONAL JOURNAL OF COMPUTER INTEGRATED MANUFACTURING, 2014, 27 (08) : 733 - 746
  • [10] Co-Evolution With Deep Reinforcement Learning for Energy-Aware Distributed Heterogeneous Flexible Job Shop Scheduling
    Li, Rui
    Gong, Wenyin
    Wang, Ling
    Lu, Chao
    Dong, Chenxin
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, 54 (01): : 201 - 211