An end-to-end decentralised scheduling framework based on deep reinforcement learning for dynamic distributed heterogeneous flowshop scheduling

Cited by: 0
Authors
Li, Haoran [1]
Gao, Liang [1]
Fan, Qingsong [1]
Li, Xinyu [1]
Han, Baoan [2, 3]
Affiliations
[1] Huazhong Univ Sci & Technol, Sch Mech Sci & Engn, State Key Lab Digital Mfg Equipment & Technol, Wuhan 430074, Peoples R China
[2] Beijing Xiaomi Mobile Software Co Ltd, Beijing, Peoples R China
[3] Beihang Univ, Dept Ind & Mfg Syst Engn, Beijing, Peoples R China
Keywords
Distributed heterogeneous flowshop; decentralised scheduling framework; deep reinforcement learning; dynamic scheduling; greedy heuristics; PERMUTATION FLOWSHOP; GREEDY ALGORITHM
DOI
10.1080/00207543.2024.2449240
Chinese Library Classification
T [Industrial technology]
Discipline classification code
08
Abstract
Heterogeneity among factories in distributed manufacturing significantly expands the solution space, complicating optimisation. Traditional centralised scheduling methods lack the scalability to adapt to varying factory scales. This paper proposes an end-to-end decentralised scheduling framework based on deep reinforcement learning (DRL) for the dynamic distributed heterogeneous permutation flowshop scheduling problem (DDHPFSP) with random job arrivals. The framework uses a multi-agent architecture in which each factory operates as an independent agent, enabling efficient, robust, and scalable scheduling. Specifically, the DDHPFSP is formulated as a partially observable Markov decision process (POMDP), with a state space reflecting heterogeneity and permutation characteristics and a new tailored reward function that addresses sparse rewards and high reward variance. An end-to-end policy network with a dual-layer architecture is developed, incorporating a feature extraction network that captures intrinsic relationships between jobs and heterogeneous factories, enhancing the agent's self-learning and policy evolution. Moreover, a backward swap search (BSS) method based on greedy heuristics optimises the pre-scheduling plan during the online phase with minimal computation time. Experimental results demonstrate that the framework outperforms the best comparison methods by 39.76% on 540 baseline instances and by 59.95% on 2430 generalisation instances. Furthermore, the framework's effectiveness improves by 68.9% with the introduction of the BSS method.
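The abstract does not spell out how the BSS method works, so the following is only an illustrative sketch of the general idea of a greedy swap-based refinement on a single permutation flowshop, not the authors' algorithm. The function names `makespan` and `backward_swap_pass`, the `proc[job][machine]` layout, and the choice of adjacent swaps scanned from the end of the sequence are all assumptions made for illustration.

```python
from typing import List, Sequence, Tuple


def makespan(order: Sequence[int], proc: Sequence[Sequence[float]]) -> float:
    """Makespan of a permutation flowshop schedule.

    proc[j][m] is the processing time of job j on machine m (assumed layout).
    Every job visits the machines in the same order 0, 1, ..., M-1.
    """
    if not order:
        return 0.0
    n_machines = len(proc[order[0]])
    done = [0.0] * n_machines  # completion time of the previous job on each machine
    for job in order:
        prev = 0.0  # completion time of this job on the previous machine
        for m in range(n_machines):
            start = max(done[m], prev)  # wait for both the machine and the job
            prev = start + proc[job][m]
            done[m] = prev
    return done[-1]


def backward_swap_pass(
    order: Sequence[int], proc: Sequence[Sequence[float]]
) -> Tuple[List[int], float]:
    """One greedy pass over adjacent pairs, scanning from the back of the sequence.

    A swap is kept only if it strictly lowers the makespan (hypothetical rule,
    chosen for illustration; the paper's BSS may differ).
    """
    best = list(order)
    best_cost = makespan(best, proc)
    for i in range(len(best) - 1, 0, -1):
        cand = best[:]
        cand[i - 1], cand[i] = cand[i], cand[i - 1]
        cost = makespan(cand, proc)
        if cost < best_cost:
            best, best_cost = cand, cost
    return best, best_cost
```

A single pass costs one makespan evaluation per adjacent pair, which is why swap-style refinements of this kind are cheap enough to run online after a policy has produced an initial sequence.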
Pages: 21