An end-to-end decentralised scheduling framework based on deep reinforcement learning for dynamic distributed heterogeneous flowshop scheduling

Cited by: 0

Authors:

Li, Haoran [1]

Gao, Liang [1]

Fan, Qingsong [1]

Li, Xinyu [1]

Han, Baoan [2,3]

Affiliations:

[1] Huazhong Univ Sci & Technol, Sch Mech Sci & Engn, State Key Lab Digital Mfg Equipment & Technol, Wuhan 430074, Peoples R China

[2] Beijing Xiaomi Mobile Software Co Ltd, Beijing, Peoples R China

[3] Beihang Univ, Dept Ind & Mfg Syst Engn, Beijing, Peoples R China

Source:

INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH | 2025

Keywords:

Distributed heterogeneous flowshop; decentralised scheduling framework; deep reinforcement learning; dynamic scheduling; greedy heuristics; PERMUTATION FLOWSHOP; GREEDY ALGORITHM;

DOI:

10.1080/00207543.2024.2449240

Chinese Library Classification (CLC) number:

T [Industrial Technology];

Discipline classification code:

08;

Abstract:

Heterogeneity among factories in distributed manufacturing significantly expands the solution space, complicating optimisation. Traditional centralised scheduling methods lack the scalability to adapt to varying factory scales. This paper proposes an end-to-end decentralised scheduling framework based on deep reinforcement learning (DRL) for the dynamic distributed heterogeneous permutation flowshop scheduling problem (DDHPFSP) with random job arrivals. The framework uses a multi-agent architecture in which each factory operates as an independent agent, enabling efficient, robust, and scalable scheduling. Specifically, the DDHPFSP is formulated as a partially observable Markov decision process (POMDP), with a state space that reflects heterogeneity and permutation characteristics and a new tailored reward function that addresses sparse rewards and high reward variance. An end-to-end policy network with a dual-layer architecture is developed, incorporating a feature extraction network that captures intrinsic relationships between jobs and heterogeneous factories and enhances the agent's self-learning and policy evolution. Moreover, a backward swap search (BSS) method based on greedy heuristics optimises the pre-scheduling plan during the online phase with minimal computation time. Experimental results demonstrate that the framework outperforms the best comparison methods by 39.76% on 540 baseline instances and by 59.95% on 2430 generalisation instances. Furthermore, the framework's effectiveness improves by 68.9% with the introduction of the BSS method.
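To make the problem setting in the abstract concrete, the following is a minimal Python sketch of a distributed heterogeneous permutation flowshop with a simple greedy dispatching baseline. It is not the paper's DRL framework or its BSS method, neither of which is specified in this record. The function names (`makespan`, `greedy_dispatch`), the data layout `proc_by_factory[f][j][m]`, and the sample processing times are all assumptions made for illustration.

```python
from typing import List, Sequence


def makespan(sequence: Sequence[int], proc: Sequence[Sequence[float]]) -> float:
    """Makespan of a job permutation in one factory.

    proc[j][m] is the (assumed) processing time of job j on machine m.
    Uses the standard permutation flowshop recurrence
    C[j][m] = max(C[prev][m], C[j][m-1]) + proc[j][m].
    """
    if not sequence:
        return 0.0
    n_machines = len(proc[sequence[0]])
    # finish[m]: completion time of the most recently scheduled job on machine m
    finish = [0.0] * n_machines
    for j in sequence:
        finish[0] += proc[j][0]
        for m in range(1, n_machines):
            finish[m] = max(finish[m], finish[m - 1]) + proc[j][m]
    return finish[-1]


def greedy_dispatch(
    jobs: Sequence[int],
    proc_by_factory: Sequence[Sequence[Sequence[float]]],
) -> List[List[int]]:
    """Append each arriving job to the factory whose makespan grows least.

    Heterogeneity is modelled by giving every factory its own processing
    time table. Ties go to the lowest factory index. This is an illustrative
    baseline only, not the method proposed in the paper.
    """
    schedules: List[List[int]] = [[] for _ in proc_by_factory]
    for j in jobs:
        best = min(
            range(len(proc_by_factory)),
            key=lambda f: makespan(schedules[f] + [j], proc_by_factory[f]),
        )
        schedules[best].append(j)
    return schedules


# Hypothetical instance: 2 factories, 3 jobs, 2 machines per factory.
proc_by_factory = [
    [[3, 2], [1, 4], [2, 2]],  # factory 0
    [[2, 3], [2, 2], [4, 1]],  # factory 1
]
schedules = greedy_dispatch([0, 1, 2], proc_by_factory)
factory_makespans = [makespan(s, p) for s, p in zip(schedules, proc_by_factory)]
```

In this sketch each job is dispatched once, in arrival order, and never reordered. A learned policy or a swap-based local search, such as the ones the abstract describes, would revisit those decisions.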

Pages: 21
