Graph neural networks-based scheduler for production planning problems using reinforcement learning

被引：12

作者：

Hameed, Mohammed Sharafath Abdul ^{[1
]}

Schwung, Andreas ^{[1
]}

机构：

[1] South Westphalia Univ Appl Sci, Dept Automat Technol & Learning Syst, D-59494 Soest, Germany

来源：

JOURNAL OF MANUFACTURING SYSTEMS | 2023年 / 69卷

关键词：

Job shop scheduling; Reinforcement learning; Graph neural networks; Distributed optimization; Production planning; Tabu search; Genetic algorithm; PROGRAMMING-MODELS; TABU SEARCH; GO; ALGORITHM; SYSTEMS; SHOGI; CHESS;

D O I：

10.1016/j.jmsy.2023.06.005

中图分类号：

T [工业技术];

学科分类号：

08 ;

摘要：

Reinforcement learning (RL) is increasingly adopted in job shop scheduling problems (JSSP). But RL for JSSP is usually done using a vectorized representation of machine features as the state space. It has three major problems: (1) the relationship between the machine units and the job sequence is not fully captured, (2) exponential increase in the size of the state space with increasing machines/jobs, and (3) the generalization of the agent to unseen scenarios. This paper presents a novel framework named GraSP-RL, GRAph neural network-based Scheduler for Production planning problems using Reinforcement Learning. It represents JSSP as a graph and trains the RL agent using features extracted using a graph neural network (GNN). While the graph is itself in the non-Euclidean space, the features extracted using the GNNs provide a rich encoding of the current production state in the Euclidean space. At its core is a custom message-passing algorithm applied to the GNN. The node features encoded by the GNN are then used by the RL agent to select the next job. Further, we cast the scheduling problem as a decentralized optimization problem in which the learning agent is assigned to all the production units individually and the agent learns asynchronously from the experience collected on all the other production units. The GraSP-RL is then applied to a complex injection molding production environment with 30 jobs and 4 machines. The task is to minimize the makespan of the production plan. The schedule planned by GraSP-RL is then compared and analyzed with a priority dispatch rule algorithm like first-in-first-out (FIFO) and metaheuristics like tabu search (TS) and genetic algorithm (GA). The proposed GraSP-RL outperforms the FIFO, TS, and GA for the trained task of planning 30 jobs in JSSP. We further test the generalization capability of the trained agent on two different problem classes: Open shop system (OSS) and Reactive JSSP (RJSSP). In these modified problem classes our method produces results better than FIFO and comparable results to TS and GA, without any further training while also providing schedules instantly.

引用

页码：91 / 102

页数：12

共 50 条

[31] Robotic Motion Planning Based on Deep Reinforcement Learning and Artificial Neural Networks
Liu, Huashan
Li, Xiangjian
Dong, Menghua
Gu, Yuqing
Shen, Bo
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024,
[32] Automatic Intersection Management in Mixed Traffic Using Reinforcement Learning and Graph Neural Networks
Klimke, Marvin
Voelz, Benjamin
Buchholz, Michael
2023 IEEE INTELLIGENT VEHICLES SYMPOSIUM, IV, 2023,
[33] A generic intelligent routing method using deep reinforcement learning with graph neural networks
Huang, Wanwei
Yuan, Bo
Wang, Sunan
Zhang, Jianwei
Li, Junfei
Zhang, Xiaohui
IET COMMUNICATIONS, 2022, 16 (19) : 2343 - 2351
[34] Power allocation using spatio-temporal graph neural networks and reinforcement learning
Jamshidiha, Saeed
Pourahmadi, Vahid
Mohammadi, Abbas
Bennis, Mehdi
WIRELESS NETWORKS, 2025, 31 (02) : 1163 - 1176
[35] GNOSIS: Proactive Image Placement Using Graph Neural Networks & Deep Reinforcement Learning
Theodoropoulos, Theodoros
Makris, Antonios
Psomakelis, Evangelos
Carlini, Emanuele
Mordacchini, Matteo
Dazzi, Patrizio
Tserpes, Konstantinos
2023 IEEE 16TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, CLOUD, 2023, : 120 - 128
[36] Reward shaping using directed graph convolution neural networks for reinforcement learning and games
Sang, Jianghui
Ahmad Khan, Zaki
Yin, Hengfu
Wang, Yupeng
FRONTIERS IN PHYSICS, 2023, 11
[37] Reinforcement learning-based secure training for adversarial defense in graph neural networks
An, Dongdong
Yang, Yi
Gao, Xin
Qi, Hongda
Yang, Yang
Ye, Xin
Li, Maozhen
Zhao, Qin
NEUROCOMPUTING, 2025, 630
[38] PoisonedGNN: Backdoor Attack on Graph Neural Networks-Based Hardware Security Systems
Alrahis, Lilas
Patnaik, Satwik
Hanif, Muhammad Abdullah
Shafique, Muhammad
Sinanoglu, Ozgur
IEEE TRANSACTIONS ON COMPUTERS, 2023, 72 (10) : 2822 - 2834
[39] Improved Graph Convolutional Neural Networks-based Cellular Network Fault Diagnosis
Gao, Zongzhen
Liu, Wenlai
EKSPLOATACJA I NIEZAWODNOSC-MAINTENANCE AND RELIABILITY, 2025, 27 (02):
[40] Combining Reinforcement Learning Algorithms with Graph Neural Networks to Solve Dynamic Job Shop Scheduling Problems
Yang, Zhong
Bi, Li
Jiao, Xiaogang
PROCESSES, 2023, 11 (05)

← 1 2 3 4 5 →