Graph neural networks-based scheduler for production planning problems using reinforcement learning

被引:12
|
作者
Hameed, Mohammed Sharafath Abdul [1 ]
Schwung, Andreas [1 ]
机构
[1] South Westphalia Univ Appl Sci, Dept Automat Technol & Learning Syst, D-59494 Soest, Germany
关键词
Job shop scheduling; Reinforcement learning; Graph neural networks; Distributed optimization; Production planning; Tabu search; Genetic algorithm; PROGRAMMING-MODELS; TABU SEARCH; GO; ALGORITHM; SYSTEMS; SHOGI; CHESS;
D O I
10.1016/j.jmsy.2023.06.005
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Reinforcement learning (RL) is increasingly adopted in job shop scheduling problems (JSSP). But RL for JSSP is usually done using a vectorized representation of machine features as the state space. It has three major problems: (1) the relationship between the machine units and the job sequence is not fully captured, (2) exponential increase in the size of the state space with increasing machines/jobs, and (3) the generalization of the agent to unseen scenarios. This paper presents a novel framework named GraSP-RL, GRAph neural network-based Scheduler for Production planning problems using Reinforcement Learning. It represents JSSP as a graph and trains the RL agent using features extracted using a graph neural network (GNN). While the graph is itself in the non-Euclidean space, the features extracted using the GNNs provide a rich encoding of the current production state in the Euclidean space. At its core is a custom message-passing algorithm applied to the GNN. The node features encoded by the GNN are then used by the RL agent to select the next job. Further, we cast the scheduling problem as a decentralized optimization problem in which the learning agent is assigned to all the production units individually and the agent learns asynchronously from the experience collected on all the other production units. The GraSP-RL is then applied to a complex injection molding production environment with 30 jobs and 4 machines. The task is to minimize the makespan of the production plan. The schedule planned by GraSP-RL is then compared and analyzed with a priority dispatch rule algorithm like first-in-first-out (FIFO) and metaheuristics like tabu search (TS) and genetic algorithm (GA). The proposed GraSP-RL outperforms the FIFO, TS, and GA for the trained task of planning 30 jobs in JSSP. We further test the generalization capability of the trained agent on two different problem classes: Open shop system (OSS) and Reactive JSSP (RJSSP). In these modified problem classes our method produces results better than FIFO and comparable results to TS and GA, without any further training while also providing schedules instantly.
引用
收藏
页码:91 / 102
页数:12
相关论文
共 50 条
  • [31] Robotic Motion Planning Based on Deep Reinforcement Learning and Artificial Neural Networks
    Liu, Huashan
    Li, Xiangjian
    Dong, Menghua
    Gu, Yuqing
    Shen, Bo
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024,
  • [32] Automatic Intersection Management in Mixed Traffic Using Reinforcement Learning and Graph Neural Networks
    Klimke, Marvin
    Voelz, Benjamin
    Buchholz, Michael
    2023 IEEE INTELLIGENT VEHICLES SYMPOSIUM, IV, 2023,
  • [33] A generic intelligent routing method using deep reinforcement learning with graph neural networks
    Huang, Wanwei
    Yuan, Bo
    Wang, Sunan
    Zhang, Jianwei
    Li, Junfei
    Zhang, Xiaohui
    IET COMMUNICATIONS, 2022, 16 (19) : 2343 - 2351
  • [34] Power allocation using spatio-temporal graph neural networks and reinforcement learning
    Jamshidiha, Saeed
    Pourahmadi, Vahid
    Mohammadi, Abbas
    Bennis, Mehdi
    WIRELESS NETWORKS, 2025, 31 (02) : 1163 - 1176
  • [35] GNOSIS: Proactive Image Placement Using Graph Neural Networks & Deep Reinforcement Learning
    Theodoropoulos, Theodoros
    Makris, Antonios
    Psomakelis, Evangelos
    Carlini, Emanuele
    Mordacchini, Matteo
    Dazzi, Patrizio
    Tserpes, Konstantinos
    2023 IEEE 16TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, CLOUD, 2023, : 120 - 128
  • [36] Reward shaping using directed graph convolution neural networks for reinforcement learning and games
    Sang, Jianghui
    Ahmad Khan, Zaki
    Yin, Hengfu
    Wang, Yupeng
    FRONTIERS IN PHYSICS, 2023, 11
  • [37] Reinforcement learning-based secure training for adversarial defense in graph neural networks
    An, Dongdong
    Yang, Yi
    Gao, Xin
    Qi, Hongda
    Yang, Yang
    Ye, Xin
    Li, Maozhen
    Zhao, Qin
    NEUROCOMPUTING, 2025, 630
  • [38] PoisonedGNN: Backdoor Attack on Graph Neural Networks-Based Hardware Security Systems
    Alrahis, Lilas
    Patnaik, Satwik
    Hanif, Muhammad Abdullah
    Shafique, Muhammad
    Sinanoglu, Ozgur
    IEEE TRANSACTIONS ON COMPUTERS, 2023, 72 (10) : 2822 - 2834
  • [39] Improved Graph Convolutional Neural Networks-based Cellular Network Fault Diagnosis
    Gao, Zongzhen
    Liu, Wenlai
    EKSPLOATACJA I NIEZAWODNOSC-MAINTENANCE AND RELIABILITY, 2025, 27 (02):
  • [40] Combining Reinforcement Learning Algorithms with Graph Neural Networks to Solve Dynamic Job Shop Scheduling Problems
    Yang, Zhong
    Bi, Li
    Jiao, Xiaogang
    PROCESSES, 2023, 11 (05)