Multi-Agent Reinforcement Learning for Job Shop Scheduling in Dynamic Environments

被引:6
作者
Pu, Yu [1 ]
Li, Fang [1 ]
Rahimifard, Shahin [2 ]
机构
[1] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510006, Peoples R China
[2] Loughborough Univ, Sch Mech Elect & Mfg Engn, Loughborough LE11 3TU, England
关键词
multi-agent proximal policy optimization; job shop scheduling problem; graph neural network; green scheduling; ALGORITHM; SELECTION; NETWORK; RULES;
D O I
10.3390/su16083234
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
In response to the challenges of dynamic adaptability, real-time interactivity, and dynamic optimization posed by the application of existing deep reinforcement learning algorithms in solving complex scheduling problems, this study proposes a novel approach using graph neural networks and deep reinforcement learning to complete the task of job shop scheduling. A distributed multi-agent scheduling architecture (DMASA) is constructed to maximize global rewards, modeling the intelligent manufacturing job shop scheduling problem as a sequential decision problem represented by graphs and using a Graph Embedding-Heterogeneous Graph Neural Network (GE-HetGNN) to encode state nodes and map them to the optimal scheduling strategy, including machine matching and process selection strategies. Finally, an actor-critic architecture-based multi-agent proximal policy optimization algorithm is employed to train the network and optimize the decision-making process. Experimental results demonstrate that the proposed framework exhibits generalizability, outperforms commonly used scheduling rules and RL-based scheduling methods on benchmarks, shows better stability than single-agent scheduling architectures, and breaks through the instance-size constraint, making it suitable for large-scale problems. We verified the feasibility of our proposed method in a specific experimental environment. The experimental results demonstrate that our research can achieve formal modeling and mapping with specific physical processing workshops, which aligns more closely with real-world green scheduling issues and makes it easier for subsequent researchers to integrate algorithms with actual environments.
引用
收藏
页数:26
相关论文
共 60 条
[1]   Multi-objective enhanced memetic algorithm for green job shop scheduling with uncertain times [J].
Afsar, Sezin ;
Jose Palacios, Juan ;
Puente, Jorge ;
Vela, Camino R. ;
Gonzalez-Rodriguez, Ines .
SWARM AND EVOLUTIONARY COMPUTATION, 2022, 68
[2]   A review on evolution of production scheduling with neural networks [J].
Akyol, Derya Eren ;
Bayhan, G. Mirac .
COMPUTERS & INDUSTRIAL ENGINEERING, 2007, 53 (01) :95-122
[3]  
Azemi F., 2019, P UBT INT C 2019
[4]   Solving comprehensive dynamic job shop scheduling problem by using a GRASP-based approach [J].
Baykasoglu, Adil ;
Karaslan, Fatma S. .
INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2017, 55 (11) :3308-3325
[5]  
BEHNKE D., 2012, TEST INSTANCES FLEXI
[6]  
Brandimarte P., 1993, Annals of Operations Research, V41, P157, DOI 10.1007/BF02023073
[7]   An effective backtracking search algorithm for multi-objective flexible job shop scheduling considering new job arrivals and energy consumption [J].
Caldeira, Rylan H. ;
Gnanavelbabu, A. ;
Vaidyanathan, T. .
COMPUTERS & INDUSTRIAL ENGINEERING, 2020, 149
[8]   SF-FWA: A Self-Adaptive Fast Fireworks Algorithm for effective large-scale optimization [J].
Chen, Maiyue ;
Tan, Ying .
SWARM AND EVOLUTIONARY COMPUTATION, 2023, 80
[9]  
Cimino A., 2023, P EUROPEAN MODELING
[10]   Hybrid of human learning optimization algorithm and particle swarm optimization algorithm with scheduling strategies for the flexible job-shop scheduling problem [J].
Ding, Haojie ;
Gu, Xingsheng .
NEUROCOMPUTING, 2020, 414 (414) :313-332