A dynamic scheduling method with Conv-Dueling and generalized representation based on reinforcement learning

被引：1

作者：

Xia, Minghao ^{[1
]}

Liu, Haibin ^{[1
]}

Li, Mingfei ^{[1
]}

Wang, Long ^{[1
]}

机构：

[1] Beijing Univ Technol, Fac Mat & Mfg, Beijing 100124, Peoples R China

来源：

INTERNATIONAL JOURNAL OF INDUSTRIAL ENGINEERING COMPUTATIONS | 2023年 / 14卷 / 04期

关键词：

Scheduling problem; Deep reinforcement learning; Dynamic disturbance; Conv-dueling network model; JOB; ALGORITHM;

D O I：

10.5267/j.ijiec.2023.6.003

中图分类号：

T [工业技术];

学科分类号：

08 ;

摘要：

In modern industrial manufacturing, there are uncertain dynamic disturbances between processing machines and jobs which will disrupt the original production plan. This research focuses on dynamic multi-objective flexible scheduling problems such as the multi-constraint relationship among machines, jobs, and uncertain disturbance events. The possible disturbance events include job insertion, machine breakdown, and processing time change. The paper proposes a conv-dueling network model, a multidimensional state representation of the job processing information, and multiple scheduling objectives for minimizing makespan and delay time, while maximizing the completion punctuality rate. We design a multidimensional state space that includes job and machine processing information, an efficient and complete intelligent agent scheduling action space, and a compound scheduling reward function that combines the main task and the branch task. The unsupervised training of the network model utilizes the dueling-double-deep Q-network (D3QN) algorithm. Finally, based on the multi-constraint and multi-disturbance production environment information, the multidimensional state representation matrix of the job is used as input and the optimal scheduling rules are output after the feature extraction of the conv-dueling network model and decision making. This study carries out simulation experiments on 50 test cases. The results show the proposed conv-dueling network model can quickly converge for DQN, DDQN, and D3QN algorithms, and has good stability and universality. The experimental results indicate that the scheduling algorithm proposed in this paper outperforms DQN, DDQN, and single scheduling algorithms in all three scheduling objectives. It also demonstrates high robustness and excellent comprehensive scheduling performance.

引用

页码：805 / 820

页数：16

共 41 条

[1] Dynamic job-shop scheduling using reinforcement learning agents
Aydin, ME
Öztemel, E
[J]. ROBOTICS AND AUTONOMOUS SYSTEMS, 2000, 33 (2-3) : 169 - 178
[2] A MARKOVIAN DECISION PROCESS
BELLMAN, R
[J]. JOURNAL OF MATHEMATICS AND MECHANICS, 1957, 6 (05): : 679 - 684
[3] JOB-SHOP SCHEDULING WITH MULTIPURPOSE MACHINES
BRUCKER, P
SCHLIE, R
[J]. COMPUTING, 1990, 45 (04) : 369 - 375
[4] Burggraf Peter, 2022, Procedia CIRP, P57, DOI 10.1016/j.procir.2022.09.024
[5] An Optimized Scheduling Algorithm on a Cloud Workflow Using a Discrete Particle Swarm
Cao, Jianfang
Chen, Junjie
Zhao, Qingshan
[J]. CYBERNETICS AND INFORMATION TECHNOLOGIES, 2014, 14 (01) : 25 - 39
[6] Deep Reinforcement Learning for Dynamic Flexible Job Shop Scheduling with Random Job Arrival
Chang, Jingru
Yu, Dong
Hu, Yi
He, Wuwei
Yu, Haoyu
[J]. PROCESSES, 2022, 10 (04)
[7] CHAO LF, 1993, ACM IEEE D, P566
[8] Adaptive grid job scheduling with genetic algorithms
Gao, Y
Rong, HQ
Huang, JZ
[J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2005, 21 (01): : 151 - 161
[9] Garey M. R., 1976, Mathematics of Operations Research, V1, P117, DOI 10.1287/moor.1.2.117
[10] A hybrid genetic algorithm for the job shop scheduling problem
Gonçalves, JF
Mendes, JJDM
Resende, MGC
[J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2005, 167 (01) : 77 - 95

← 1 2 3 4 5 →