A dynamic scheduling method with Conv-Dueling and generalized representation based on reinforcement learning

被引:1
作者
Xia, Minghao [1 ]
Liu, Haibin [1 ]
Li, Mingfei [1 ]
Wang, Long [1 ]
机构
[1] Beijing Univ Technol, Fac Mat & Mfg, Beijing 100124, Peoples R China
关键词
Scheduling problem; Deep reinforcement learning; Dynamic disturbance; Conv-dueling network model; JOB; ALGORITHM;
D O I
10.5267/j.ijiec.2023.6.003
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In modern industrial manufacturing, there are uncertain dynamic disturbances between processing machines and jobs which will disrupt the original production plan. This research focuses on dynamic multi-objective flexible scheduling problems such as the multi-constraint relationship among machines, jobs, and uncertain disturbance events. The possible disturbance events include job insertion, machine breakdown, and processing time change. The paper proposes a conv-dueling network model, a multidimensional state representation of the job processing information, and multiple scheduling objectives for minimizing makespan and delay time, while maximizing the completion punctuality rate. We design a multidimensional state space that includes job and machine processing information, an efficient and complete intelligent agent scheduling action space, and a compound scheduling reward function that combines the main task and the branch task. The unsupervised training of the network model utilizes the dueling-double-deep Q-network (D3QN) algorithm. Finally, based on the multi-constraint and multi-disturbance production environment information, the multidimensional state representation matrix of the job is used as input and the optimal scheduling rules are output after the feature extraction of the conv-dueling network model and decision making. This study carries out simulation experiments on 50 test cases. The results show the proposed conv-dueling network model can quickly converge for DQN, DDQN, and D3QN algorithms, and has good stability and universality. The experimental results indicate that the scheduling algorithm proposed in this paper outperforms DQN, DDQN, and single scheduling algorithms in all three scheduling objectives. It also demonstrates high robustness and excellent comprehensive scheduling performance.
引用
收藏
页码:805 / 820
页数:16
相关论文
共 41 条
  • [1] Dynamic job-shop scheduling using reinforcement learning agents
    Aydin, ME
    Öztemel, E
    [J]. ROBOTICS AND AUTONOMOUS SYSTEMS, 2000, 33 (2-3) : 169 - 178
  • [2] A MARKOVIAN DECISION PROCESS
    BELLMAN, R
    [J]. JOURNAL OF MATHEMATICS AND MECHANICS, 1957, 6 (05): : 679 - 684
  • [3] JOB-SHOP SCHEDULING WITH MULTIPURPOSE MACHINES
    BRUCKER, P
    SCHLIE, R
    [J]. COMPUTING, 1990, 45 (04) : 369 - 375
  • [4] Burggraf Peter, 2022, Procedia CIRP, P57, DOI 10.1016/j.procir.2022.09.024
  • [5] An Optimized Scheduling Algorithm on a Cloud Workflow Using a Discrete Particle Swarm
    Cao, Jianfang
    Chen, Junjie
    Zhao, Qingshan
    [J]. CYBERNETICS AND INFORMATION TECHNOLOGIES, 2014, 14 (01) : 25 - 39
  • [6] Deep Reinforcement Learning for Dynamic Flexible Job Shop Scheduling with Random Job Arrival
    Chang, Jingru
    Yu, Dong
    Hu, Yi
    He, Wuwei
    Yu, Haoyu
    [J]. PROCESSES, 2022, 10 (04)
  • [7] CHAO LF, 1993, ACM IEEE D, P566
  • [8] Adaptive grid job scheduling with genetic algorithms
    Gao, Y
    Rong, HQ
    Huang, JZ
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2005, 21 (01): : 151 - 161
  • [9] Garey M. R., 1976, Mathematics of Operations Research, V1, P117, DOI 10.1287/moor.1.2.117
  • [10] A hybrid genetic algorithm for the job shop scheduling problem
    Gonçalves, JF
    Mendes, JJDM
    Resende, MGC
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2005, 167 (01) : 77 - 95