A spatial pyramid pooling-based deep reinforcement learning model for dynamic job-shop scheduling problem

被引：16

作者：

Wu, Xinquan ^{[1
]}

Yan, Xuefeng ^{[1
,2
]}

机构：

[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 210016, Peoples R China

[2] Collaborat Innovat Ctr Novel Software Technol & In, Nanjing 210016, Peoples R China

来源：

COMPUTERS & OPERATIONS RESEARCH | 2023年 / 160卷

关键词：

Dynamic job shop scheduling problem; Deep reinforcement learning; Spatial pyramid pooling networks; PPO; Random job arrivals; FEATURE-SELECTION; ALGORITHM; BENCHMARKS; HEURISTICS; SEARCH; RULES;

D O I：

10.1016/j.cor.2023.106401

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

The dynamic job-shop scheduling problem (DJSP) is a typical of scheduling tasks where rescheduling is performed when encountering unexpected events such as random job arrivals and rush order. However, the current rescheduling approaches cannot reuse the trained scheduling policies or the experiences due to the variant size of scheduling problems. In this paper, we propose a deep reinforcement learning (DRL) scheduling model for DJSP based on spatial pyramid pooling networks (SPP-Net). A new state representation is proposed based on the machine matrix and remaining time matrix which is decomposed from the scheduling instance matrix. And a new reward function is derived from the area of total scheduling time where the accumulated reward is negatively linearly dependent with the make-span of a scheduling task. Moreover, a size-agnostic scheduling policy is designed based on the SPP-Net and SoftMax function, which is trained by the proximal policy optimization (PPO). Besides, various paired priority dispatching rules (PDR) are used as available actions. Static experiments on classic benchmark instances show that our scheduling model achieves better results on average than existing DRL methods. In addition, dynamic scheduling experiments are tested and our model obtains better results than the PDR scheduling methods in reasonable time when encountering unexpected events such as random job arrivals and rush order.

引用

页数：16

共 65 条

[1] THE SHIFTING BOTTLENECK PROCEDURE FOR JOB SHOP SCHEDULING [J].

ADAMS, J ;

BALAS, E ;

ZAWACK, D .

MANAGEMENT SCIENCE, 1988, 34 (03) :391-401

[2] End-to-end on-line rescheduling from Gantt chart images using deep reinforcement learning [J].

Andres Palombarini, Jorge ;

Carlos Martinez, Ernesto .

INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2022, 60 (14) :4434-4463

[3]

Applegate D., 1991, ORSA Journal on Computing, V3, P149, DOI 10.1287/ijoc.3.2.149

[4] A parallel artificial bee colony algorithm for the job shop scheduling problem with a dynamic migration strategy [J].

Asadzadeh, Leila .

COMPUTERS & INDUSTRIAL ENGINEERING, 2016, 102 :359-367

[5] Dynamic job-shop scheduling using reinforcement learning agents [J].

Aydin, ME ;

Öztemel, E .

ROBOTICS AND AUTONOMOUS SYSTEMS, 2000, 33 (2-3) :169-178

[6]

Bahdanau D, 2016, Arxiv, DOI arXiv:1409.0473

[7] Solving comprehensive dynamic job shop scheduling problem by using a GRASP-based approach [J].

Baykasoglu, Adil ;

Karaslan, Fatma S. .

INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2017, 55 (11) :3308-3325

[8] Automated Design of Production Scheduling Heuristics: A Review [J].

Branke, Juergen ;

Su Nguyen ;

Pickardt, Christoph W. ;

Zhang, Mengjie .

IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2016, 20 (01) :110-124

[9]

Burggraf Peter, 2022, Procedia CIRP, P57, DOI [10.1016/j.procir.2022.09.024, 10.1016/j.procir.2022.09.024]

[10] AN ALGORITHM FOR SOLVING THE JOB-SHOP PROBLEM [J].

CARLIER, J ;

PINSON, E .

MANAGEMENT SCIENCE, 1989, 35 (02) :164-176

← 1 2 3 4 5 6 7 →