Reward Shaping for Job Shop Scheduling

Cited by: 0

Authors:

Nasuta, Alexander [1]

Kemmerling, Marco [1]

Luetticke, Daniel [1]

Schmitt, Robert H. [1]

Affiliations:

[1] Rhein Westfal TH Aachen, Inst Informat Management Mech Engn WZLMQ IMA, Aachen, Germany

Source:

MACHINE LEARNING, OPTIMIZATION, AND DATA SCIENCE, LOD 2023, PT I | 2024 / Vol. 14505

Keywords:

Production Scheduling; Job Shop Scheduling; Disjunctive Graph; Reinforcement Learning; Reward Shaping;

DOI:

10.1007/978-3-031-53969-5_16

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Effective production scheduling is an integral part of the success of many industrial enterprises. In particular, the job shop problem (JSP) is highly relevant for flexible production scheduling in the modern era. Recently, numerous approaches for the JSP using reinforcement learning (RL) have been formulated. Different approaches employ different reward functions, but the individual effects of these reward functions on the achieved solution quality have received insufficient attention in the literature. We examine various reward functions using a novel flexible RL environment for the JSP based on the disjunctive graph approach. Our experiments show that a formulation of the reward function based on machine utilization is most appropriate for minimizing the makespan of a JSP among the investigated reward functions.

Pages: 197-211

Page count: 15

References (18 in total):

[1] Applegate D., 1991, ORSA Journal on Computing, V3, P149, DOI 10.1287/ijoc.3.2.149

[2] Biewald Lukas, 2020, Experiment tracking with weights and biases

[3] The disjunctive graph machine representation of the job shop scheduling problem [J]. Blazewicz, J.; Pesch, E.; Sterna, M. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2000, 127 (02): 317-331

[4] Blazewicz J., 2019, INT HDB INFORM SYSTE, Vsecond, DOI 10.1007/978-3-319-99849-7

[5] Burda Y., 2018, INT C LEARN REPR

[6] On reliability of reinforcement learning based production scheduling systems: a comparative survey [J]. de Puiseau, Constantin Waubert; Meyes, Richard; Meisen, Tobias. JOURNAL OF INTELLIGENT MANUFACTURING, 2022, 33 (04): 911-927

[7] Fisher H., 1963, PROBABILISTIC LEARNI, V45, P225, DOI 10.1109/ICAL.2009.5262867

[8] Grzes M, 2017, AAMAS'17: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, P565

[9] Perron L., 2022, Or-tools

[10] Pinedo M., 2005, Planning and scheduling in manufacturing and services, DOI 10.1007/B139030
