Design patterns of deep reinforcement learning models for job shop scheduling problems

Cited by: 6

Authors:

Wang, Shiyong ^{[1]}

Li, Jiaxian ^{[1]}

Jiao, Qingsong ^{[2]}

Ma, Fang ^{[3]}

Affiliations:

[1] South China Univ Technol, Sch Mech & Automot Engn, Guangzhou 510640, Peoples R China

[2] South China Univ Technol, Dept Elect Business, Guangzhou 510640, Peoples R China

[3] China Natl Elect Apparat Res Inst Co Ltd, Guangzhou 510300, Peoples R China

Source:

JOURNAL OF INTELLIGENT MANUFACTURING | 2024

Funding:

National Key Research and Development Program;

Keywords:

Production scheduling; Reinforcement learning; Smart manufacturing; Industry 4.0; OPTIMIZATION;

DOI:

10.1007/s10845-024-02454-8

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification code:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Production scheduling plays a significant role in optimizing production objectives such as production efficiency, resource utilization, cost control, energy saving, and emission reduction. Currently, deep reinforcement learning-based production scheduling methods achieve precision roughly equivalent to that of the widely used meta-heuristic algorithms while exhibiting higher efficiency and strong generalization abilities. Therefore, this new paradigm has drawn much attention, and many research results have been reported. By reviewing available deep reinforcement learning models for job shop scheduling problems, the typical design patterns and pattern combinations of the common components, i.e., agent, environment, state, action, and reward, were identified. Building on this central contribution, the architecture and procedure for training deep reinforcement learning scheduling models and applying the resultant scheduling solvers were generalized. Furthermore, the key evaluation indicators were summarized and promising research areas were outlined. This work surveys several deep reinforcement learning models for a range of production scheduling problems.

Pages: 19
