Design patterns of deep reinforcement learning models for job shop scheduling problems

被引:6
作者
Wang, Shiyong [1 ]
Li, Jiaxian [1 ]
Jiao, Qingsong [2 ]
Ma, Fang [3 ]
机构
[1] South China Univ Technol, Sch Mech & Automot Engn, Guangzhou 510640, Peoples R China
[2] South China Univ Technol, Dept Elect Business, Guangzhou 510640, Peoples R China
[3] China Natl Elect Apparat Res Inst Co Ltd, Guangzhou 510300, Peoples R China
基金
国家重点研发计划;
关键词
Production scheduling; Reinforcement learning; Smart manufacturing; Industry; 4.0; OPTIMIZATION;
D O I
10.1007/s10845-024-02454-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Production scheduling has a significant role when optimizing production objectives such as production efficiency, resource utilization, cost control, energy-saving, and emission reduction. Currently, deep reinforcement learning-based production scheduling methods achieve roughly equivalent precision as the widely used meta-heuristic algorithms while exhibiting higher efficiency, along with powerful generalization abilities. Therefore, this new paradigm has drawn much attention and plenty of research results have been reported. By reviewing available deep reinforcement learning models for the job shop scheduling problems, the typical design patterns and pattern combinations of the common components, i.e., agent, environment, state, action, and reward, were identified. Around this essential contribution, the architecture and procedure of training deep reinforcement learning scheduling models and applying resultant scheduling solvers were generalized. Furthermore, the key evaluation indicators were summarized and the promising research areas were outlined. This work surveys several deep reinforcement learning models for a range of production scheduling problems.
引用
收藏
页数:19
相关论文
共 75 条
  • [61] Sutton RS, 2000, ADV NEUR IN, V12, P1057
  • [62] Tassel P, 2021, Arxiv, DOI arXiv:2104.03760
  • [63] DEEP Q-NETWORK MODEL FOR DYNAMIC JOB SHOP SCHEDULING PPROBLEM BASED ON DISCRETE EVENT SIMULATION
    Turgut, Yakup
    Bozdag, Cafer Erhan
    [J]. 2020 WINTER SIMULATION CONFERENCE (WSC), 2020, : 1551 - 1559
  • [64] van Ekeris T., 2021, P C PROD SYST LOG CP, P709, DOI DOI 10.15488/11231
  • [65] Dynamic job-shop scheduling in smart manufacturing using deep reinforcement learning
    Wang, Libing
    Hu, Xin
    Wang, Yin
    Xu, Sujie
    Ma, Shijun
    Yang, Kexin
    Liu, Zhijun
    Wang, Weidong
    [J]. COMPUTER NETWORKS, 2021, 190 (190)
  • [66] Wang ZY, 2016, Arxiv, DOI [arXiv:1511.06581, DOI 10.48550/ARXIV.1511.06581]
  • [67] Waschneck B, 2018, ASMC PROC, P301, DOI 10.1109/ASMC.2018.8373191
  • [68] Webster J, 2002, MIS QUART, V26, pXIII
  • [69] Workneh Abebaw Degu, 2023, Artificial Intelligence and Industrial Applications: Smart Operation Management. Lecture Notes in Networks and Systems (771), P137, DOI 10.1007/978-3-031-43524-9_10
  • [70] A spatial pyramid pooling-based deep reinforcement learning model for dynamic job-shop scheduling problem
    Wu, Xinquan
    Yan, Xuefeng
    [J]. COMPUTERS & OPERATIONS RESEARCH, 2023, 160