Preference learning based deep reinforcement learning for flexible job shop scheduling problem

Authors
Liu, Xinning [1]
Han, Li [1]
Kang, Ling [2]
Liu, Jiannan [1]
Miao, Huadong [3]
Affiliations
[1] Dalian Neusoft Univ Informat, Sch Comp & Software, Dalian 116023, Liaoning, Peoples R China
[2] Dalian Neusoft Univ Informat, Neusoft Res Inst, Dalian 116023, Liaoning, Peoples R China
[3] SNOW China Beijing Co Ltd, Dalian Branch, Dalian 116023, Liaoning, Peoples R China
Keywords
Flexible job shop scheduling problem; Preference learning; Proximal policy optimization; Deep reinforcement learning; BENCHMARKS; ALGORITHM
DOI
10.1007/s40747-024-01772-x
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The flexible job shop scheduling problem (FJSP) holds significant importance in both theoretical research and practical applications. Given the complexity and diversity of FJSP, improving the generalization and quality of scheduling methods has become a hot topic of interest in both industry and academia. To address this, this paper proposes a Preference-Based Mask-PPO (PBMP) algorithm, which leverages the strengths of preference learning and invalid action masking to optimize FJSP solutions. First, a reward predictor based on preference learning is designed to model reward prediction by comparing random fragments, eliminating the need for complex reward function design. Second, a novel intelligent switching mechanism is introduced, where proximal policy optimization (PPO) is employed to enhance exploration during sampling, and masked proximal policy optimization (Mask-PPO) refines the action space during training, significantly improving efficiency and solution quality. Furthermore, the Pearson correlation coefficient (PCC) is used to evaluate the performance of the preference model. Finally, comparative experiments on FJSP benchmark instances of varying sizes demonstrate that PBMP outperforms traditional scheduling strategies such as dispatching rules, OR-Tools, and other deep reinforcement learning (DRL) algorithms, achieving superior scheduling policies and faster convergence. Even with increasing instance sizes, preference learning proves to be an effective reward mechanism in reinforcement learning for FJSP. The ablation study further highlights the advantages of each key component in the PBMP algorithm across performance metrics.
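The three building blocks named in the abstract (a preference-based reward predictor, invalid action masking, and the Pearson correlation coefficient) can be illustrated with a minimal Python sketch. This record does not give the paper's formulation, so everything here is an assumption: the Bradley-Terry form of the preference probability, the cross-entropy loss, and all function names are illustrative and should not be read as the PBMP implementation.

```python
import math


def preference_probability(rewards_a, rewards_b):
    """Probability that segment A is preferred over segment B.

    Assumption: a Bradley-Terry model on the summed predicted rewards of
    each segment. The paper's actual model is not specified in this record.
    """
    diff = sum(rewards_a) - sum(rewards_b)
    # Numerically stable logistic function.
    if diff >= 0:
        return 1.0 / (1.0 + math.exp(-diff))
    e = math.exp(diff)
    return e / (1.0 + e)


def preference_loss(rewards_a, rewards_b, label):
    """Cross-entropy loss; label is 1.0 if A is preferred, 0.0 if B is."""
    p = preference_probability(rewards_a, rewards_b)
    eps = 1e-12  # guards against log(0)
    return -(label * math.log(p + eps) + (1.0 - label) * math.log(1.0 - p + eps))


def masked_softmax(logits, valid):
    """Softmax restricted to valid actions; invalid actions get probability 0."""
    if not any(valid):
        raise ValueError("at least one action must be valid")
    top = max(l for l, v in zip(logits, valid) if v)
    exps = [math.exp(l - top) if v else 0.0 for l, v in zip(logits, valid)]
    total = sum(exps)
    return [e / total for e in exps]


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    std_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    std_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (std_x * std_y)
```

In this sketch, `pearson` would compare predicted segment returns against a reference signal such as negative makespan, and `masked_softmax` would prevent the policy from sampling operation-machine pairs that are infeasible in the current state. Both uses are hypothetical readings of the abstract.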
Pages: 23