Deep Reinforcement Learning Based Optimization Algorithm for Permutation Flow-Shop Scheduling

被引:66
作者
Pan, Zixiao [1 ]
Wang, Ling [1 ]
Wang, Jingjing [1 ]
Lu, Jiawen [2 ]
机构
[1] Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China
[2] Huawei Noahs Ark Lab, Shenzhen 518129, Peoples R China
来源
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE | 2023年 / 7卷 / 04期
基金
中国国家自然科学基金;
关键词
Optimization; Job shop scheduling; Decoding; Reinforcement learning; Dynamic scheduling; Encoding; Computational intelligence; deep neural network; flow-shop scheduling; optimization algorithm; improvement strategy; HEURISTIC ALGORITHM; MINIMIZE MAKESPAN; M-MACHINE; N-JOB;
D O I
10.1109/TETCI.2021.3098354
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As a new analogy paradigm of human learning process, reinforcement learning (RL) has become an emerging topic in computational intelligence (CI). The synergy between the RL and CI is an emerging way to develop efficient solution algorithms for solving complex combinatorial optimization (CO) problems like machine scheduling problem. In this paper, we proposed an efficient optimization algorithm based on Deep RL for solving permutation flow-shop scheduling problem (PFSP) to minimize the maximum completion time. Firstly, a new deep neural network (PFSPNet) is designed for the PFSP to achieve the end-to-end output without limitation of problem sizes. Secondly, an actor-critic method of RL is used to train the PFSPNet without depending on the collection of high-quality labelled data. Thirdly, an improvement strategy is designed to refine the solution provided by the PFSPNet. Simulation results and statistical comparison show that the proposed optimization algorithm based on deep RL can obtain better results than the existing heuristics in similar computational time for solving the PFSP.
引用
收藏
页码:983 / 994
页数:12
相关论文
共 50 条
  • [21] A DISCRETE BUTTERFLY-INSPIRED OPTIMIZATION ALGORITHM FOR SOLVING PERMUTATION FLOW-SHOP SCHEDULING PROBLEMS
    Qi, X.
    Yuan, Z.
    Han, X.
    Liu, S.
    [J]. NEURAL NETWORK WORLD, 2020, 30 (04) : 211 - 229
  • [22] A novel deep reinforcement learning-based algorithm for multi-objective energy-efficient flow-shop scheduling
    Liang, Peng
    Xiao, Pengfei
    Li, Zeya
    Luo, Min
    Zhang, Chaoyong
    [J]. IET COLLABORATIVE INTELLIGENT MANUFACTURING, 2024, 6 (04)
  • [23] A computationally efficient Branch-and-Bound algorithm for the permutation flow-shop scheduling problem
    Gmys, Jan
    Mezmaz, Mohand
    Melab, Nouredine
    Tuyttens, Daniel
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2020, 284 (03) : 814 - 833
  • [24] An Improved Artificial Bee Colony Algorithm With Q-Learning for Solving Permutation Flow-Shop Scheduling Problems
    Li, Hanxiao
    Gao, Kaizhou
    Duan, Pei-Yong
    Li, Jun-Qing
    Zhang, Le
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 53 (05): : 2684 - 2693
  • [26] Multiprocessor Scheduling Based on Evolutionary Technique for Solving Permutation Flow Shop Problem
    Priya, Annu
    Sahana, Sudip Kumar
    [J]. IEEE ACCESS, 2020, 8 : 53151 - 53161
  • [27] Optimization of job shop scheduling problem based on deep reinforcement learning
    Qiao, Dongping
    Duan, Lvqi
    Li, Honglei
    Xiao, Yanqiu
    [J]. EVOLUTIONARY INTELLIGENCE, 2024, 17 (01) : 371 - 383
  • [28] Optimization of job shop scheduling problem based on deep reinforcement learning
    Dongping Qiao
    Lvqi Duan
    HongLei Li
    Yanqiu Xiao
    [J]. Evolutionary Intelligence, 2024, 17 : 371 - 383
  • [29] A DE-based Algorithm for Reentrant Permutation Flow-shop Scheduling with Different Job Reentrant Times
    Qian, Bin
    Wan, Jing
    Liu, Bo
    Hu, Rong
    Che, Guo-Lin
    [J]. PROCEEDINGS OF THE 2013 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN SCHEDULING (CISCHED), 2013, : 22 - 27
  • [30] An effective estimation of distribution algorithm for solving the distributed permutation flow-shop scheduling problem
    Wang, Sheng-yao
    Wang, Ling
    Liu, Min
    Xu, Ye
    [J]. INTERNATIONAL JOURNAL OF PRODUCTION ECONOMICS, 2013, 145 (01) : 387 - 396