A collaborative-learning multi-agent reinforcement learning method for distributed hybrid flow shop scheduling problem

被引:1
作者
Di, Yuanzhu [1 ]
Deng, Libao [1 ]
Zhang, Lili [2 ]
机构
[1] Harbin Inst Technol, Sch Informat Sci & Engn, Weihai 264209, Peoples R China
[2] Dublin City Univ, Sch Comp, Dublin, Ireland
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Multi-agent system; Reinforcement learning; Deep neural network; Collaborative learning; Distributed hybrid flow shop scheduling; problem; EVOLUTIONARY ALGORITHM; TARDINESS; MAKESPAN;
D O I
10.1016/j.swevo.2024.101764
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As the increasing level of implementation of artificial intelligence technology in solving complex engineering optimization problems, various learning mechanisms, including deep learning (DL) and reinforcement learning (RL), have been developed for manufacturing scheduling. In this paper, a collaborative-learning multi-agent RL method (CL-MARL) is proposed for solving distributed hybrid flow-shop scheduling problem (DHFSP), minimizing both makespan and total energy consumption. First, the DHFSP is formulated as the Markov decision process, the features of machines and jobs are represented as state and observation matrixes according to their characteristics, the candidate operation set is used as action space, and a reward mechanism is designed based on the machine utilization. Next, a set of critic networks and actor networks, consist of recurrent neural networks and fully connected networks, are employed to map the states and observations into the output values. Then, a novel distance matching strategy is designed for each agent to select the most appropriate action at each scheduling step. Finally, the proposed CL-MARL model is trained through multi-agent deep deterministic policy gradient algorithm in collaborative-learning manner. The numerical results prove the effectiveness of the proposed multi-agent system, and the comparisons with existing algorithms demonstrate the high-potential of CL-MARL in solving DHFSP.
引用
收藏
页数:14
相关论文
共 51 条
  • [41] Carbon peak and carbon neutrality in China: Goals, implementation path and prospects
    Wang, Yao
    Guo, Chi-hui
    Chen, Xi-jie
    Jia, Li-qiong
    Guo, Xiao-na
    Chen, Rui-shan
    Zhang, Mao-sheng
    Chen, Ze-yu
    Wang, Hao-dong
    [J]. CHINA GEOLOGY, 2021, 4 (04) : 720 - 746
  • [42] Learning to schedule dynamic distributed reconfigurable workshops using expected deep Q-network
    Yang, Shengluo
    Wang, Junyi
    Xu, Zhigang
    [J]. ADVANCED ENGINEERING INFORMATICS, 2024, 59
  • [43] Solving job shop scheduling problems via deep reinforcement learning
    Yuan, Erdong
    Cheng, Shuli
    Wang, Liejun
    Song, Shiji
    Wu, Fang
    [J]. APPLIED SOFT COMPUTING, 2023, 143
  • [44] Distributed Co-Evolutionary Memetic Algorithm for Distributed Hybrid Differentiation Flowshop Scheduling Problem
    Zhang, Guanghui
    Liu, Bo
    Wang, Ling
    Yu, Dengxiu
    Xing, Keyi
    [J]. IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2022, 26 (05) : 1043 - 1057
  • [45] Fuzzy neural network-based rescheduling decision mechanism for semiconductor manufacturing
    Zhang, J.
    Qin, W.
    Wu, L. H.
    Zhai, W. B.
    [J]. COMPUTERS IN INDUSTRY, 2014, 65 (08) : 1115 - 1125
  • [46] DeepMAG: Deep reinforcement learning with multi-agent graphs for flexible job shop scheduling
    Zhang, Jia-Dong
    He, Zhixiang
    Chan, Wing -Ho
    Chow, Chi -Yin
    [J]. KNOWLEDGE-BASED SYSTEMS, 2023, 259
  • [47] MOEA/D: A multiobjective evolutionary algorithm based on decomposition
    Zhang, Qingfu
    Li, Hui
    [J]. IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2007, 11 (06) : 712 - 731
  • [48] A Reinforcement Learning Driven Cooperative Meta-Heuristic Algorithm for Energy-Efficient Distributed No-Wait Flow-Shop Scheduling With Sequence-Dependent Setup Time
    Zhao, Fuqing
    Jiang, Tao
    Wang, Ling
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (07) : 8427 - 8440
  • [49] A Self-Learning Discrete Jaya Algorithm for Multiobjective Energy-Efficient Distributed No-Idle Flow-Shop Scheduling Problem in Heterogeneous Factory System
    Zhao, Fuqing
    Ma, Ru
    Wang, Ling
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (12) : 12675 - 12686
  • [50] A Population-Based Iterated Greedy Algorithm for Distributed Assembly No-Wait Flow-Shop Scheduling Problem
    Zhao, Fuqing
    Xu, Zesong
    Wang, Ling
    Zhu, Ningning
    Xu, Tianpeng
    Jonrinaldi, J.
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (05) : 6692 - 6705