A collaborative-learning multi-agent reinforcement learning method for distributed hybrid flow shop scheduling problem

被引：1

作者：

Di, Yuanzhu ^{[1
]}

Deng, Libao ^{[1
]}

Zhang, Lili ^{[2
]}

机构：

[1] Harbin Inst Technol, Sch Informat Sci & Engn, Weihai 264209, Peoples R China

[2] Dublin City Univ, Sch Comp, Dublin, Ireland

来源：

SWARM AND EVOLUTIONARY COMPUTATION | 2024年 / 91卷

基金：

中国国家自然科学基金; 国家重点研发计划;

关键词：

Multi-agent system; Reinforcement learning; Deep neural network; Collaborative learning; Distributed hybrid flow shop scheduling; problem; EVOLUTIONARY ALGORITHM; TARDINESS; MAKESPAN;

D O I：

10.1016/j.swevo.2024.101764

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

As the increasing level of implementation of artificial intelligence technology in solving complex engineering optimization problems, various learning mechanisms, including deep learning (DL) and reinforcement learning (RL), have been developed for manufacturing scheduling. In this paper, a collaborative-learning multi-agent RL method (CL-MARL) is proposed for solving distributed hybrid flow-shop scheduling problem (DHFSP), minimizing both makespan and total energy consumption. First, the DHFSP is formulated as the Markov decision process, the features of machines and jobs are represented as state and observation matrixes according to their characteristics, the candidate operation set is used as action space, and a reward mechanism is designed based on the machine utilization. Next, a set of critic networks and actor networks, consist of recurrent neural networks and fully connected networks, are employed to map the states and observations into the output values. Then, a novel distance matching strategy is designed for each agent to select the most appropriate action at each scheduling step. Finally, the proposed CL-MARL model is trained through multi-agent deep deterministic policy gradient algorithm in collaborative-learning manner. The numerical results prove the effectiveness of the proposed multi-agent system, and the comparisons with existing algorithms demonstrate the high-potential of CL-MARL in solving DHFSP.

引用

页数：14

共 51 条

[41] Carbon peak and carbon neutrality in China: Goals, implementation path and prospects
Wang, Yao
Guo, Chi-hui
Chen, Xi-jie
Jia, Li-qiong
Guo, Xiao-na
Chen, Rui-shan
Zhang, Mao-sheng
Chen, Ze-yu
Wang, Hao-dong
[J]. CHINA GEOLOGY, 2021, 4 (04) : 720 - 746
[42] Learning to schedule dynamic distributed reconfigurable workshops using expected deep Q-network
Yang, Shengluo
Wang, Junyi
Xu, Zhigang
[J]. ADVANCED ENGINEERING INFORMATICS, 2024, 59
[43] Solving job shop scheduling problems via deep reinforcement learning
Yuan, Erdong
Cheng, Shuli
Wang, Liejun
Song, Shiji
Wu, Fang
[J]. APPLIED SOFT COMPUTING, 2023, 143
[44] Distributed Co-Evolutionary Memetic Algorithm for Distributed Hybrid Differentiation Flowshop Scheduling Problem
Zhang, Guanghui
Liu, Bo
Wang, Ling
Yu, Dengxiu
Xing, Keyi
[J]. IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2022, 26 (05) : 1043 - 1057
[45] Fuzzy neural network-based rescheduling decision mechanism for semiconductor manufacturing
Zhang, J.
Qin, W.
Wu, L. H.
Zhai, W. B.
[J]. COMPUTERS IN INDUSTRY, 2014, 65 (08) : 1115 - 1125
[46] DeepMAG: Deep reinforcement learning with multi-agent graphs for flexible job shop scheduling
Zhang, Jia-Dong
He, Zhixiang
Chan, Wing -Ho
Chow, Chi -Yin
[J]. KNOWLEDGE-BASED SYSTEMS, 2023, 259
[47] MOEA/D: A multiobjective evolutionary algorithm based on decomposition
Zhang, Qingfu
Li, Hui
[J]. IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2007, 11 (06) : 712 - 731
[48] A Reinforcement Learning Driven Cooperative Meta-Heuristic Algorithm for Energy-Efficient Distributed No-Wait Flow-Shop Scheduling With Sequence-Dependent Setup Time
Zhao, Fuqing
Jiang, Tao
Wang, Ling
[J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (07) : 8427 - 8440
[49] A Self-Learning Discrete Jaya Algorithm for Multiobjective Energy-Efficient Distributed No-Idle Flow-Shop Scheduling Problem in Heterogeneous Factory System
Zhao, Fuqing
Ma, Ru
Wang, Ling
[J]. IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (12) : 12675 - 12686
[50] A Population-Based Iterated Greedy Algorithm for Distributed Assembly No-Wait Flow-Shop Scheduling Problem
Zhao, Fuqing
Xu, Zesong
Wang, Ling
Zhu, Ningning
Xu, Tianpeng
Jonrinaldi, J.
[J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (05) : 6692 - 6705

← 1 2 3 4 5 6 →