Multi-Task Multi-Agent Reinforcement Learning for Real-Time Scheduling of a Dual-Resource Flexible Job Shop with Robots

Cited by: 13

Authors:

Zhu, Xiaofei [1]

Xu, Jiazhong [1]

Ge, Jianghua [1]

Wang, Yaping [1]

Xie, Zhiqiang [2]

Affiliations:

[1] Harbin Univ Sci & Technol, Key Lab Adv Mfg & Intelligent Technol, Minist Educ, Harbin 150080, Peoples R China

[2] Harbin Univ Sci & Technol, Coll Comp Sci & Technol, Harbin 150080, Peoples R China

Source:

PROCESSES | 2023 / Volume 11 / Issue 01

Funding:

National Natural Science Foundation of China;

Keywords:

real-time scheduling; dual-resource constraint; multi-task multi-agent reinforcement learning; flexible job shop scheduling; flexible process planning; ALGORITHM; MACHINES;

DOI:

10.3390/pr11010267

Chinese Library Classification (CLC) number:

TQ [Chemical Industry];

Discipline classification code:

0817;

Abstract:

This paper studies a real-time scheduling problem of a dual-resource flexible job shop with robots. Multiple independent robots and the machine sets they supervise form their own work cells. First, a mixed integer programming model is established, which considers the scheduling of jobs and machines within the work cells, and of jobs between work cells, based on process plan flexibility. Second, to make real-time scheduling decisions, a multi-task multi-agent reinforcement learning framework based on centralized training and decentralized execution is proposed. Each agent interacts with the environment and completes three decision-making tasks: job sequencing, machine selection, and process planning. During centralized training, the value network is used to evaluate and optimize the policy network to achieve multi-agent cooperation, and an attention mechanism is introduced into the policy network to share information among the tasks. During decentralized execution, each agent makes its multiple task decisions from local observations according to the trained policy network. The observations, actions, and rewards are then designed. Rewards include global and local rewards, which are decomposed into sub-rewards corresponding to the tasks. The reinforcement learning training algorithm is designed based on a double deep Q-network. Finally, the scheduling simulation environment is derived from benchmarks, and the experimental results show the effectiveness of the proposed method.
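
The abstract states that the training algorithm is based on a double deep Q-network. As a rough illustration only, the sketch below computes the standard double DQN bootstrap target for one transition: the online network picks the next action and the target network scores it. It is not the authors' implementation. The function name `double_dqn_target`, the discount `gamma`, and the plain-list Q-values are assumptions made for this example, and the paper's multi-task, multi-agent structure and reward decomposition are not modeled.

```python
from typing import Sequence


def double_dqn_target(
    reward: float,
    next_q_online: Sequence[float],
    next_q_target: Sequence[float],
    gamma: float = 0.99,
    done: bool = False,
) -> float:
    """Standard double DQN target for a single transition (illustrative).

    next_q_online: Q-values of the next state from the online network.
    next_q_target: Q-values of the next state from the target network.
    All names here are hypothetical and not taken from the paper.
    """
    if done:
        # No bootstrapping from a terminal state.
        return reward
    # Action selection uses the online network ...
    best_action = max(range(len(next_q_online)), key=lambda a: next_q_online[a])
    # ... while action evaluation uses the target network.
    return reward + gamma * next_q_target[best_action]


# Toy values: the online network prefers action 1, so the target
# network's estimate for action 1 (1.5) is bootstrapped, not its max (4.0).
example_target = double_dqn_target(
    reward=1.0,
    next_q_online=[1.0, 3.0, 2.0],
    next_q_target=[0.5, 1.5, 4.0],
    gamma=0.9,
)
print(example_target)
```

Separating action selection from action evaluation is what distinguishes double DQN from plain DQN, which would bootstrap from the target network's own maximum and tends to overestimate values.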


Pages: 28
