Multi-Task Multi-Agent Reinforcement Learning for Real-Time Scheduling of a Dual-Resource Flexible Job Shop with Robots

被引:13
|
作者
Zhu, Xiaofei [1 ]
Xu, Jiazhong [1 ]
Ge, Jianghua [1 ]
Wang, Yaping [1 ]
Xie, Zhiqiang [2 ]
机构
[1] Harbin Univ Sci & Technol, Key Lab Adv Mfg & Intelligent Technol, Minist Educ, Harbin 150080, Peoples R China
[2] Harbin Univ Sci & Technol, Coll Comp Sci & Technol, Harbin 150080, Peoples R China
基金
中国国家自然科学基金;
关键词
real-time scheduling; dual-resource constraint; multi-task multi-agent reinforcement learning; flexible job shop scheduling; flexible process planning; ALGORITHM; MACHINES;
D O I
10.3390/pr11010267
中图分类号
TQ [化学工业];
学科分类号
0817 ;
摘要
In this paper, a real-time scheduling problem of a dual-resource flexible job shop with robots is studied. Multiple independent robots and their supervised machine sets form their own work cells. First, a mixed integer programming model is established, which considers the scheduling problems of jobs and machines in the work cells, and of jobs between work cells, based on the process plan flexibility. Second, in order to make real-time scheduling decisions, a framework of multi-task multi-agent reinforcement learning based on centralized training and decentralized execution is proposed. Each agent interacts with the environment and completes three decision-making tasks: job sequencing, machine selection, and process planning. In the process of centralized training, the value network is used to evaluate and optimize the policy network to achieve multi-agent cooperation, and the attention mechanism is introduced into the policy network to realize information sharing among multiple tasks. In the process of decentralized execution, each agent performs multiple task decisions through local observations according to the trained policy network. Then, observation, action, and reward are designed. Rewards include global and local rewards, which are decomposed into sub-rewards corresponding to tasks. The reinforcement learning training algorithm is designed based on a double-deep Q-network. Finally, the scheduling simulation environment is derived from benchmarks, and the experimental results show the effectiveness of the proposed method.
引用
收藏
页数:28
相关论文
共 50 条
  • [1] DeepMAG: Deep reinforcement learning with multi-agent graphs for flexible job shop scheduling
    Zhang, Jia-Dong
    He, Zhixiang
    Chan, Wing -Ho
    Chow, Chi -Yin
    KNOWLEDGE-BASED SYSTEMS, 2023, 259
  • [2] Multi-Agent Reinforcement Learning for Job Shop Scheduling in Dynamic Environments
    Pu, Yu
    Li, Fang
    Rahimifard, Shahin
    SUSTAINABILITY, 2024, 16 (08)
  • [3] Multi-agent reinforcement learning based on graph convolutional network for flexible job shop scheduling
    Jing, Xuan
    Yao, Xifan
    Liu, Min
    Zhou, Jiajun
    JOURNAL OF INTELLIGENT MANUFACTURING, 2024, 35 (01) : 75 - 93
  • [4] Multi-agent reinforcement learning based on graph convolutional network for flexible job shop scheduling
    Xuan Jing
    Xifan Yao
    Min Liu
    Jiajun Zhou
    Journal of Intelligent Manufacturing, 2024, 35 : 75 - 93
  • [5] Research on Optimization of Dual-Resource Batch Scheduling in Flexible Job Shop
    Liu, Qinhui
    Gao, Zhijie
    Li, Jiang
    Li, Shuo
    Zhu, Laizheng
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 76 (02): : 2503 - 2530
  • [6] Real-time scheduling of multi-stage flexible job shop floor
    Ham, Myoungsoo
    Lee, Young Hoon
    Kim, Sun Hoon
    INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2011, 49 (12) : 3715 - 3730
  • [7] Dynamic job shop scheduling based on deep reinforcement learning for multi-agent manufacturing systems
    Zhang, Yi
    Zhu, Haihua
    Tang, Dunbing
    Zhou, Tong
    Gui, Yong
    ROBOTICS AND COMPUTER-INTEGRATED MANUFACTURING, 2022, 78
  • [8] Variable neighbourhood search for dual-resource constrained flexible job shop scheduling
    Lei, Deming
    Guo, Xiuping
    INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2014, 52 (09) : 2519 - 2529
  • [9] Hierarchical Reinforcement Learning for Multi-Objective Real-Time Flexible Scheduling in a Smart Shop Floor
    Chang, Jingru
    Yu, Dong
    Zhou, Zheng
    He, Wuwei
    Zhang, Lipeng
    MACHINES, 2022, 10 (12)
  • [10] Real-time scheduling for production-logistics collaborative environment using multi-agent deep reinforcement learning
    Li, Yuxin
    Li, Xinyu
    Gao, Liang
    ADVANCED ENGINEERING INFORMATICS, 2025, 65