Dynamic flexible job-shop scheduling by multi-agent reinforcement learning with reward-shaping

Cited by: 9
Authors
Zhang, Lixiang [1, 2]
Yan, Yan [1 ]
Yang, Chen [3 ]
Hu, Yaoguang [1 ]
Affiliations
[1] Beijing Inst Technol, Lab Ind & Intelligent Syst Engn, Beijing 100081, Peoples R China
[2] Univ Auckland, Lab Ind Smart Mfg Syst 40, Auckland 1142, New Zealand
[3] Beijing Inst Technol, Sch Cyberspace Sci & Technol, Beijing 100081, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Dynamic flexible job-shop scheduling; Deep reinforcement learning; Reward-shaping; Multi-agent system;
DOI
10.1016/j.aei.2024.102872
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Achieving mass personalization presents significant challenges in performance and adaptability when solving dynamic flexible job-shop scheduling problems (DFJSP). Previous studies have struggled to achieve high performance in variable contexts. To tackle this challenge, this paper introduces a novel scheduling strategy founded on heterogeneous multi-agent reinforcement learning. This strategy facilitates centralized optimization and decentralized decision-making through collaboration among job and machine agents while employing historical experiences to support data-driven learning. The DFJSP with transportation time is first formulated as a heterogeneous multi-agent partially observable Markov decision process. This formulation outlines the interactions between decision-making agents and the environment, incorporating a reward-shaping mechanism aimed at organizing job and machine agents to minimize the weighted tardiness of dynamic jobs. Then, we develop a dueling double deep Q-network algorithm incorporating the reward-shaping mechanism to determine the optimal strategies for machine allocation and job sequencing in DFJSP. This approach addresses the sparse reward issue and accelerates the learning process. Finally, the efficiency of the proposed method is verified and validated through numerical experiments, which demonstrate its superiority in reducing the weighted tardiness of dynamic jobs when compared to state-of-the-art baselines. The proposed method exhibits remarkable adaptability when encountering new scenarios, underscoring the benefits of adopting a heterogeneous multi-agent reinforcement learning-based scheduling approach in navigating dynamic and flexible challenges.
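The abstract names three building blocks: a weighted-tardiness objective, a dueling double deep Q-network, and reward shaping. The sketch below illustrates the textbook form of each in plain Python. It is not the authors' implementation: the abstract does not specify their network, state features, or shaping function, so the `Job` fields, the function names, and the potential-based shaping form are all assumptions chosen for illustration.

```python
from dataclasses import dataclass


@dataclass
class Job:
    # Hypothetical job record; field names are illustrative assumptions.
    weight: float      # priority weight of the job
    due: float         # due date
    completion: float  # completion time under some schedule


def weighted_tardiness(jobs):
    """Sum of weight * max(0, completion - due) over all jobs."""
    return sum(j.weight * max(0.0, j.completion - j.due) for j in jobs)


def dueling_q(value, advantages):
    """Standard dueling aggregation: Q(s,a) = V(s) + A(s,a) - mean_a A(s,a)."""
    mean_adv = sum(advantages) / len(advantages)
    return [value + a - mean_adv for a in advantages]


def double_dqn_target(reward, gamma, q_online_next, q_target_next, done):
    """Double DQN target: the online network selects the next action,
    the target network evaluates it."""
    if done:
        return reward
    best = max(range(len(q_online_next)), key=q_online_next.__getitem__)
    return reward + gamma * q_target_next[best]


def shaped_reward(reward, gamma, phi_s, phi_next):
    """Generic potential-based shaping: r + gamma * phi(s') - phi(s).
    Assumption: one common shaping form, not necessarily the paper's."""
    return reward + gamma * phi_next - phi_s


# Two jobs: the first is 2 time units late with weight 2, the second is early.
jobs = [Job(weight=2.0, due=10.0, completion=12.0),
        Job(weight=1.0, due=8.0, completion=7.0)]
total = weighted_tardiness(jobs)
```

A potential such as the negative of the current weighted-tardiness estimate would give agents a dense signal at every decision step instead of a single reward at the end of the schedule, which is the kind of sparse-reward problem the abstract says reward shaping is meant to address.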
Pages: 13