Dynamic flexible job-shop scheduling by multi-agent reinforcement learning with reward-shaping

被引：9

作者：

Zhang, Lixiang ^{[1
,2
]}

Yan, Yan ^{[1
]}

Yang, Chen ^{[3
]}

Hu, Yaoguang ^{[1
]}

机构：

[1] Beijing Inst Technol, Lab Ind & Intelligent Syst Engn, Beijing 100081, Peoples R China

[2] Univ Auckland, Lab Ind Smart Mfg Syst 40, Auckland 1142, New Zealand

[3] Beijing Inst Technol, Sch Cyberspace Sci & Technol, Beijing 100081, Peoples R China

来源：

ADVANCED ENGINEERING INFORMATICS | 2024年 / 62卷

基金：

中国国家自然科学基金;

关键词：

Dynamic flexible job-shop scheduling; Deep reinforcement learning; Reward-shaping; Multi-agent system;

D O I：

10.1016/j.aei.2024.102872

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Achieving mass personalization presents significant challenges in performance and adaptability when solving dynamic flexible job-shop scheduling problems (DFJSP). Previous studies have struggled to achieve high performance in variable contexts. To tackle this challenge, this paper introduces a novel scheduling strategy founded on heterogeneous multi-agent reinforcement learning. This strategy facilitates centralized optimization and decentralized decision-making through collaboration among job and machine agents while employing historical experiences to support data-driven learning. The DFJSP with transportation time is initially formulated as heterogeneous multi-agent partial observation Markov Decision Processes. This formulation outlines the interactions between decision-making agents and the environment, incorporating a reward-shaping mechanism aimed at organizing job and machine agents to minimize the weighted tardiness of dynamic jobs. Then, we develop a dueling double deep Q-network algorithm incorporating the reward-shaping mechanism to ascertain the optimal strategies for machine allocation and job sequencing in DFJSP. This approach addresses the sparse reward issue and accelerates the learning process. Finally, the efficiency of the proposed method is verified and validated through numerical experiments, which demonstrate its superiority in reducing the weighted tardiness of dynamic jobs when compared to state-of-the-art baselines. The proposed method exhibits remarkable adaptability in encountering new scenarios, underscoring the benefits of adopting a heterogeneous multi-agent reinforcement learning-based scheduling approach in navigating dynamic and flexible challenges.

引用

页数：13

共 50 条

[21] Energy-Flexible Job-Shop Scheduling Using Deep Reinforcement Learning [J].

Felder, Mine ;

Steiner, Daniel ;

Busch, Paul ;

Trat, Martin ;

Sun, Chenwei ;

Bender, Janek ;

Ovtcharova, Jivka .

PROCEEDINGS OF THE CONFERENCE ON PRODUCTION SYSTEMS AND LOGISTICS, CPSL 2023-1, 2023, :353-362

[22] Dynamic multi-objective scheduling for flexible job shop by deep reinforcement learning [J].

Luo, Shu ;

Zhang, Linxuan ;

Fan, Yushun .

COMPUTERS & INDUSTRIAL ENGINEERING, 2021, 159

[23] Job Shop Dynamic Scheduling Model Based on Multi-Agent [J].

He, Li ;

Liu, Yong-xian ;

Xie, Hua-long ;

Zhang, Yu .

2008 CHINESE CONTROL AND DECISION CONFERENCE, VOLS 1-11, 2008, :829-+

[24] Dynamic job-shop scheduling using graph reinforcement learning with auxiliary strategy [J].

Liu, Zhenyu ;

Mao, Haoyang ;

Sa, Guodong ;

Liu, Hui ;

Tan, Jianrong .

JOURNAL OF MANUFACTURING SYSTEMS, 2024, 73 :1-18

[25] Dynamic job-shop scheduling in smart manufacturing using deep reinforcement learning [J].

Wang, Libing ;

Hu, Xin ;

Wang, Yin ;

Xu, Sujie ;

Ma, Shijun ;

Yang, Kexin ;

Liu, Zhijun ;

Wang, Weidong .

COMPUTER NETWORKS, 2021, 190 (190)

[26] A reinforcement learning-based approach for solving multi-agent job shop scheduling problem [J].

Dong, Zhuoran ;

Ren, Tao ;

Qi, Fang ;

Weng, Jiacheng ;

Bai, Danyu ;

Yang, Jie ;

Wu, Chin-Chia .

INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2025, 63 (10) :3512-3537

[27] Reward Shaping for Job Shop Scheduling [J].

Nasuta, Alexander ;

Kemmerling, Marco ;

Luetticke, Daniel ;

Schmitt, Robert H. .

MACHINE LEARNING, OPTIMIZATION, AND DATA SCIENCE, LOD 2023, PT I, 2024, 14505 :197-211

[28] Dynamic flexible job shop scheduling based on deep reinforcement learning [J].

Yang, Dan ;

Shu, Xiantao ;

Yu, Zhen ;

Lu, Guangtao ;

Ji, Songlin ;

Wang, Jiabing ;

He, Kongde .

PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART B-JOURNAL OF ENGINEERING MANUFACTURE, 2025, 239 (09) :1251-1264

[29] Dynamic scheduling for multi-objective flexible job shop via deep reinforcement learning [J].

Yuan, Erdong ;

Wang, Liejun ;

Song, Shiji ;

Cheng, Shuli ;

Fan, Wei .

APPLIED SOFT COMPUTING, 2025, 171

[30] Deep Reinforcement Learning for Dynamic Flexible Job Shop Scheduling with Random Job Arrival [J].

Chang, Jingru ;

Yu, Dong ;

Hu, Yi ;

He, Wuwei ;

Yu, Haoyu .

PROCESSES, 2022, 10 (04)

← 1 2 3 4 5 →