Distributed flexible job shops are increasingly becoming the predominant production mode in manufacturing due to their advantages in low-cost production and high customization. In practical production environments, jobs arrive randomly yet follow regular patterns. This paper addresses the distributed flexible job shop scheduling problem (DFJSP) with random job arrivals. The DFJSP consists of three sub-problems: factory selection, job assignment, and operation sequencing. To tackle this problem, the DFJSP is modeled as a Markov Decision Process (MDP), and a multi-agent approach based on deep reinforcement learning (DRL) is proposed. This approach comprises a Distribute Agent (DA) and a Sequence Agent (SA). For the MDP of the DA, we design 12 state features, 5 candidate actions, and a reward function based on the current production tardiness. The SA is configured with 7 state features, 6 candidate actions, and rewards that reflect delay conditions. A deep Q-network (DQN) framework incorporating a linearly decreasing threshold probability is designed to effectively balance exploration and exploitation during the training phase. Comparative experiments on randomly generated instances demonstrate the effectiveness of the DA both when used independently and in conjunction with the SA.
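
To illustrate the linearly decreasing threshold probability used for balancing exploration and exploitation, the following is a minimal sketch of epsilon-greedy action selection with a linear decay schedule. The parameter values (`eps_start`, `eps_end`, `decay_steps`) and function names are illustrative assumptions, not values taken from the paper.

```python
import random
import numpy as np


def linear_epsilon(step, eps_start=1.0, eps_end=0.05, decay_steps=10_000):
    """Linearly anneal the exploration probability from eps_start to eps_end.
    All schedule parameters here are assumed for illustration only."""
    frac = min(step / decay_steps, 1.0)
    return eps_start + frac * (eps_end - eps_start)


def select_action(q_values, step):
    """Epsilon-greedy selection over the agent's candidate actions:
    explore with probability epsilon, otherwise pick the highest Q-value."""
    eps = linear_epsilon(step)
    if random.random() < eps:
        return random.randrange(len(q_values))  # explore: random candidate action
    return int(np.argmax(q_values))             # exploit: greedy action
```

Early in training the large threshold favors random exploration of candidate actions; as training proceeds the threshold shrinks linearly, so the agent increasingly exploits the learned Q-values.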