A Multi-Action Deep Reinforcement Learning Based on BiLSTM for Flexible Job Shop Scheduling Problem with Tight Time

Cited by: 0

Authors:

Wang, Rui [1]

Liu, Chang [1]

Wang, Xinzhuo [1]

Yang, Shengxiang [2]

Hou, Yaqi [3]

Affiliations:

[1] Chinese Acad Sci, Shenyang Inst Automat, Shenyang, Liaoning, Peoples R China

[2] DeMontfort Univ, Sch Comp Sci & Informat, Leicester, Leics, England

[3] Shuanghui Meat Proc Ltd, Fuxin, Liaoning, Peoples R China

Source:

PROCEEDINGS OF 2024 8TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE, CSAI 2024 | 2024

Keywords:

Deep reinforcement learning; Job-shop scheduling problem; Intelligent manufacturing systems;

DOI:

10.1145/3709026.3709038

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

The Flexible Job Shop Scheduling Problem (FJSP) with tight time is a significant challenge in both academic and industrial fields of production scheduling. This paper addresses the FJSP with tight time using a Multi-action Deep Reinforcement Learning (MDRL) method. First, a multi-action Markov Decision Process (MDP) is formulated, integrating operation and machine sets into a unified multi-action space. Then, a scheduling policy is developed using a Bi-directional Long Short-Term Memory network (BiLSTM) to extract intrinsic scheduling information. Finally, Proximal Policy Optimization (PPO) enhanced with reward shaping is employed to train the model, enabling intelligent decision-making in action selection. Extensive experiments are conducted on four problem instances of varying scales. Comparisons with 20 priority dispatch rules and two closely related DRL methods demonstrate the superior performance of the proposed MDRL approach.
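The three ingredients named in the abstract (a joint operation-machine action space, reward shaping, and PPO) can be illustrated with a minimal Python sketch. This is not the paper's implementation: the abstract does not give the shaping function, the state features, or the network, so the sketch assumes potential-based shaping and the standard PPO clipped surrogate. The names `feasible_actions`, `shaped_reward`, `ppo_clip_term`, `ready_ops`, and `compat` are hypothetical and introduced here for illustration only.

```python
import math


def feasible_actions(ready_ops, compat):
    """Enumerate joint (operation, machine) actions.

    Assumption: `ready_ops` lists the operations that can be scheduled now,
    and `compat` maps each operation to the machines able to process it.
    """
    return [(op, machine) for op in ready_ops for machine in compat[op]]


def shaped_reward(reward, phi_s, phi_next, gamma=0.99):
    """Potential-based shaping (assumed form): r' = r + gamma * phi(s') - phi(s)."""
    return reward + gamma * phi_next - phi_s


def ppo_clip_term(logp_new, logp_old, advantage, eps=0.2):
    """Standard PPO clipped surrogate for a single sample.

    Returns min(ratio * A, clip(ratio, 1 - eps, 1 + eps) * A),
    where ratio = exp(logp_new - logp_old).
    """
    ratio = math.exp(logp_new - logp_old)
    clipped = max(1.0 - eps, min(1.0 + eps, ratio))
    return min(ratio * advantage, clipped * advantage)
```

For example, with `ready_ops = ["o1", "o2"]` and `compat = {"o1": ["m1", "m2"], "o2": ["m2"]}`, `feasible_actions` yields three joint actions. In a full agent, a policy network (a BiLSTM in the paper) would score these pairs, and the clipped term would be averaged over a batch and maximized.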


Pages: 318-326

Page count: 9

