共 48 条
A Reinforcement Learning Approach for Flexible Job Shop Scheduling Problem With Crane Transportation and Setup Times
被引:89
作者:
Du, Yu
[1
]
Li, Junqing
[1
,2
]
Li, Chengdong
[3
]
Duan, Peiyong
[4
]
机构:
[1] Shandong Normal Univ, Sch Informat Sci & Engn, Jinan 250014, Peoples R China
[2] Liaocheng Univ, Sch Comp Sci, Liaocheng 252059, Shandong, Peoples R China
[3] Shandong Jianzhu Univ, Sch Informat & Elect Engn, Jinan 252101, Peoples R China
[4] Yantai Univ, Sch Math & Informat Sci, Yantai 264005, Peoples R China
基金:
美国国家科学基金会;
关键词:
Cranes;
Job shop scheduling;
Transportation;
Scheduling;
Optimization;
Heuristic algorithms;
Reinforcement learning;
Deep Q-network (DQN);
flexible job shop scheduling;
multiobjective optimization;
reinforcement learning (RL);
OPTIMIZATION;
ALGORITHM;
HYBRID;
D O I:
10.1109/TNNLS.2022.3208942
中图分类号:
TP18 [人工智能理论];
学科分类号:
081104 ;
0812 ;
0835 ;
1405 ;
摘要:
Flexible job shop scheduling problem (FJSP) has attracted research interests as it can significantly improve the energy, cost, and time efficiency of production. As one type of reinforcement learning, deep Q-network (DQN) has been applied to solve numerous realistic optimization problems. In this study, a DQN model is proposed to solve a multiobjective FJSP with crane transportation and setup times (FJSP-CS). Two objectives, i.e., makespan and total energy consumption, are optimized simultaneously based on weighting approach. To better reflect the problem realities, eight different crane transportation stages and three typical machine states including processing, setup, and standby are investigated. Considering the complexity of FJSP-CS, an identification rule is designed to organize the crane transportation in solution decoding. As for the DQN model, 12 state features and seven actions are designed to describe the features in the scheduling process. A novel structure is applied in the DQN topology, saving the calculation resources and improving the performance. In DQN training, double deep Q-network technique and soft target weight update strategy are used. In addition, three reported improvement strategies are adopted to enhance the solution qualities by adjusting scheduling assignments. Extensive computational tests and comparisons demonstrate the effectiveness and advantages of the proposed method in solving FJSP-CS, where the DQN can choose appropriate dispatching rules at various scheduling situations.
引用
收藏
页码:5695 / 5709
页数:15
相关论文