A Reinforcement Learning Approach for Flexible Job Shop Scheduling Problem With Crane Transportation and Setup Times

Cited by: 94
Authors
Du, Yu [1]
Li, Junqing [1,2]
Li, Chengdong [3]
Duan, Peiyong [4]
Affiliations
[1] Shandong Normal Univ, Sch Informat Sci & Engn, Jinan 250014, Peoples R China
[2] Liaocheng Univ, Sch Comp Sci, Liaocheng 252059, Shandong, Peoples R China
[3] Shandong Jianzhu Univ, Sch Informat & Elect Engn, Jinan 252101, Peoples R China
[4] Yantai Univ, Sch Math & Informat Sci, Yantai 264005, Peoples R China
Funding
National Science Foundation (USA);
Keywords
Cranes; Job shop scheduling; Transportation; Scheduling; Optimization; Heuristic algorithms; Reinforcement learning; Deep Q-network (DQN); flexible job shop scheduling; multiobjective optimization; reinforcement learning (RL); OPTIMIZATION; ALGORITHM; HYBRID;
DOI
10.1109/TNNLS.2022.3208942
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The flexible job shop scheduling problem (FJSP) has attracted research interest because solving it well can significantly improve the energy, cost, and time efficiency of production. Deep Q-network (DQN), a reinforcement learning method, has been applied to numerous realistic optimization problems. In this study, a DQN model is proposed to solve a multiobjective FJSP with crane transportation and setup times (FJSP-CS). Two objectives, makespan and total energy consumption, are optimized simultaneously using a weighting approach. To better reflect the realities of the problem, eight crane transportation stages and three typical machine states (processing, setup, and standby) are considered. Given the complexity of FJSP-CS, an identification rule is designed to organize crane transportation during solution decoding. For the DQN model, 12 state features and seven actions are designed to describe the scheduling process. A novel structure is applied in the DQN topology, which saves computational resources and improves performance. In DQN training, the double deep Q-network technique and a soft target weight update strategy are used. In addition, three previously reported improvement strategies are adopted to enhance solution quality by adjusting scheduling assignments. Extensive computational tests and comparisons demonstrate the effectiveness and advantages of the proposed method in solving FJSP-CS, where the DQN can choose appropriate dispatching rules in various scheduling situations.
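The abstract names three generic techniques: a weighted combination of two objectives, the double deep Q-network target, and a soft target weight update. The sketch below illustrates the textbook form of each in plain Python. It is not the authors' implementation: the function names, the weight `w`, the discount `gamma`, the rate `tau`, and the absence of objective normalization are all assumptions made for illustration, and none of these values comes from the record above.

```python
from typing import List, Sequence


def weighted_objective(makespan: float, energy: float, w: float = 0.5) -> float:
    """Weighted sum of two objectives (assumed form; w is a hypothetical weight)."""
    return w * makespan + (1.0 - w) * energy


def double_dqn_target(
    reward: float,
    next_q_online: Sequence[float],
    next_q_target: Sequence[float],
    gamma: float = 0.9,
    done: bool = False,
) -> float:
    """Standard double DQN target for one transition.

    The online network selects the next action; the target network
    evaluates it. gamma is an assumed discount factor.
    """
    if done:
        return reward
    # Action selection uses the online network's Q-values.
    best_action = max(range(len(next_q_online)), key=lambda a: next_q_online[a])
    # Action evaluation uses the target network's Q-values.
    return reward + gamma * next_q_target[best_action]


def soft_update(
    target_weights: Sequence[float],
    online_weights: Sequence[float],
    tau: float = 0.01,
) -> List[float]:
    """Soft (Polyak) update: target <- (1 - tau) * target + tau * online."""
    return [(1.0 - tau) * t + tau * o for t, o in zip(target_weights, online_weights)]


if __name__ == "__main__":
    print(weighted_objective(makespan=10.0, energy=20.0, w=0.5))
    print(double_dqn_target(1.0, [0.2, 0.8, 0.5], [1.0, 2.0, 3.0], gamma=0.5))
    print(soft_update([0.0, 1.0], [1.0, 0.0], tau=0.5))
```

In the double DQN target, separating action selection from action evaluation is the usual way to reduce the overestimation of Q-values seen in plain DQN. The soft update moves the target weights a small step toward the online weights at each call instead of copying them periodically.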
Pages: 5695-5709
Number of pages: 15