A Double Deep Q-Network framework for a flexible job shop scheduling problem with dynamic job arrivals and urgent job insertions
Cited by: 13
Authors:
Lu, Shaojun [1,2,3]; Wang, Yongqi [1]; Kong, Min [1,4]; Wang, Weizhong [4]; Tan, Weimin [4]; Song, Yingxin [4]
Affiliations:
[1] Hefei Univ Technol, Sch Management, Hefei 230009, Peoples R China
[2] Univ Florida, Ctr Appl Optimizat, Dept Ind & Syst Engn, Gainesville, FL USA
[3] Minist Educ, Key Lab Proc Optimizat & Intelligent Decis Making, Hefei 230009, Peoples R China
[4] Anhui Normal Univ, Sch Econ & Management, Wuhu 241000, Peoples R China
Funding:
China Postdoctoral Science Foundation;
National Natural Science Foundation of China;
Keywords:
Semiconductor manufacture;
Dynamic flexible job shop scheduling;
Double deep Q-Network;
Dynamic job arrivals;
Urgent job insertions;
OPTIMIZATION;
SEARCH;
DOI:
10.1016/j.engappai.2024.108487
Chinese Library Classification: TP [automation technology, computer technology];
Discipline classification code: 0812;
Abstract:
In the semiconductor manufacturing industry, the Dynamic Flexible Job Shop Scheduling Problem is regarded as one of the most complex and significant scheduling problems. Existing studies consider the dynamic arrival of jobs; however, the insertion of urgent jobs, such as testing chips, challenges the production model, and new scheduling methods are urgently needed to improve the dynamic response and self-adjustment of the shop floor. In this work, deep reinforcement learning is used to address the dynamic flexible job shop scheduling problem and enable near-real-time shop-floor decision-making. We extract eight state features, including machine utilization and operation completion rate, to reflect real-time shop-floor production data. After examining machine availability times, the machine's earliest available time is redefined and incorporated into the design of compound scheduling rules. Eight compound scheduling rules are developed for job selection and machine allocation. Using the state features as inputs to a Double Deep Q-Network, the state-action values (Q-values) of each compound scheduling rule are obtained, and the intelligent agent learns a sound optimization strategy through training. Simulation studies show that the proposed Double Deep Q-Network algorithm outperforms other heuristics and well-known scheduling rules, generating high-quality solutions quickly. In most scenarios, the Double Deep Q-Network algorithm also outperforms the Deep Q-Network, Q-Learning, and State-Action-Reward-State-Action (SARSA) frameworks. Moreover, the intelligent agent generalizes well when optimizing similar objectives.
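To make the mechanism described in the abstract concrete, the minimal sketch below (not from the paper; the layer sizes, discount factor, epsilon value, and toy transition are all illustrative assumptions) maps an 8-dimensional shop-floor state to Q-values over 8 compound scheduling rules, and performs the Double DQN update in which the online network selects the next action and the target network evaluates it:

import torch
import torch.nn as nn

N_FEATURES = 8   # state features, e.g. machine utilization, operation completion rate
N_RULES = 8      # compound scheduling rules (job selection + machine allocation)
GAMMA = 0.95     # discount factor (assumed, not the paper's setting)

class QNetwork(nn.Module):
    """Maps the 8-dimensional shop-floor state to one Q-value per rule."""
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(N_FEATURES, 64), nn.ReLU(),
            nn.Linear(64, 64), nn.ReLU(),
            nn.Linear(64, N_RULES),
        )
    def forward(self, state):
        return self.net(state)

online, target = QNetwork(), QNetwork()
target.load_state_dict(online.state_dict())  # re-synced periodically during training
opt = torch.optim.Adam(online.parameters(), lr=1e-3)

def select_rule(state, eps=0.1):
    """Epsilon-greedy choice among the compound scheduling rules."""
    if torch.rand(()) < eps:
        return int(torch.randint(N_RULES, ()))
    with torch.no_grad():
        return int(online(state).argmax())

def ddqn_update(state, action, reward, next_state, done):
    """One Double DQN step on a single transition (replay batching omitted)."""
    q_pred = online(state)[action]
    with torch.no_grad():
        # The "double" step: the online net selects the next rule and the
        # target net evaluates it, reducing vanilla DQN's overestimation bias.
        best_next = online(next_state).argmax()
        q_target = reward + GAMMA * target(next_state)[best_next] * (1.0 - done)
    loss = nn.functional.mse_loss(q_pred, q_target)
    opt.zero_grad()
    loss.backward()
    opt.step()
    return loss.item()

# Toy transition with random tensors standing in for simulator output.
s, s2 = torch.rand(N_FEATURES), torch.rand(N_FEATURES)
a = select_rule(s)
ddqn_update(s, a, reward=torch.tensor(1.0), next_state=s2, done=torch.tensor(0.0))

In the paper's setting, the reward would be derived from the scheduling objective (e.g., delay of urgent jobs) and the transition would come from a shop-floor simulator; both are stubbed here with random tensors.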
Pages: 22