A Double Deep Q-Network framework for a flexible job shop scheduling problem with dynamic job arrivals and urgent job insertions

被引:13
作者
Lu, Shaojun [1 ,2 ,3 ]
Wang, Yongqi [1 ]
Kong, Min [1 ,4 ]
Wang, Weizhong [4 ]
Tan, Weimin [4 ]
Song, Yingxin [4 ]
机构
[1] Hefei Univ Technol, Sch Management, Hefei 230009, Peoples R China
[2] Univ Florida, Ctr Appl Optimizat, Dept Ind & Syst Engn, Gainesville, FL USA
[3] Minist Educ, Key Lab Proc Optimizat & Intelligent Decis Making, Hefei 230009, Peoples R China
[4] Anhui Normal Univ, Sch Econ & Management, Wuhu 241000, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Semiconductor manufacture; Dynamic flexible job shop scheduling; Double deep Q-Network; Dynamic job arrivals; Urgent job insertions; OPTIMIZATION; SEARCH;
D O I
10.1016/j.engappai.2024.108487
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the semiconductor manufacturing industry, the Dynamic Flexible Job Shop Scheduling Problem is regarded as one of the most complex and significant scheduling problems. Existing studies consider the dynamic arrival of jobs, however, the insertion of urgent jobs such as testing chips poses a challenge to the production model, and there is an urgent need for new scheduling methods to improve the dynamic response and self-adjustment of the shop floor. In this work, deep reinforcement learning is utilized to address the dynamic flexible job shop scheduling problem and facilitate near-real-time shop floor decision-making. We extracted eight state features, including machine utilization, operation completion rate, etc., to reflect real-time shop floor production data. After examining machine availability time, the machine's earliest available time is redefined and incorporated into the design of compound scheduling rules. Eight compound scheduling rules have been developed for job selection and machine allocation. By using the state features as inputs to the Double Deep Q-Network, it is possible to acquire the state action values (Q-values) of each compound scheduling rule, and the intelligent agent can learn a reasonable optimization strategy through training. Simulation studies show that the proposed Double Deep Q-Network algorithm outperforms other heuristics and well-known scheduling rules by generating excellent solutions quickly. In most scenarios, the Double Deep Q-Network algorithm outperforms the Deep QNetwork, Q-Learning, and State-Action-Reward-State-Action (SARSA) frameworks. Moreover, the intelligent agent has good generalization ability in terms of optimization for similar objectives.
引用
收藏
页数:22
相关论文
共 53 条
  • [1] Conflict identification and reconciliation in a collaborative manufacturing scheduling task
    Abreu, Claudio F.
    May, Jerrold H.
    Spangler, William E.
    Vargas, Luis G.
    [J]. INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY & DECISION MAKING, 2008, 7 (01) : 147 - 174
  • [2] Greedy randomized adaptive search for dynamic flexible job-shop scheduling
    Baykasoglu, Adil
    Madenoglu, Fatma S.
    Hamzadayi, Alper
    [J]. JOURNAL OF MANUFACTURING SYSTEMS, 2020, 56 (56) : 425 - 451
  • [3] A Hybrid Data-Driven Approach for Forecasting the Characteristics of Production Disruptions and Interruptions
    Bazargan-Lari, Mohammad Reza
    Taghipour, Sharareh
    [J]. INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY & DECISION MAKING, 2022, 21 (04) : 1127 - 1154
  • [4] Deep Reinforcement Learning for Dynamic Flexible Job Shop Scheduling with Random Job Arrival
    Chang, Jingru
    Yu, Dong
    Hu, Yi
    He, Wuwei
    Yu, Haoyu
    [J]. PROCESSES, 2022, 10 (04)
  • [5] A self-learning genetic algorithm based on reinforcement learning for flexible job-shop scheduling problem
    Chen, Ronghua
    Yang, Bo
    Li, Shi
    Wang, Shilong
    [J]. COMPUTERS & INDUSTRIAL ENGINEERING, 2020, 149
  • [6] An adaptive multi-objective algorithm based on decomposition and large neighborhood search for a green machine scheduling problem
    Cota, Luciano P.
    Guimaraes, Frederico G.
    Ribeiro, Roberto G.
    Meneghini, Ivan R.
    de Oliveira, Fernando B.
    Souza, Marcone J. F.
    Siarry, Patrick
    [J]. SWARM AND EVOLUTIONARY COMPUTATION, 2019, 51
  • [7] Solving a new robust reverse job shop scheduling problem by meta-heuristic algorithms
    Dehghan-Sanej, K.
    Eghbali-Zarch, M.
    Tavakkoli-Moghaddam, R.
    Sajadi, S. M.
    Sadjadi, S. J.
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 101
  • [8] A distributed permutation flow-shop considering sustainability criteria and real-time scheduling
    Fathollahi-Fard, Amir M.
    Woodward, Lyne
    Akhrif, Ouassima
    [J]. JOURNAL OF INDUSTRIAL INFORMATION INTEGRATION, 2024, 39
  • [9] Bi-level programming for home health care supply chain considering outsourcing
    Fathollahi-Fard, Amir M.
    Hajiaghaei-Keshteli, Mostafa
    Tavakkoli-Moghaddam, Reza
    Smith, Neale R.
    [J]. JOURNAL OF INDUSTRIAL INFORMATION INTEGRATION, 2022, 25
  • [10] Sustainable distributed permutation flow-shop scheduling model based on a triple bottom line concept
    Fathollahi-Fard, Amir M.
    Woodward, Lyne
    Akhrif, Ouassima
    [J]. JOURNAL OF INDUSTRIAL INFORMATION INTEGRATION, 2021, 24