Scalable Scheduling of Semiconductor Packaging Facilities Using Deep Reinforcement Learning

被引：40

作者：

Park, In-Beom ^{[1
]}

Park, Jonghun ^{[2
,3
]}

机构：

[1] Myongji Univ, Dept Ind & Management Engn, Yongin 17058, South Korea

[2] Seoul Natl Univ, Dept Ind Engn, Seoul 08826, South Korea

[3] Seoul Natl Univ, Inst Ind Syst Innovat, Seoul 08826, South Korea

来源：

IEEE TRANSACTIONS ON CYBERNETICS | 2023年 / 53卷 / 06期

基金：

新加坡国家研究基金会;

关键词：

Job shop scheduling; Processor scheduling; Packaging; Production; Artificial neural networks; Metaheuristics; Aerospace electronics; Deep deterministic policy gradient (DDPG); deep reinforcement learning (DRL); flexible job shop scheduling; semiconductor manufacturing system; semiconductor packaging; SEARCH; ALGORITHM;

D O I：

10.1109/TCYB.2021.3128075

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Reinforcement learning (RL) has emerged as a promising approach for scheduling semiconductor operations. Yet, it is still challenging to solve large-scale scheduling problems based on an RL method since learning complexity grows fast as the size of shop floor increases. This challenge becomes more apparent when solving the scheduling problems with a diverse number of job types, which leads to the difficulties in exploration and function approximation in RL. This article presents a scheduling method for semiconductor packaging facilities using deep RL in which an agent allocates a job to one of machines in a centralized manner. Specifically, a novel state representation is introduced to effectively accommodate the variations in the number of available machines and the production requirements. Furthermore, we propose a continuous representation of an action to maintain the size of the action space even when the numbers of jobs, machines, and operation types are subject to change. Extensive experiments on large-scale datasets demonstrate that the proposed method mostly outperforms the metaheuristics and rule-based methods, as well as the other RL approaches considered in terms of makespan while requiring much less computation time than the metaheuristics.

引用

页码：3518 / 3531

页数：14

共 53 条

[1]

Ba J.L., 2016, stat, VVolume 29, P3617, DOI 10.48550/arXiv.1607.06450

[2]

Bergstra J, 2012, J MACH LEARN RES, V13, P281

[3] Scheduling Semiconductor Testing Facility by Using Cuckoo Search Algorithm With Reinforcement Learning and Surrogate Modeling [J].

Cao, ZhengCai ;

Lin, ChengRan ;

Zhou, MengChu ;

Huang, Ran .

IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2019, 16 (02) :825-837

[4] Setup Change Scheduling for Semiconductor Packaging Facilities Using a Genetic Algorithm With an Operator Recommender [J].

Chung, Beom-suk ;

Lim, Junseok ;

Park, In-Beom ;

Park, Jonghun ;

Seo, Minseok ;

Seo, Jinwook .

IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING, 2014, 27 (03) :377-387

[5] EUCLIDEAN DISTANCE MAPPING [J].

DANIELSSON, PE .

COMPUTER GRAPHICS AND IMAGE PROCESSING, 1980, 14 (03) :227-248

[6] An efficient two-stage genetic algorithm for a flexible job-shop scheduling problem with sequence dependent attached/detached setup, machine release date and lag-time [J].

Defersha, Fantahun M. ;

Rooyani, Danial .

COMPUTERS & INDUSTRIAL ENGINEERING, 2020, 147 (147)

[7]

Degris T., 2015, Deep reinforcement learning in large discrete action spaces

[8]

Finn C, 2017, PR MACH LEARN RES, V70

[9]

Foerster J. N., 2018, ARXIV181011702

[10] Distributed policy search reinforcement learning for job-shop scheduling tasks [J].

Gabel, Thomas ;

Riedmiller, Martin .

INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2012, 50 (01) :41-61

← 1 2 3 4 5 6 →