Interactive Q-Learning Approach for Pick-and-Place Optimization of the Die Attach Process in the Semiconductor Industry

被引:7
作者
Ahn, Gilseung [1 ]
Park, Myunghwan [1 ]
Park, You-Jin [2 ]
Hur, Sun [1 ]
机构
[1] Hanyang Univ, Dept Ind & Management Engn, Ansan 15588, South Korea
[2] Natl Taipei Univ Technol, Dept Ind Engn & Management, Taipei, Taiwan
基金
新加坡国家研究基金会;
关键词
REINFORCEMENT; ALGORITHMS;
D O I
10.1155/2019/4602052
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In semiconductor back-end production, the die attach process is one of the most critical steps affecting overall productivity. Optimization of this process can be modeled as a pick-and-place problem known to be NP-hard. Typical approaches are rule-based and metaheuristic methods. The two have high or low generalization ability, low or high performance, and short or long search time, respectively. The motivation of this paper is to develop a novel method involving only the strengths of these methods, i.e., high generalization ability and performance and short search time. We develop an interactive Q-learning in which two agents, a pick agent and a place agent, are trained and find a pick-and-place (PAP) path interactively. From experiments, we verified that the proposed approach finds a shorter path than the genetic algorithm given in previous research.
引用
收藏
页数:8
相关论文
共 19 条
[1]   A Reinforcement Learning Technique for Optimizing Downlink Scheduling in an Energy-Limited Vehicular Network [J].
Atallah, Ribal F. ;
Assi, Chadi M. ;
Yu, Jia Yuan .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2017, 66 (06) :4592-4601
[2]   A comprehensive survey of multiagent reinforcement learning [J].
Busoniu, Lucian ;
Babuska, Robert ;
De Schutter, Bart .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2008, 38 (02) :156-172
[3]   Genetic Scheduling and Reinforcement Learning in Multirobot Systems for Intelligent Warehouses [J].
Dou, Jiajia ;
Chen, Chunlin ;
Yang, Pei .
MATHEMATICAL PROBLEMS IN ENGINEERING, 2015, 2015
[4]   Robust multi-robot coordination in pick-and-place tasks based on part-dispatching rules [J].
Huang, Yanjiang ;
Chiba, Ryosuke ;
Arai, Tamio ;
Ueyama, Tsuyoshi ;
Ota, Jun .
ROBOTICS AND AUTONOMOUS SYSTEMS, 2015, 64 :70-83
[5]   Reinforcement learning approaches for specifying ordering policies of perishable inventory systems [J].
Kara, Ahmet ;
Dogan, Ibrahim .
EXPERT SYSTEMS WITH APPLICATIONS, 2018, 91 :150-158
[6]   Reinforcement learning in robotics: A survey [J].
Kober, Jens ;
Bagnell, J. Andrew ;
Peters, Jan .
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2013, 32 (11) :1238-1274
[7]   Adaptive neural network tracking control-based reinforcement learning for wheeled mobile robots with skidding and slipping [J].
Li, Shu ;
Ding, Liang ;
Gao, Haibo ;
Chen, Chao ;
Liu, Zhen ;
Deng, Zongquan .
NEUROCOMPUTING, 2018, 283 :20-30
[8]   A hybrid metaheuristic algorithm to optimise a real-world robotic cell [J].
Liu, Shi Qiang ;
Kozan, Erhan .
COMPUTERS & OPERATIONS RESEARCH, 2017, 84 :188-194
[9]   Optimal Route Based on Dynamic Programming for Road Networks [J].
Mainali, Manoj Kanta ;
Shimada, Kaoru ;
Mabu, Shingo ;
Hirasawa, Kotaro .
JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2008, 12 (06) :546-553
[10]  
Nakamae K., 2001, Modeling and Simulation of Microsystems, P598