Scheduling of Dual-Gripper Robotic Cells With Reinforcement Learning

被引:22
作者
Kim, Hyun-Jung [1 ]
Lee, Jun-Ho [2 ]
机构
[1] KAIST Korea Adv Inst Sci & Technol, Dept Ind & Syst Engn, Daejeon 34141, South Korea
[2] Chungnam Natl Univ, Sch Business, Daejeon 34134, South Korea
基金
新加坡国家研究基金会;
关键词
Robots; Job shop scheduling; Tools; Task analysis; Manufacturing; Service robots; Mathematical model; Dual-gripper robotic cell; reinforcement learning (RL); scheduling; time variations; ARMED CLUSTER TOOLS; TIME ANALYSIS; BOUND ALGORITHM; COMPLETION-TIME; HOIST; PARTS; OPTIMIZATION; CONSTRAINTS;
D O I
10.1109/TASE.2020.3047924
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A dual-gripper robotic cell consists of multiple processing machines and one material handling robot, which can perform an unloading or a loading task one at a time but can hold two parts at the same time. We address a scheduling problem of the robotic cell that determines a robot task sequence when two part types are processed in a different set of machines and all machines have variable processing times within a given interval. The objective is to minimize the makespan. This study proposes a learning-based method, i.e., a reinforcement learning (RL) approach, for the first time, to address a dual-gripper robotic cell scheduling problem. The problem is modeled with a Petri net, a graphical and mathematical modeling tool, which is used as an environment in RL. The states, actions, and rewards are defined by using flow shop scheduling properties, features from a Petri net, and knowledge from previous studies of scheduling robotized tools. Then, the RL approach is compared to the first-in-first-out (FIFO) rule, which is generally used in practice, a swap sequence, which is widely used for cyclic scheduling of dual-gripper robotic cells, and a lower bound. The extensive experiments show that the proposed method performs better than FIFO and the swap sequence; moreover, the gap between the makespan of the proposed method and the lower bound is not large.
引用
收藏
页码:1120 / 1136
页数:17
相关论文
共 54 条
[1]   Collaborative reinforcement learning for a two-robot job transfer flow-shop scheduling problem [J].
Arviv, Kfir ;
Stern, Helman ;
Edan, Yael .
INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2016, 54 (04) :1196-1209
[2]   Dynamic job-shop scheduling using reinforcement learning agents [J].
Aydin, ME ;
Öztemel, E .
ROBOTICS AND AUTONOMOUS SYSTEMS, 2000, 33 (2-3) :169-178
[3]   An optimization-based heuristic for the robotic cell problem [J].
Carlier, Jacques ;
Haouari, Mohamed ;
Kharbeche, Mohamed ;
Moukrim, Aziz .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2010, 202 (03) :636-645
[4]   An efficient bicriteria algorithm for stable robotic flow shop scheduling [J].
Che, Ada ;
Kats, Vladimir ;
Levner, Eugene .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2017, 260 (03) :964-971
[5]   Robust optimization for the cyclic hoist scheduling problem [J].
Che, Ada ;
Feng, Jianguang ;
Chen, Haoxun ;
Chu, Chengbin .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2015, 240 (03) :627-636
[6]   Optimal cyclic scheduling of a hoist and multi-type parts with fixed processing times [J].
Che, Ada ;
Yan, Pengyu ;
Yang, Naiding ;
Chu, Chengbin .
INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2010, 48 (05) :1225-1243
[7]   A reinforcement learning based approach for a multiple-load carrier scheduling problem [J].
Chen, Ci ;
Xia, Beixin ;
Zhou, Bing-hai ;
Xi, Lifeng .
JOURNAL OF INTELLIGENT MANUFACTURING, 2015, 26 (06) :1233-1245
[8]   Cyclic scheduling in robotic flowshops [J].
Crama, Y ;
Kats, V ;
van de Klundert, J ;
Levner, E .
ANNALS OF OPERATIONS RESEARCH, 2000, 96 (1-4) :97-124
[9]   Sequencing and scheduling in robotic cells: Recent developments [J].
Dawande, M ;
Geismar, HN ;
Sethi, SP ;
Sriskandarajah, C .
JOURNAL OF SCHEDULING, 2005, 8 (05) :387-426
[10]   Throughput optimization in dual-gripper interval robotic cells [J].
Dawande, Milind ;
Geismar, H. Neil ;
Pinedo, Michael ;
Sriskandarajah, Chelliah .
IIE TRANSACTIONS, 2010, 42 (01) :1-15