Scheduling of Dual-Gripper Robotic Cells With Reinforcement Learning

Cited by: 22

Authors:

Kim, Hyun-Jung [1]

Lee, Jun-Ho [2]

Affiliations:

[1] KAIST Korea Adv Inst Sci & Technol, Dept Ind & Syst Engn, Daejeon 34141, South Korea

[2] Chungnam Natl Univ, Sch Business, Daejeon 34134, South Korea

Source:

IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING | 2022 / Vol. 19 / No. 2

Funding:

National Research Foundation, Singapore;

Keywords:

Robots; Job shop scheduling; Tools; Task analysis; Manufacturing; Service robots; Mathematical model; Dual-gripper robotic cell; reinforcement learning (RL); scheduling; time variations; ARMED CLUSTER TOOLS; TIME ANALYSIS; BOUND ALGORITHM; COMPLETION-TIME; HOIST; PARTS; OPTIMIZATION; CONSTRAINTS;

DOI:

10.1109/TASE.2020.3047924

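Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812 ;

Abstract: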

A dual-gripper robotic cell consists of multiple processing machines and one material handling robot that performs only one unloading or loading task at a time but can hold two parts simultaneously. We address a scheduling problem for this cell: determining a robot task sequence when two part types are processed on different sets of machines and every machine's processing time varies within a given interval. The objective is to minimize the makespan. This study is the first to propose a learning-based method, namely a reinforcement learning (RL) approach, for a dual-gripper robotic cell scheduling problem. The problem is modeled with a Petri net, a graphical and mathematical modeling tool, which serves as the environment in RL. The states, actions, and rewards are defined by using flow shop scheduling properties, features of the Petri net, and knowledge from previous studies on scheduling robotized tools. The RL approach is then compared with the first-in-first-out (FIFO) rule, which is generally used in practice; a swap sequence, which is widely used for cyclic scheduling of dual-gripper robotic cells; and a lower bound. Extensive experiments show that the proposed method performs better than FIFO and the swap sequence; moreover, the gap between the makespan of the proposed method and the lower bound is not large.
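The idea of using a Petri net as an RL environment can be illustrated with a minimal sketch. It is not the authors' model: the place and transition names, the single-machine layout, the unit task durations, and the helper names `PetriNet`, `make_cell`, and `rollout` are all assumptions made for illustration. The marking plays the role of the state, the enabled transitions are the available actions, and the negative elapsed time could serve as a reward. Machine processing times are ignored, so every complete task sequence has the same length here.

```python
import random


class PetriNet:
    """Minimal place/transition net with unit arc weights (illustrative only)."""

    def __init__(self, marking, transitions):
        self.marking = dict(marking)
        # transitions: name -> (input places, output places, duration)
        self.transitions = transitions

    def enabled(self):
        """Return transitions whose input places each hold at least one token."""
        return [t for t, (ins, _, _) in self.transitions.items()
                if all(self.marking[p] >= 1 for p in ins)]

    def fire(self, t):
        """Move tokens for transition t and return its duration."""
        ins, outs, duration = self.transitions[t]
        for p in ins:
            self.marking[p] -= 1
        for p in outs:
            self.marking[p] += 1
        return duration


def make_cell(n_parts=2):
    """Hypothetical cell: one machine and a robot with two grippers."""
    marking = {"input": n_parts, "gripper_free": 2, "holding_raw": 0,
               "holding_done": 0, "machine_free": 1, "machine_busy": 0,
               "output": 0}
    transitions = {
        "pick": (["input", "gripper_free"], ["holding_raw"], 1),
        "load": (["holding_raw", "machine_free"],
                 ["machine_busy", "gripper_free"], 1),
        "unload": (["machine_busy", "gripper_free"],
                   ["holding_done", "machine_free"], 1),
        "drop": (["holding_done"], ["output", "gripper_free"], 1),
    }
    return PetriNet(marking, transitions)


def rollout(net, policy, max_steps=100):
    """Fire transitions chosen by policy until none is enabled."""
    total_time, trace = 0, []
    for _ in range(max_steps):
        actions = net.enabled()
        if not actions:
            break
        t = policy(net.marking, actions)
        total_time += net.fire(t)
        trace.append(t)
    return total_time, trace


if __name__ == "__main__":
    net = make_cell(n_parts=2)
    total, trace = rollout(net, lambda marking, actions: random.choice(actions))
    print(total, trace, net.marking["output"])
```

A learned policy would replace the random choice in `rollout`. With more than two parts, this toy net can reach a marking with no enabled transition (both grippers holding unprocessed parts while the machine is busy), which is the kind of situation a scheduling policy has to avoid.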


Pages: 1120-1136

Page count: 17

