Motion Planning for Industrial Robots using Reinforcement Learning

被引：46

作者：

Meyes, Richard ^{[1
]}

Tercan, Hasan ^{[1
]}

Roggendorf, Simon ^{[2
]}

Thiele, Thomas ^{[1
]}

Buescher, Christian ^{[3
]}

Obdenbusch, Markus ^{[2
]}

Brecher, Christian ^{[2
]}

Jeschke, Sabina ^{[1
]}

Meisen, Tobias ^{[1
]}

机构：

[1] Rhein Westfal TH Aachen, Inst Informat Management Mech Engn IMA, Dennewartstr 27, D-52064 Aachen, Germany

[2] Rhein Westfal TH Aachen, Lab Machine Tools & Prod Engn WZL, Steinbachstr 19, D-52074 Aachen, Germany

[3] St Gobain Sekurit Deutschland GmbH & Co KG, Glasstr 1, D-52134 Herzogenrath, Germany

来源：

MANUFACTURING SYSTEMS 4.0 | 2017年 / 63卷

关键词：

Reinforcement Learning; Cyber-Physical Production Systems (CPPS); Self-Optimization; MARKOV DECISION-PROCESSES;

D O I：

10.1016/j.procir.2017.03.095

中图分类号：

T [工业技术];

学科分类号：

08 ;

摘要：

A major challenge of today's production systems in the context of Industry 4.0 and Cyber-Physical Production Systems is to be flexible and adaptive whilst being robust and economically efficient. Specifically, the implementation of motion planning processes for industrial robots need to be refined concerning their variability of the motion task and the ability to adaptively deal with variations in the environment. In this paper, we propose a reinforcement learning (RL) based, cognition-enhanced six-axis industrial robot for complex motion planning along continuous trajectories as e.g. needed for welding, gluing or cutting processes in production. Our prototype demonstrator is inspired by the classic wire loop game which involves guiding a metal loop along the path of a curved wire from start to finish while avoiding any contact between the wire and the loop. Our work shows that the RL-agent is capable of learning how to control the robot to successfully play the wire loop game without the need of modeling the wire or programming the robot motion beforehand. Furthermore, the extension of the system by a visual sensor (a camera) allows the agent to sufficiently generalize the learning problem so that it can solve new or reshaped wires without the need of additional learning. We conclude that the applicability of RL for industrial robots and production systems in general provides vast and unexplored potential for processes that feature variability to some extent and thus require a general and robust approach for process automation. (C) 2017 The Authors. Published by Elsevier B.V.

引用

页码：107 / 112

页数：6

共 25 条

[1]

[Anonymous], 2003, A First Course in Stochastic Models

[2]

[Anonymous], INTRO PROBABILITY TH

[3]

[Anonymous], 2016, UR5 TECHN SPEC

[4]

[Anonymous], 2010, Artificial Intelligence-Foundations of Computational Agents

[5]

[Anonymous], 2013, TOO BIG IGNORE BUSIN

[6]

[Anonymous], 2015, DEMYSTIFYING DEEP RE

[7]

[Anonymous], 1992, Reinforcement Learning:An Introduction, DOI DOI 10.1007/978-1-4615-3618-51

[8]

[Anonymous], 2013, SECURING FUTURE GERM

[9]

[Anonymous], 2005, P 5 EUR WORKSH AD AG

[10]

[Anonymous], REINFORCEMENT LEAR 2

← 1 2 3 →