User-guided motion planning with reinforcement learning for human-robot collaboration in smart manufacturing

被引：11

作者：

Yu, Tian ^{[1
]}

Chang, Qing ^{[1
]}

机构：

[1] Univ Virginia, Dept Mech & Aerosp Engn, Charlottesville, VA 22904 USA

来源：

EXPERT SYSTEMS WITH APPLICATIONS | 2022年 / 209卷

基金：

美国国家科学基金会;

关键词：

Human -robot collaboration; Learning from demonstration; Motion planning; Reinforcement learning;

D O I：

10.1016/j.eswa.2022.118291

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In today's manufacturing system, robots are expected to perform increasingly complex manipulation tasks in collaboration with humans. However, current industrial robots are still largely preprogrammed with very little autonomy and still required to be reprogramed by robotics experts for even slightly changed tasks. Therefore, it is highly desirable that robots can adapt to certain task changes with motion planning strategies to easily work with non-robotic experts in manufacturing environments. In this paper, we propose a user-guided motion planning algorithm in combination with reinforcement learning (RL) method to enable robots automatically generate their motion plans for new tasks by learning from a few kinesthetic human demonstrations. Features of common human demonstrated tasks in a specific application environment, e.g., desk assembly or warehouse loading/ unloading are abstracted and saved in a library. The definition of semantical similarity between features in the library and features of a new task is proposed and further used to construct the reward function in RL. To achieve an adaptive motion plan facing task changes or new task requirements, features embedded in the library are mapped to appropriate task segments based on the trained motion planning policy using Q-learning. A new task can be either learned as a combination of a few features in the library or a requirement for further human demonstration if the current library is insufficient for the new task. We evaluate our approach on a 6 DOF UR5e robot on multiple tasks and scenarios and show the effectiveness of our method with respect to different scenarios.

引用

页数：13

共 32 条

[11] Hwang JH, 2003, IROS 2003: PROCEEDINGS OF THE 2003 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-4, P1444
[12] Dynamical Movement Primitives: Learning Attractor Models for Motor Behaviors
Ijspeert, Auke Jan
Nakanishi, Jun
Hoffmann, Heiko
Pastor, Peter
Schaal, Stefan
[J]. NEURAL COMPUTATION, 2013, 25 (02) : 328 - 373
[13] Path Planning Under Kinematic Constraints by Rapidly Exploring Manifolds
Jaillet, Leonard
Porta, Josep M.
[J]. IEEE TRANSACTIONS ON ROBOTICS, 2013, 29 (01) : 105 - 117
[14] Semantic segmentation based stereo visual servoing of nonholonomic mobile robot in intelligent manufacturing environment
Jokic, Aleksandar
Petrovic, Milica
Miljkovic, Zoran
[J]. EXPERT SYSTEMS WITH APPLICATIONS, 2022, 190
[15] Jurczyk-Bunkowska M, 2020, STUD SYST DECIS CONT, V198, P19, DOI 10.1007/978-3-030-11274-5_3
[16] Kavan L., 2006, TCDCS200646
[17] Probabilistic roadmaps for path planning in high-dimensional configuration spaces
Kavraki, LE
Svestka, P
Latombe, JC
Overmars, MH
[J]. IEEE TRANSACTIONS ON ROBOTICS AND AUTOMATION, 1996, 12 (04): : 566 - 580
[18] A UNIFIED APPROACH FOR MOTION AND FORCE CONTROL OF ROBOT MANIPULATORS - THE OPERATIONAL SPACE FORMULATION
KHATIB, O
[J]. IEEE JOURNAL OF ROBOTICS AND AUTOMATION, 1987, 3 (01): : 43 - 53
[19] REVIEW OF PSEUDOINVERSE CONTROL FOR USE WITH KINEMATICALLY REDUNDANT MANIPULATORS
KLEIN, CA
HUANG, CH
[J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1983, 13 (02): : 245 - 250
[20] From Skills to Symbols: Learning Symbolic Representations for Abstract High-Level Planning
Konidaris, George
Kaelbling, Leslie Pack
Lozano-Perez, Tomas
[J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2018, 61 : 215 - 289

← 1 2 3 4 →