User-guided motion planning with reinforcement learning for human-robot collaboration in smart manufacturing

Cited by: 11
Authors
Yu, Tian [1]
Chang, Qing [1]
Affiliations
[1] Univ Virginia, Dept Mech & Aerosp Engn, Charlottesville, VA 22904 USA
Funding
National Science Foundation (USA)
Keywords
Human-robot collaboration; Learning from demonstration; Motion planning; Reinforcement learning
DOI
10.1016/j.eswa.2022.118291
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In today's manufacturing systems, robots are expected to perform increasingly complex manipulation tasks in collaboration with humans. However, current industrial robots are still largely preprogrammed, have very little autonomy, and must be reprogrammed by robotics experts for even slightly changed tasks. It is therefore highly desirable for robots to adapt to certain task changes through motion planning strategies, so that they can work easily with people who are not robotics experts in manufacturing environments. In this paper, we propose a user-guided motion planning algorithm, combined with a reinforcement learning (RL) method, that enables robots to automatically generate motion plans for new tasks by learning from a few kinesthetic human demonstrations. Features of common human-demonstrated tasks in a specific application environment, e.g., desk assembly or warehouse loading/unloading, are abstracted and saved in a library. A definition of semantic similarity between features in the library and features of a new task is proposed and then used to construct the reward function in RL. To achieve an adaptive motion plan in the face of task changes or new task requirements, features embedded in the library are mapped to appropriate task segments based on a motion planning policy trained with Q-learning. A new task is either learned as a combination of a few features in the library or, if the current library is insufficient for the new task, flagged as requiring further human demonstration. We evaluate our approach on a 6-DOF UR5e robot across multiple tasks and scenarios and show the effectiveness of our method across them.
Pages: 13
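The abstract describes mapping library features to task segments with a Q-learning policy whose reward is built from a similarity measure. This record does not give the paper's similarity definition, state and action spaces, or hyperparameters, so the following is only a toy sketch under stated assumptions: features and segments are plain numeric vectors, cosine similarity stands in for the paper's semantic similarity, and the function names, the `threshold` parameter, and the example vectors are all hypothetical.

```python
import math
import random


def cosine(u, v):
    """Cosine similarity; an assumed stand-in for the paper's semantic similarity."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def plan_with_q_learning(library, segments, episodes=500, alpha=0.5,
                         gamma=0.9, epsilon=0.2, threshold=0.8, seed=0):
    """Assign one library feature to each task segment with tabular Q-learning.

    State  = index of the current task segment (hypothetical choice).
    Action = index of a library feature to apply to that segment.
    Reward = similarity between the chosen feature and the segment.
    """
    rng = random.Random(seed)
    names = list(library)
    # q[s][a]: estimated value of using library feature a on segment s
    q = [[0.0] * len(names) for _ in segments]

    for _ in range(episodes):
        for s, segment in enumerate(segments):
            # Epsilon-greedy action selection
            if rng.random() < epsilon:
                a = rng.randrange(len(names))
            else:
                a = max(range(len(names)), key=lambda i: q[s][i])
            reward = cosine(library[names[a]], segment)
            # The episode ends after the last segment, so its successor value is 0
            next_best = max(q[s + 1]) if s + 1 < len(segments) else 0.0
            # Standard Q-learning update
            q[s][a] += alpha * (reward + gamma * next_best - q[s][a])

    plan = []
    for s, segment in enumerate(segments):
        a = max(range(len(names)), key=lambda i: q[s][i])
        if cosine(library[names[a]], segment) >= threshold:
            plan.append(names[a])
        else:
            # Library judged insufficient here: a new demonstration would be needed
            plan.append(None)
    return plan


if __name__ == "__main__":
    # Hypothetical feature library and new-task segments
    library = {"reach": [1.0, 0.0, 0.0],
               "grasp": [0.0, 1.0, 0.0],
               "place": [0.0, 0.0, 1.0]}
    segments = [[0.9, 0.1, 0.0], [0.1, 0.95, 0.0], [0.5, 0.5, 0.5]]
    print(plan_with_q_learning(library, segments))
```

With these toy vectors, the first two segments should be matched to `reach` and `grasp`, while the third resembles no single library feature closely enough and is returned as `None`, mirroring the case where further human demonstration would be requested.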