ACNMP: Skill Transfer and Task Extrapolation through Learning from Demonstration and Reinforcement Learning via Representation Sharing

被引:0
|
作者
Akbulut, M. Tuluhan [1 ]
Oztop, Erhan [2 ]
Seker, M. Yunus [1 ]
Xue, Honghu [3 ]
Tekden, Ahmet E. [1 ]
Ugur, Emre [1 ]
机构
[1] Bogazici Univ, Istanbul, Turkey
[2] Ozyegin Univ, Istanbul, Turkey
[3] Univ Lubeck, Lubeck, Germany
来源
CONFERENCE ON ROBOT LEARNING, VOL 155 | 2020年 / 155卷
关键词
Learning from Demonstration; Reinforcement Learning; Deep Learning; Representation Learning;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To equip robots with dexterous skills, an effective approach is to first transfer the desired skill via Learning from Demonstration (LfD), then let the robot improve it by self-exploration via Reinforcement Learning (RL). In this paper, we propose a novel LfD+RL framework, namely Adaptive Conditional Neural Movement Primitives (ACNMP), that allows efficient policy improvement in novel environments and effective skill transfer between different agents. This is achieved through exploiting the latent representation learned by the underlying Conditional Neural Process (CNP) model, and simultaneous training of the model with supervised learning (SL) for acquiring the demonstrated trajectories and via RL for new trajectory discovery. Through simulation experiments, we show that (i) ACNMP enables the system to extrapolate to situations where pure LfD fails; (ii) Simultaneous training of the system through SL and RL preserves the shape of demonstrations while adapting to novel situations due to the shared representations used by both learners; (iii) ACNMP enables order-of-magnitude sample-efficient RL in extrapolation of reaching tasks compared to the existing approaches; (iv) ACNMPs can be used to implement skill transfer between robots having different morphology, with competitive learning speeds and importantly with less number of assumptions compared to the state-of-the-art approaches. Finally, we show the real-world suitability of ACNMPs through real robot experiments that involve obstacle avoidance, pick and place and pouring actions.
引用
收藏
页码:1896 / 1907
页数:12
相关论文
共 50 条
  • [41] MAML2: meta reinforcement learning via meta-learning for task categories
    FU Qiming
    WANG Zhechao
    FANG Nengwei
    XING Bin
    ZHANG Xiao
    CHEN Jianping
    Frontiers of Computer Science, 2023, 17 (04)
  • [42] MAML2: meta reinforcement learning via meta-learning for task categories
    Fu, Qiming
    Wang, Zhechao
    Fang, Nengwei
    Xing, Bin
    Zhang, Xiao
    Chen, Jianping
    FRONTIERS OF COMPUTER SCIENCE, 2023, 17 (04)
  • [43] Efficient Reinforcement Learning of Task Planners for Robotic Palletization Through Iterative Action Masking Learning
    Wu, Zheng
    Li, Yichuan
    Zhan, Wei
    Liu, Changliu
    Liu, Yun-Hui
    Tomizuka, Masayoshi
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (11): : 9303 - 9310
  • [44] Multi-agent Transfer Learning in Reinforcement Learning-based Ride-sharing Systems
    Castagna, Alberto
    Dusparic, Ivana
    ICAART: PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 2, 2022, : 120 - 130
  • [45] Automated Robot Skill Learning from Demonstration for Various Robot Systems
    Gutzeit, Lisa
    Fabisch, Alexander
    Petzoldt, Christoph
    Wiese, Hendrik
    Kirchner, Frank
    ADVANCES IN ARTIFICIAL INTELLIGENCE, KI 2019, 2019, 11793 : 168 - 181
  • [46] Enhancing HVAC control systems through transfer learning with deep reinforcement learning agents
    Kadamala, Kevlyn
    Chambers, Des
    Barrett, Enda
    SMART ENERGY, 2024, 13
  • [47] Enhancing fog load balancing through lifelong transfer learning of reinforcement learning agents
    Ebrahim, Maad
    Hafid, Abdelhakim
    Abid, Mohamed Riduan
    COMPUTER COMMUNICATIONS, 2025, 231
  • [48] Enhancing Manycore Lifetime Through Reinforcement Learning Task Mapping and Migration
    Weber, Tama Ianiski
    Zanini, Vitor Balbinot
    Moraes, Fernando Gehm
    2024 37TH SBC/SBMICRO/IEEE SYMPOSIUM ON INTEGRATED CIRCUITS AND SYSTEMS DESIGN, SBCCI 2024, 2024, : 185 - 189
  • [49] An Optimal Transfer of Knowledge in Reinforcement Learning through Greedy Approach
    Kumari, Deepika
    Chaudhary, Mahima
    Mishra, Ashish Kumar
    2019 10TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2019,
  • [50] Improving Batch Reinforcement Learning Performance through Transfer of Samples
    Lazaric, Alessandro
    Restelli, Marcello
    Bonarini, Andrea
    STAIRS 2008, 2008, 179 : 106 - 117