Optimistic Reinforcement Learning-Based Skill Insertions for Task and Motion Planning

被引:1
|
作者
Liu, Gaoyuan [1 ,2 ]
de Winter, Joris [1 ]
Durodie, Yuri [1 ,2 ]
Steckelmacher, Denis [3 ]
Nowe, Ann [3 ]
Vanderborght, Bram [1 ,2 ]
机构
[1] Vrije Univ Brussel, Brubot, B-1050 Brussels, Belgium
[2] IMEC, B-3001 Leuven, Belgium
[3] Vrije Univ Brussel, Artificial Intelligence AI Lab, B-1050 Brussels, Belgium
来源
IEEE ROBOTICS AND AUTOMATION LETTERS | 2024年 / 9卷 / 06期
关键词
Manipulation planning; reinforcement learning; task and motion planning; SAMPLING-BASED METHODS;
D O I
10.1109/LRA.2024.3398402
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Task and motion planning (TAMP) for robotics manipulation necessitates long-horizon reasoning involving versatile actions and skills. While deterministic actions can be crafted by sampling or optimizing with certain constraints, planning actions with uncertainty, i.e., probabilistic actions, remains a challenge for TAMP. On the contrary, Reinforcement Learning (RL) excels in acquiring versatile, yet short-horizon, manipulation skills that are robust with uncertainties. In this letter, we design a method that integrates RL skills into TAMP pipelines. Besides the policy, a RL skill is defined with data-driven logical components that enable the skill to be deployed by symbolic planning. A plan refinement sub-routine is designed to further tackle the inevitable effect uncertainties. In the experiments, we compare our method with baseline hierarchical planning from both TAMP and RL fields and illustrate the strength of the method. The results show that by embedding RL skills, we extend the capability of TAMP to domains with probabilistic skills, and improve the planning efficiency compared to the previous methods.
引用
收藏
页码:5974 / 5981
页数:8
相关论文
共 50 条
  • [31] Embodied Lifelong Learning for Task and Motion Planning
    Mendez-Mendez, Jorge
    Kaelbling, Leslie Pack
    Lozano-Perez, Tomas
    CONFERENCE ON ROBOT LEARNING, VOL 229, 2023, 229
  • [32] A Reinforcement Learning-Based Adaptive Learning System
    Shawky, Doaa
    Badawi, Ashraf
    INTERNATIONAL CONFERENCE ON ADVANCED MACHINE LEARNING TECHNOLOGIES AND APPLICATIONS (AMLTA2018), 2018, 723 : 221 - 231
  • [33] Comprehensive survey on reinforcement learning-based task offloading techniques in aerial edge computing
    Nabi, Ahmadun
    Baidya, Tanmay
    Moh, Sangman
    INTERNET OF THINGS, 2024, 28
  • [34] Robotic Disassembly Task Training and Skill Transfer Using Reinforcement Learning
    Qu, Mo
    Wang, Yongjing
    Pham, Duc Truong
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (11) : 10934 - 10943
  • [35] Reinforcement Learning-based Task Offloading of MEC-assisted UAVs in Precision Agriculture
    Yang, Zih-Yi
    Chiu, Te-Chuan
    Sheu, Jang-Ping
    IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 5587 - 5592
  • [36] Representation, learning, and planning algorithms for geometric task and motion planning
    Kim, Beomjoon
    Shimanuki, Luke
    Kaelbling, Leslie Pack
    Lozano-Perez, Tomas
    INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2022, 41 (02) : 210 - 231
  • [37] Deep reinforcement learning-based critical element identification and demolition planning of frame structures
    Zhu, Shaojun
    Ohsaki, Makoto
    Hayashi, Kazuki
    Zong, Shaohan
    Guo, Xiaonong
    FRONTIERS OF STRUCTURAL AND CIVIL ENGINEERING, 2022, 16 (11) : 1397 - 1414
  • [38] Reinforcement Learning-Based Collision Avoidance and Optimal Trajectory Planning in UAV Communication Networks
    Hsu, Yu-Hsin
    Gau, Rung-Hung
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2022, 21 (01) : 306 - 320
  • [39] Motion Planning for Industrial Robots using Reinforcement Learning
    Meyes, Richard
    Tercan, Hasan
    Roggendorf, Simon
    Thiele, Thomas
    Buescher, Christian
    Obdenbusch, Markus
    Brecher, Christian
    Jeschke, Sabina
    Meisen, Tobias
    MANUFACTURING SYSTEMS 4.0, 2017, 63 : 107 - 112
  • [40] Curiosity driven reinforcement learning for motion planning on humanoids
    Frank, Mikhail
    Leitner, Juregen
    Stollenga, Marijn
    Foerster, Alexander
    Schmidhuber, Juergen
    FRONTIERS IN NEUROROBOTICS, 2014, 7 : 1 - 15