Optimistic Reinforcement Learning-Based Skill Insertions for Task and Motion Planning

被引:1
|
作者
Liu, Gaoyuan [1 ,2 ]
de Winter, Joris [1 ]
Durodie, Yuri [1 ,2 ]
Steckelmacher, Denis [3 ]
Nowe, Ann [3 ]
Vanderborght, Bram [1 ,2 ]
机构
[1] Vrije Univ Brussel, Brubot, B-1050 Brussels, Belgium
[2] IMEC, B-3001 Leuven, Belgium
[3] Vrije Univ Brussel, Artificial Intelligence AI Lab, B-1050 Brussels, Belgium
来源
IEEE ROBOTICS AND AUTOMATION LETTERS | 2024年 / 9卷 / 06期
关键词
Manipulation planning; reinforcement learning; task and motion planning; SAMPLING-BASED METHODS;
D O I
10.1109/LRA.2024.3398402
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Task and motion planning (TAMP) for robotics manipulation necessitates long-horizon reasoning involving versatile actions and skills. While deterministic actions can be crafted by sampling or optimizing with certain constraints, planning actions with uncertainty, i.e., probabilistic actions, remains a challenge for TAMP. On the contrary, Reinforcement Learning (RL) excels in acquiring versatile, yet short-horizon, manipulation skills that are robust with uncertainties. In this letter, we design a method that integrates RL skills into TAMP pipelines. Besides the policy, a RL skill is defined with data-driven logical components that enable the skill to be deployed by symbolic planning. A plan refinement sub-routine is designed to further tackle the inevitable effect uncertainties. In the experiments, we compare our method with baseline hierarchical planning from both TAMP and RL fields and illustrate the strength of the method. The results show that by embedding RL skills, we extend the capability of TAMP to domains with probabilistic skills, and improve the planning efficiency compared to the previous methods.
引用
收藏
页码:5974 / 5981
页数:8
相关论文
共 50 条
  • [1] Synergistic Task and Motion Planning With Reinforcement Learning-Based Non-Prehensile Actions
    Liu, Gaoyuan
    de Winter, Joris
    Steckelmacher, Denis
    Hota, Roshan Kumar
    Nowe, Ann
    Vanderborght, Bram
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (05) : 2764 - 2771
  • [2] Reinforcement Learning-Based Motion Planning for Automatic Parking System
    Zhang, Jiren
    Chen, Hui
    Song, Shaoyu
    Hu, Fengwei
    IEEE ACCESS, 2020, 8 : 154485 - 154501
  • [3] A review of mobile robot motion planning methods: from classical motion planning workflows to reinforcement learning-based architectures
    Dong, Lu
    He, Zichen
    Song, Chunwei
    Sun, Changyin
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2023, 34 (02) : 439 - 459
  • [4] A Reinforcement Learning-Based Framework for Robot Manipulation Skill Acquisition
    Liu, Dong
    Wang, Zitu
    Lu, Binpeng
    Cong, Ming
    Yu, Honghua
    Zou, Qiang
    IEEE ACCESS, 2020, 8 : 108429 - 108437
  • [5] A Reinforcement Learning-based Path Planning for Collaborative UAVs
    Rahim, Shahnila
    Razaq, Mian Muaz
    Chang, Shih Yu
    Peng, Limei
    37TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, 2022, : 1938 - 1943
  • [6] Reinforcement Learning-Based Skill Acquisition Study for Linearly Heated Robots
    Zhang, Zhiliang
    Ikeura, Ryojun
    Hayakawa, Soichiro
    Wang, Zheng
    2024 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION, ICMA 2024, 2024, : 1397 - 1402
  • [7] Automatic Parking Motion Planning Based on Reinforcement Learning
    Zhang J.
    Chen H.
    Song S.
    Hu F.
    Tongji Daxue Xuebao/Journal of Tongji University, 2019, 47 : 186 - 190
  • [8] Learning to Search in Task and Motion Planning With Streams
    Khodeir, Mohamed
    Agro, Ben
    Shkurti, Florian
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (04) : 1983 - 1990
  • [9] Learning-Based Probabilistic LTL Motion Planning With Environment and Motion Uncertainties
    Cai, Mingyu
    Peng, Hao
    Li, Zhijun
    Kan, Zhen
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2021, 66 (05) : 2386 - 2392
  • [10] A Reinforcement Learning-Based Path Planning Considering Degree of Observability
    Cho, Yong Hyeon
    Park, Chan Gook
    PROCEEDINGS OF THE 2020 INTERNATIONAL CONFERENCE ON ARTIFICIAL LIFE AND ROBOTICS (ICAROB2020), 2020, : 502 - 505