Sequential Dexterity: Chaining Dexterous Policies for Long-Horizon Manipulation

被引:0
作者
Chen, Yuanpei [1 ]
Wang, Chen [1 ]
Li Fei-Fei [1 ]
Liu, Karen [1 ]
机构
[1] Stanford Univ, Stanford, CA 94305 USA
来源
CONFERENCE ON ROBOT LEARNING, VOL 229 | 2023年 / 229卷
基金
美国国家科学基金会;
关键词
Dexterous Manipulation; Long-Horizon Manipulation; Reinforcement Learning; MOTION; TASK;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many real-world manipulation tasks consist of a series of subtasks that are significantly different from one another. Such long-horizon, complex tasks highlight the potential of dexterous hands, which possess adaptability and versatility, capable of seamlessly transitioning between different modes of functionality without the need for re-grasping or external tools. However, the challenges arise due to the high-dimensional action space of dexterous hand and complex compositional dynamics of the long-horizon tasks. We present Sequential Dexterity, a general system based on reinforcement learning (RL) that chains multiple dexterous policies for achieving long-horizon task goals. The core of the system is a transition feasibility function that progressively finetunes the sub-policies for enhancing chaining success rate, while also enables autonomous policy-switching for recovery from failures and bypassing redundant stages. Despite being trained only in simulation with a few task objects, our system demonstrates generalization capability to novel object shapes and is able to zero-shot transfer to a real-world robot equipped with a dexterous hand. Code and videos are available at sequential-dexterity.github.io.
引用
收藏
页数:21
相关论文
共 66 条
  • [1] Agia C., 2022, ARXIV
  • [2] Ahn Michael, 2022, arXiv
  • [3] Akkaya Ilge, 2019, Solving rubik's cube with a robot hand
  • [4] Learning dexterous in-hand manipulation
    Andrychowicz, Marcin
    Baker, Bowen
    Chociej, Maciek
    Jozefowicz, Rafal
    McGrew, Bob
    Pachocki, Jakub
    Petron, Arthur
    Plappert, Matthias
    Powell, Glenn
    Ray, Alex
    Schneider, Jonas
    Sidor, Szymon
    Tobin, Josh
    Welinder, Peter
    Weng, Lilian
    Zaremba, Wojciech
    [J]. INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2020, 39 (01) : 3 - 20
  • [5] [Anonymous], 2021, PMLR
  • [6] Arunachalam S. P., 2022, ARXIV
  • [7] Bacon PL, 2017, AAAI CONF ARTIF INTE, P1726
  • [8] Bai YF, 2014, IEEE INT CONF ROBOT, P1560, DOI 10.1109/ICRA.2014.6907059
  • [9] Chen T., 2021, C ROB LEARN
  • [10] Chen T., 2022, arXiv