Sequential Dexterity: Chaining Dexterous Policies for Long-Horizon Manipulation

被引：0

作者：

Chen, Yuanpei ^{[1
]}

Wang, Chen ^{[1
]}

Li Fei-Fei ^{[1
]}

Liu, Karen ^{[1
]}

机构：

[1] Stanford Univ, Stanford, CA 94305 USA

来源：

CONFERENCE ON ROBOT LEARNING, VOL 229 | 2023年 / 229卷

基金：

美国国家科学基金会;

关键词：

Dexterous Manipulation; Long-Horizon Manipulation; Reinforcement Learning; MOTION; TASK;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Many real-world manipulation tasks consist of a series of subtasks that are significantly different from one another. Such long-horizon, complex tasks highlight the potential of dexterous hands, which possess adaptability and versatility, capable of seamlessly transitioning between different modes of functionality without the need for re-grasping or external tools. However, the challenges arise due to the high-dimensional action space of dexterous hand and complex compositional dynamics of the long-horizon tasks. We present Sequential Dexterity, a general system based on reinforcement learning (RL) that chains multiple dexterous policies for achieving long-horizon task goals. The core of the system is a transition feasibility function that progressively finetunes the sub-policies for enhancing chaining success rate, while also enables autonomous policy-switching for recovery from failures and bypassing redundant stages. Despite being trained only in simulation with a few task objects, our system demonstrates generalization capability to novel object shapes and is able to zero-shot transfer to a real-world robot equipped with a dexterous hand. Code and videos are available at sequential-dexterity.github.io.

引用

页数：21

共 66 条

[1] Agia C., 2022, ARXIV
[2] Ahn Michael, 2022, arXiv
[3] Akkaya Ilge, 2019, Solving rubik's cube with a robot hand
[4] Learning dexterous in-hand manipulation
Andrychowicz, Marcin
Baker, Bowen
Chociej, Maciek
Jozefowicz, Rafal
McGrew, Bob
Pachocki, Jakub
Petron, Arthur
Plappert, Matthias
Powell, Glenn
Ray, Alex
Schneider, Jonas
Sidor, Szymon
Tobin, Josh
Welinder, Peter
Weng, Lilian
Zaremba, Wojciech
[J]. INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2020, 39 (01) : 3 - 20
[5] [Anonymous], 2021, PMLR
[6] Arunachalam S. P., 2022, ARXIV
[7] Bacon PL, 2017, AAAI CONF ARTIF INTE, P1726
[8] Bai YF, 2014, IEEE INT CONF ROBOT, P1560, DOI 10.1109/ICRA.2014.6907059
[9] Chen T., 2021, C ROB LEARN
[10] Chen T., 2022, arXiv

← 1 2 3 4 5 6 7 →