Dynamics Learning with Cascaded Variational Inference for Multi-Step Manipulation

被引：0

作者：

Fang, Kuan ^{[1
]}

Zhu, Yuke ^{[1
,2
]}

Garg, Animesh ^{[2
,3
,4
]}

Savarese, Silvio ^{[1
]}

Li Fei-Fei ^{[1
]}

机构：

[1] Stanford Univ, Stanford, CA 94305 USA

[2] Nvidia, Santa Clara, CA USA

[3] Univ Toronto, Toronto, ON, Canada

[4] Vector Inst, Toronto, ON, Canada

来源：

CONFERENCE ON ROBOT LEARNING, VOL 100 | 2019年 / 100卷

关键词：

dynamics modeling; latent-space planning; variational inference;

D O I：

暂无

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

The fundamental challenge of planning for multi-step manipulation is to find effective and plausible action sequences that lead to the task goal. We present Cascaded Variational Inference Planner (CAVIN), a model-based method that hierarchically generates plans by sampling from latent spaces. To facilitate planning over long time horizons, our method learns latent representations that decouple the prediction of high-level effects from the generation of low-level motions through cascaded variational inference. This enables us to model dynamics at two different levels of temporal resolutions for hierarchical planning. We evaluate our approach in three multi-step robotic manipulation tasks in cluttered tabletop environments given raw visual observations. Empirical results demonstrate that the proposed method outperforms state-of-the-art model-based approaches by strategically planning for interactions with multiple objects. See more details at pair.stanford.edu/cavin

引用

页数：11

共 30 条

[1] Agrawal P., 2016, Advances in neural information processing systems, P5074, DOI DOI 10.48550/ARXIV.1606.07419
[2] Andreas J, 2017, PR MACH LEARN RES, V70
[3] Neural Module Networks
Andreas, Jacob
Rohrbach, Marcus
Darrell, Trevor
Klein, Dan
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 39 - 48
[4] Calli B, 2015, PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS (ICAR), P510, DOI 10.1109/ICAR.2015.7251504
[5] Camacho E.F., 2007, ADV TK CONT SIGN PRO, DOI 10.1007/978-0-85729-398-5
[6] Chandak Y, 2019, Arxiv, DOI arXiv:1902.00183
[7] Coumans E, 2016, PYBULLET PYTHON MODU
[8] Co-Reyes JD, 2018, Arxiv, DOI arXiv:1806.02813
[9] Danielczuk M., 2019, arXiv
[10] Fang K, 2018, ROBOTICS: SCIENCE AND SYSTEMS XIV

← 1 2 3 →