Dynamics Learning with Cascaded Variational Inference for Multi-Step Manipulation

被引:0
作者
Fang, Kuan [1 ]
Zhu, Yuke [1 ,2 ]
Garg, Animesh [2 ,3 ,4 ]
Savarese, Silvio [1 ]
Li Fei-Fei [1 ]
机构
[1] Stanford Univ, Stanford, CA 94305 USA
[2] Nvidia, Santa Clara, CA USA
[3] Univ Toronto, Toronto, ON, Canada
[4] Vector Inst, Toronto, ON, Canada
来源
CONFERENCE ON ROBOT LEARNING, VOL 100 | 2019年 / 100卷
关键词
dynamics modeling; latent-space planning; variational inference;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The fundamental challenge of planning for multi-step manipulation is to find effective and plausible action sequences that lead to the task goal. We present Cascaded Variational Inference Planner (CAVIN), a model-based method that hierarchically generates plans by sampling from latent spaces. To facilitate planning over long time horizons, our method learns latent representations that decouple the prediction of high-level effects from the generation of low-level motions through cascaded variational inference. This enables us to model dynamics at two different levels of temporal resolutions for hierarchical planning. We evaluate our approach in three multi-step robotic manipulation tasks in cluttered tabletop environments given raw visual observations. Empirical results demonstrate that the proposed method outperforms state-of-the-art model-based approaches by strategically planning for interactions with multiple objects. See more details at pair.stanford.edu/cavin
引用
收藏
页数:11
相关论文
共 30 条
  • [1] Agrawal P., 2016, Advances in neural information processing systems, P5074, DOI DOI 10.48550/ARXIV.1606.07419
  • [2] Andreas J, 2017, PR MACH LEARN RES, V70
  • [3] Neural Module Networks
    Andreas, Jacob
    Rohrbach, Marcus
    Darrell, Trevor
    Klein, Dan
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 39 - 48
  • [4] Calli B, 2015, PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS (ICAR), P510, DOI 10.1109/ICAR.2015.7251504
  • [5] Camacho E.F., 2007, ADV TK CONT SIGN PRO, DOI 10.1007/978-0-85729-398-5
  • [6] Chandak Y, 2019, Arxiv, DOI arXiv:1902.00183
  • [7] Coumans E, 2016, PYBULLET PYTHON MODU
  • [8] Co-Reyes JD, 2018, Arxiv, DOI arXiv:1806.02813
  • [9] Danielczuk M., 2019, arXiv
  • [10] Fang K, 2018, ROBOTICS: SCIENCE AND SYSTEMS XIV