DenseFusion: 6D Object Pose Estimation by Iterative Dense Fusion

被引:651
作者
Wang, Chen [2 ]
Xu, Danfei [1 ]
Zhu, Yuke [1 ]
Martin-Martin, Roberto [1 ]
Lu, Cewu [2 ]
Li Fei-Fei [1 ]
Savarese, Silvio [1 ]
机构
[1] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
[2] Shanghai Jiao Tong Univ, Dept Comp Sci, Shanghai, Peoples R China
来源
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) | 2019年
关键词
RECOGNITION; SINGLE;
D O I
10.1109/CVPR.2019.00346
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A key technical challenge in performing 6D object pose estimation from RGB-D image is to fully leverage the two complementary data sources. Prior works either extract information from the RGB image and depth separately or use costly post-processing steps, limiting their performances in highly cluttered scenes and real-time applications. In this work, we present DenseFusion, a generic framework for estimating 6D pose of a set of known objects from RGBD images. DenseFusion is a heterogeneous architecture that processes the two data sources individually and uses a novel dense fusion network to extract pixel-wise dense feature embedding, from which the pose is estimated. Furthermore, we integrate an end-to-end iterative pose refinement procedure that further improves the pose estimation while achieving near real-time inference. Our experiments show that our method outperforms state-of-the-art approaches in two datasets, YCB-Video and LineMOD. We also deploy our proposed method to a real robot to grasp and manipulate objects based on the estimated pose. Our code and video are available at https://sites.google.com/view/densefusion/.
引用
收藏
页码:3338 / 3347
页数:10
相关论文
共 44 条
  • [1] [Anonymous], 2017, P IEEE CVF C COMPUTE
  • [2] [Anonymous], 2017, P INT C COMP VIS ICC
  • [3] [Anonymous], 2017, Frustum pointnets for 3d object detection from rgb-d data
  • [4] [Anonymous], 2018, ARXIV180400175
  • [5] [Anonymous], 2018, ARXIV180703146
  • [6] [Anonymous], 2017, ARXIV171106396
  • [7] Seeing 3D chairs: exemplar part-based 2D-3D alignment using a large dataset of CAD models
    Aubry, Mathieu
    Maturana, Daniel
    Efros, Alexei A.
    Russell, Bryan C.
    Sivic, Josef
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 3762 - 3769
  • [8] A METHOD FOR REGISTRATION OF 3-D SHAPES
    BESL, PJ
    MCKAY, ND
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1992, 14 (02) : 239 - 256
  • [9] Brachmann E, 2014, LECT NOTES COMPUT SC, V8690, P536, DOI 10.1007/978-3-319-10605-2_35
  • [10] Rotational Subgroup Voting and Pose Clustering for Robust 3D Object Recognition
    Buch, Anders Glent
    Kiforenko, Lilita
    Kraft, Dirk
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 4137 - 4145