Weakly Supervised 6D Pose Estimation for Robotic Grasping

被引:2
作者
Li, Yaoxin [1 ]
Sun, Jinghua [2 ]
Li, Xiaoqian [2 ]
Zhang, Zhanpeng [1 ]
Cheng, Hui [3 ]
Wang, Xiaogang [4 ]
机构
[1] Sensetime Grp Ltd, Hong Kong, Peoples R China
[2] Tsinghua Univ, Beijing, Peoples R China
[3] Sun Yat Sen Univ, Guangzhou, Guangdong, Peoples R China
[4] Chinese Univ Hong Kong, Hong Kong, Peoples R China
来源
PROCEEDINGS OF THE 16TH ACM SIGGRAPH INTERNATIONAL CONFERENCE ON VIRTUAL-REALITY CONTINUUM AND ITS APPLICATIONS IN INDUSTRY (VRCAI 2018) | 2018年
关键词
weak supervision; pose estimation; robotic grasping;
D O I
10.1145/3284398.3284408
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Learning based robotic grasping methods achieve substantial progress with the development of the deep neural networks. However, the requirement of large-scale training data in the real world limits the application scopes of these methods. Given the 3D models of the target objects, we propose a new learning-based grasping approach built on 6D object poses estimation from a monocular RGB image. We aim to leverage both a large-scale synthesized 6D object pose dataset and a small scale of the real-world weakly labeled dataset (e.g., mark the number of objects in the image), to reduce the system deployment difficulty. In particular, the deep network combines the 6D pose estimation task and an auxiliary task of weak labels to perform knowledge transfer between the synthesized and real world data. We demonstrate the effectiveness of the method in a real robotic environment and show substantial improvements in the successful grasping rate (about 11.9% on average) to the proposed knowledge transfer scheme.
引用
收藏
页数:8
相关论文
共 33 条
  • [1] Bo L., 2013, EXPT ROBOTICS, P387, DOI DOI 10.1007/978-3-319-00065-7
  • [2] Brachmann E, 2014, LECT NOTES COMPUT SC, V8690, P536, DOI 10.1007/978-3-319-10605-2_35
  • [3] Histograms of oriented gradients for human detection
    Dalal, N
    Triggs, B
    [J]. 2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, : 886 - 893
  • [4] Finn C, 2016, IEEE INT CONF ROBOT, P512, DOI 10.1109/ICRA.2016.7487173
  • [5] Object detection via a multi-region & semantic segmentation-aware CNN model
    Gidaris, Spyros
    Komodakis, Nikos
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1134 - 1142
  • [6] Hinterstoisser S., 2012, P AS C COMP VIS, P548, DOI DOI 10.1007/978-3-642-37331-2_42
  • [7] Gradient Response Maps for Real-Time Detection of Textureless Objects
    Hinterstoisser, Stefan
    Cagniart, Cedric
    Ilic, Slobodan
    Sturm, Peter
    Navab, Nassir
    Fua, Pascal
    Lepetit, Vincent
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (05) : 876 - 888
  • [8] Hinterstoisser S, 2011, IEEE I CONF COMP VIS, P858, DOI 10.1109/ICCV.2011.6126326
  • [9] Dominant Orientation Templates for Real-Time Detection of Texture-Less Objects
    Hinterstoisser, Stefan
    Lepetit, Vincent
    Ilic, Slobodan
    Fua, Pascal
    Navab, Nassir
    [J]. 2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 2257 - 2264
  • [10] T-LESS: An RGB-D Dataset for 6D Pose Estimation of Texture-less Objects
    Hodan, Tomas
    Haluza, Pavel
    Obdrzalek, Stepan
    Matas, Jiri
    Lourakis, Manolis
    Zabulis, Xenophon
    [J]. 2017 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2017), 2017, : 880 - 888