Weakly Supervised 6D Pose Estimation for Robotic Grasping

被引：2

作者：

Li, Yaoxin ^{[1
]}

Sun, Jinghua ^{[2
]}

Li, Xiaoqian ^{[2
]}

Zhang, Zhanpeng ^{[1
]}

Cheng, Hui ^{[3
]}

Wang, Xiaogang ^{[4
]}

机构：

[1] Sensetime Grp Ltd, Hong Kong, Peoples R China

[2] Tsinghua Univ, Beijing, Peoples R China

[3] Sun Yat Sen Univ, Guangzhou, Guangdong, Peoples R China

[4] Chinese Univ Hong Kong, Hong Kong, Peoples R China

来源：

PROCEEDINGS OF THE 16TH ACM SIGGRAPH INTERNATIONAL CONFERENCE ON VIRTUAL-REALITY CONTINUUM AND ITS APPLICATIONS IN INDUSTRY (VRCAI 2018) | 2018年

关键词：

weak supervision; pose estimation; robotic grasping;

D O I：

10.1145/3284398.3284408

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Learning based robotic grasping methods achieve substantial progress with the development of the deep neural networks. However, the requirement of large-scale training data in the real world limits the application scopes of these methods. Given the 3D models of the target objects, we propose a new learning-based grasping approach built on 6D object poses estimation from a monocular RGB image. We aim to leverage both a large-scale synthesized 6D object pose dataset and a small scale of the real-world weakly labeled dataset (e.g., mark the number of objects in the image), to reduce the system deployment difficulty. In particular, the deep network combines the 6D pose estimation task and an auxiliary task of weak labels to perform knowledge transfer between the synthesized and real world data. We demonstrate the effectiveness of the method in a real robotic environment and show substantial improvements in the successful grasping rate (about 11.9% on average) to the proposed knowledge transfer scheme.

引用

页数：8

共 33 条

[1]

Bo L., 2013, EXPT ROBOTICS, P387, DOI DOI 10.1007/978-3-319-00065-7

[2]

Brachmann E, 2014, LECT NOTES COMPUT SC, V8690, P536, DOI 10.1007/978-3-319-10605-2_35

[3] Histograms of oriented gradients for human detection [J].

Dalal, N ;

Triggs, B .

2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893

[4]

Finn C, 2016, IEEE INT CONF ROBOT, P512, DOI 10.1109/ICRA.2016.7487173

[5] Object detection via a multi-region & semantic segmentation-aware CNN model [J].

Gidaris, Spyros ;

Komodakis, Nikos .

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1134-1142

[6]

Hinterstoisser S., 2012, P AS C COMP VIS, P548, DOI DOI 10.1007/978-3-642-37331-2_42

[7] Gradient Response Maps for Real-Time Detection of Textureless Objects [J].

Hinterstoisser, Stefan ;

Cagniart, Cedric ;

Ilic, Slobodan ;

Sturm, Peter ;

Navab, Nassir ;

Fua, Pascal ;

Lepetit, Vincent .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (05) :876-888

[8]

Hinterstoisser S, 2011, IEEE I CONF COMP VIS, P858, DOI 10.1109/ICCV.2011.6126326

[9] Dominant Orientation Templates for Real-Time Detection of Texture-Less Objects [J].

Hinterstoisser, Stefan ;

Lepetit, Vincent ;

Ilic, Slobodan ;

Fua, Pascal ;

Navab, Nassir .

2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, :2257-2264

[10] T-LESS: An RGB-D Dataset for 6D Pose Estimation of Texture-less Objects [J].

Hodan, Tomas ;

Haluza, Pavel ;

Obdrzalek, Stepan ;

Matas, Jiri ;

Lourakis, Manolis ;

Zabulis, Xenophon .

2017 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2017), 2017, :880-888

← 1 2 3 4 →