Making Deep Heatmaps Robust to Partial Occlusions for 3D Object Pose Estimation

被引:166
作者
Oberweger, Markus [1 ]
Rad, Mahdi [1 ]
Lepetit, Vincent [1 ,2 ]
机构
[1] Graz Univ Technol, Inst Comp Graph & Vis, Graz, Austria
[2] Univ Bordeaux, Lab Bordelais Rech Informat, Bordeaux, France
来源
COMPUTER VISION - ECCV 2018, PT 15 | 2018年 / 11219卷
关键词
3D object pose estimation; Heatmaps; Occlusions;
D O I
10.1007/978-3-030-01267-0_8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We introduce a novel method for robust and accurate 3D object pose estimation from a single color image under large occlusions. Following recent approaches, we first predict the 2D projections of 3D points related to the target object and then compute the 3D pose from these correspondences using a geometric method. Unfortunately, as the results of our experiments show, predicting these 2D projections using a regular CNN or a Convolutional Pose Machine is highly sensitive to partial occlusions, even when these methods are trained with partially occluded examples. Our solution is to predict heatmaps from multiple small patches independently and to accumulate the results to obtain accurate and robust predictions. Training subsequently becomes challenging because patches with similar appearances but different positions on the object correspond to different heatmaps. However, we provide a simple yet effective solution to deal with such ambiguities. We show that our approach outperforms existing methods on two challenging datasets: The Occluded LineMOD dataset and the YCB-Video dataset, both exhibiting cluttered scenes with highly occluded objects.
引用
收藏
页码:125 / 141
页数:17
相关论文
共 35 条
[1]  
[Anonymous], 2012, PROC ASIAN C COMPUT
[2]  
[Anonymous], 2000, Multiple View Geometry in Computer Vision
[3]  
[Anonymous], 2015, INT C MACH LEARN
[4]  
[Anonymous], 2014, P BRIT MACH VIS C NO
[5]  
[Anonymous], 2017, P BRIT MACH VIS C
[6]   Uncertainty-Driven 6D Pose Estimation of Objects and Scenes from a Single RGB Image [J].
Brachmann, Eric ;
Michel, Frank ;
Krull, Alexander ;
Yang, Michael Ying ;
Gumhold, Stefan ;
Rother, Carsten .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3364-3372
[7]  
Brachmann E, 2014, LECT NOTES COMPUT SC, V8690, P536, DOI 10.1007/978-3-319-10605-2_35
[8]   Rotational Subgroup Voting and Pose Clustering for Robust 3D Object Recognition [J].
Buch, Anders Glent ;
Kiforenko, Lilita ;
Kraft, Dirk .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :4137-4145
[9]   Yale-CMU-Berkeley dataset for robotic manipulation research [J].
Calli, Berk ;
Singh, Arjun ;
Bruce, James ;
Walsman, Aaron ;
Konolige, Kurt ;
Srinivasa, Siddhartha ;
Abbeel, Pieter ;
Dollar, Aaron M. .
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2017, 36 (03) :261-268
[10]   Robust 3D Object Tracking from Monocular Images Using Stable Parts [J].
Crivellaro, Alberto ;
Rad, Mahdi ;
Verdie, Yannick ;
Yi, Kwang Moo ;
Fua, Pascal ;
Lepetit, Vincent .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (06) :1465-1479