Reconstruct Locally, Localize Globally: A Model Free Method for Object Pose Estimation

被引:12
作者
Cai, Ming [1 ]
Reid, Ian [1 ]
机构
[1] Univ Adelaide, Adelaide, SA, Australia
来源
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2020年
基金
澳大利亚研究理事会;
关键词
D O I
10.1109/CVPR42600.2020.00322
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Six degree-of-freedom pose estimation of a known object in a single image is a long-standing computer vision objective. It is classically posed as a correspondence problem between a known geometric model, such as a CAD model, and image locations. If a CAD model is not available, it is possible to use multi-view visual reconstruction methods to create a geometric model, and use this in the same manner. Instead, we propose a learning-based method whose input is a collection of images of a target object, and whose output is the pose of the object in a novel view At inference time, our method maps from the RoI features of the input image to a dense collection of object-centric 3D coordinates, one per pixel. This dense 2D-3D mapping is then used to determine 6dof pose using standard PnP plus RANSAC. The model that maps 2D to object 3D coordinates is established at training time by automatically discovering and matching image landmarks that are consistent across multiple views. We show that this method eliminates the requirement for a 3D CAD model (needed by classical geometry-based methods and state-of-the-art learning based methods alike) but still achieves performance on a par with the prior art.
引用
收藏
页码:3150 / 3160
页数:11
相关论文
共 57 条
[31]   Distinctive image features from scale-invariant keypoints [J].
Lowe, DG .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2004, 60 (02) :91-110
[32]  
Marchand E., 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision, P262, DOI 10.1109/ICCV.1999.791229
[33]   MOPED: A Scalable and Low Latency Object Recognition and Pose Estimation System [J].
Martinez, Manuel ;
Collet, Alvaro ;
Srinivasa, Siddhartha S. .
2010 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2010, :2043-2049
[34]   ORB-SLAM2: An Open-Source SLAM System for Monocular, Stereo, and RGB-D Cameras [J].
Mur-Artal, Raul ;
Tardos, Juan D. .
IEEE TRANSACTIONS ON ROBOTICS, 2017, 33 (05) :1255-1262
[35]   Making Deep Heatmaps Robust to Partial Occlusions for 3D Object Pose Estimation [J].
Oberweger, Markus ;
Rad, Mahdi ;
Lepetit, Vincent .
COMPUTER VISION - ECCV 2018, PT 15, 2018, 11219 :125-141
[36]  
Pan Qi., 2009, BMVC, V2, P6
[37]  
Park Kiru, 2019, IEEE INT C COMP VIS
[38]   Real-time Model-based Rigid Object Pose Estimation and Tracking Combining Dense and Sparse Visual Cues [J].
Pauwels, Karl ;
Rubio, Leonardo ;
Diaz, Javier ;
Ros, Eduardo .
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, :2347-2354
[39]   PVNet: Pixel-wise Voting Network for 6DoF Pose Estimation [J].
Peng, Sida ;
Liu, Yuan ;
Huang, Qixing ;
Zhou, Xiaowei ;
Bao, Hujun .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :4556-4565
[40]  
Qi Pan, 2010, 2010 33rd International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO), P252