OSOP: A Multi-Stage One Shot Object Pose Estimation Framework

被引:47
作者
Shugurov, Ivan [1 ,3 ]
Li, Fu [1 ,2 ]
Busam, Benjamin [1 ]
Ilic, Slobodan [1 ,3 ]
机构
[1] Tech Univ Munich, Munich, Germany
[2] NUDT, Changsha, Peoples R China
[3] Siemens AG, Munich, Germany
来源
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022) | 2022年
关键词
D O I
10.1109/CVPR52688.2022.00671
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a novel one-shot method for object detection and 6 DoF pose estimation, that does not require training on target objects. At test time, it takes as input a target image and a textured 3D query model. The core idea is to represent a 3D model with a number of 2D templates rendered from different viewpoints. This enables CNN-based direct dense feature extraction and matching. The object is first localized in 2D, then its approximate viewpoint is estimated, followed by dense 2D-3D correspondence prediction. The final pose is computed with PnP. We evaluate the method on LineMOD, Occlusion, Homebrewed, YCB-V and TLESS datasets and report very competitive performance in comparison to the state-of-the-art methods trained on synthetic data, even though our method is not trained on the object models used for testing.
引用
收藏
页码:6825 / 6834
页数:10
相关论文
共 59 条
[41]  
Redmon J, 2018, Arxiv, DOI arXiv:1804.02767
[42]   Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks [J].
Ren, Shaoqing ;
He, Kaiming ;
Girshick, Ross ;
Sun, Jian .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (06) :1137-1149
[43]  
Rocco I., 2018, TPAMI, P3
[44]   End-to-end weakly-supervised semantic alignment [J].
Rocco, Ignacio ;
Arandjelovic, Relja ;
Sivic, Josef .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :6917-6925
[45]   Convolutional neural network architecture for geometric matching [J].
Rocco, Ignacio ;
Arandjelovic, Relja ;
Sivic, Josef .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :39-48
[46]   U-Net: Convolutional Networks for Biomedical Image Segmentation [J].
Ronneberger, Olaf ;
Fischer, Philipp ;
Brox, Thomas .
MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION, PT III, 2015, 9351 :234-241
[47]  
Shugurov I., 2021, IEEE ROBOTICS AUTOMA, V1
[48]  
Shugurov Ivan, 2021, IEEE ROBOTICS AUTOMA, V2
[49]  
Shugurov Ivan, 2021, TPAMI, V1, P2
[50]  
Song Chen, 2020, ICCV