OnePose: One-Shot Object Pose Estimation without CAD Models

被引:61
作者
Sun, Jiaming [1 ,2 ]
Wang, Zihao [1 ]
Zhang, Siyu [2 ]
He, Xingyi [1 ]
Zhao, Hongcheng [3 ]
Zhang, Guofeng [1 ]
Zhou, Xiaowei [1 ]
机构
[1] Zhejiang Univ, Hangzhou, Peoples R China
[2] SenseTime Res, Shanghai, Peoples R China
[3] TUM, Munich, Germany
来源
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022) | 2022年
关键词
D O I
10.1109/CVPR52688.2022.00670
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a new method named OnePose for object pose estimation. Unlike existing instance-level or category-level methods, OnePose does not rely on CAD models and can handle objects in arbitrary categories without instance-or category-specific network training. OnePose draws the idea from visual localization and only requires a simple RGB video scan of the object to build a sparse SfM model of the object. Men, this model is registered to new query images with a generic feature matching network. To mitigate the slow runtime of existing visual localization methods, we propose a new graph attention network that directly matches 2D interest points in the query image with the 3D points in the SJM model, resulting in efficient and robust pose estimation. Coinbined with a feature-based pose tracker, OnePose is able to stably detect and track 6D poses of everyday household objects in real-time. We also collected a large-scale dataset that consists of 450 sequences of 150 objects. Code and data are available at the project page: https://zju3dv.github.io/onepose/.
引用
收藏
页码:6815 / 6824
页数:10
相关论文
共 43 条
[1]  
Ahmadyan Adel, 2021, CVPR
[2]  
[Anonymous], 2004, International journal of computer vision, V60, P91, DOI [DOI 10.1023/B:VISI.0000029664.99615.94, 10.1023/B:VISI.0000029664.99615.94]
[3]  
[Anonymous], 2020, NeurIPS
[4]  
[Anonymous], 2019, CVPR, DOI DOI 10.1109/CVPR.2019.00469
[5]  
[Anonymous], 2018, ECCV, DOI DOI 10.1007/978-3-030-01267-0_8
[6]   Uncertainty-Driven 6D Pose Estimation of Objects and Scenes from a Single RGB Image [J].
Brachmann, Eric ;
Michel, Frank ;
Krull, Alexander ;
Yang, Michael Ying ;
Gumhold, Stefan ;
Rother, Carsten .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3364-3372
[7]  
Brachmann E, 2014, LECT NOTES COMPUT SC, V8690, P536, DOI 10.1007/978-3-319-10605-2_35
[8]  
Cai Ming, 2020, CVPR
[9]  
Chen X, 2020, ECCV
[10]  
Collet Alvaro, 2009, ICRA, P8