SO-Pose: Exploiting Self-Occlusion for Direct 6D Pose Estimation

Cited by: 84
Authors
Di, Yan [1]
Manhardt, Fabian [2]
Wang, Gu [3]
Ji, Xiangyang [3]
Navab, Nassir [1]
Tombari, Federico [1, 2]
Affiliations
[1] Technical University of Munich, Munich, Germany
[2] Google, Mountain View, CA 94043, USA
[3] Tsinghua University, Beijing, People's Republic of China
Source
2021 IEEE/CVF International Conference on Computer Vision (ICCV 2021) | 2021
DOI
10.1109/ICCV48922.2021.01217
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Directly regressing all 6 degrees-of-freedom (6DoF) for the object pose (i.e. the 3D rotation and translation) in a cluttered environment from a single RGB image is a challenging problem. While end-to-end methods have recently demonstrated promising results at high efficiency, they are still inferior when compared with elaborate PnP/RANSAC-based approaches in terms of pose accuracy. In this work, we address this shortcoming by means of a novel reasoning about self-occlusion, in order to establish a two-layer representation for 3D objects which considerably enhances the accuracy of end-to-end 6D pose estimation. Our framework, named SO-Pose, takes a single RGB image as input and respectively generates 2D-3D correspondences as well as self-occlusion information harnessing a shared encoder and two separate decoders. Both outputs are then fused to directly regress the 6DoF pose parameters. Incorporating cross-layer consistencies that align correspondences, self-occlusion and 6D pose, we can further improve accuracy and robustness, surpassing or rivaling all other state-of-the-art approaches on various challenging datasets.
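As an illustration of the terms used in the abstract, the sketch below shows how a 6DoF pose (a 3D rotation R and a translation t) maps 3D model points to 2D pixels, which is the relationship that 2D-3D correspondences encode. It is not the SO-Pose network or the authors' code: it assumes a standard pinhole camera, and the function names, the intrinsics tuple `(fx, fy, cx, cy)`, and the sample values are hypothetical.

```python
import math


def rotation_z(angle_rad):
    """Return a 3x3 rotation matrix for a rotation about the z axis."""
    c, s = math.cos(angle_rad), math.sin(angle_rad)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]


def transform(R, t, p):
    """Apply the rigid transform p_cam = R @ p + t to one 3D point."""
    return [sum(R[i][j] * p[j] for j in range(3)) + t[i] for i in range(3)]


def project(K, p_cam):
    """Project a camera-frame point with pinhole intrinsics (fx, fy, cx, cy)."""
    fx, fy, cx, cy = K
    x, y, z = p_cam
    return (fx * x / z + cx, fy * y / z + cy)


def correspondences(R, t, K, model_points):
    """Pair each 3D model point with the 2D pixel it projects to under (R, t)."""
    return [(project(K, transform(R, t, p)), p) for p in model_points]
```

A pose estimator works in the opposite direction: given an image, it has to recover R and t. PnP/RANSAC-based approaches solve for the pose from such pixel-to-point pairs, whereas direct methods regress the pose parameters from the network's outputs.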
Pages: 12376-12385
Page count: 10