SO-Pose: Exploiting Self-Occlusion for Direct 6D Pose Estimation

Cited by: 84

Authors:

Di, Yan [1]

Manhardt, Fabian [2]

Wang, Gu [3]

Ji, Xiangyang [3]

Navab, Nassir [1]

Tombari, Federico [1,2]

Affiliations:

[1] Tech Univ Munich, Munich, Germany

[2] Google, Mountain View, CA 94043 USA

[3] Tsinghua Univ, Beijing, Peoples R China

Source:

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021) | 2021

DOI:

10.1109/ICCV48922.2021.01217

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Subject classification codes:

081104; 0812; 0835; 1405

Abstract:

Directly regressing all 6 degrees-of-freedom (6DoF) for the object pose (i.e. the 3D rotation and translation) in a cluttered environment from a single RGB image is a challenging problem. While end-to-end methods have recently demonstrated promising results at high efficiency, they are still inferior when compared with elaborate PnP/RANSAC-based approaches in terms of pose accuracy. In this work, we address this shortcoming by means of a novel reasoning about self-occlusion, in order to establish a two-layer representation for 3D objects which considerably enhances the accuracy of end-to-end 6D pose estimation. Our framework, named SO-Pose, takes a single RGB image as input and respectively generates 2D-3D correspondences as well as self-occlusion information harnessing a shared encoder and two separate decoders. Both outputs are then fused to directly regress the 6DoF pose parameters. Incorporating cross-layer consistencies that align correspondences, self-occlusion and 6D pose, we can further improve accuracy and robustness, surpassing or rivaling all other state-of-the-art approaches on various challenging datasets.
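To make the terms in the abstract concrete, here is a minimal sketch of what a 6DoF pose (a 3D rotation plus a 3D translation) does to a point on the object model, and how that produces the 2D side of a 2D-3D correspondence. It assumes a standard pinhole camera. The function names `rotation_z` and `project`, the intrinsics `fx, fy, cx, cy`, and the sample values are all illustrative assumptions. None of this is taken from the paper, and it is not the SO-Pose network itself.

```python
import math


def rotation_z(angle_rad):
    """3x3 rotation matrix for a rotation about the camera z-axis (illustrative)."""
    c, s = math.cos(angle_rad), math.sin(angle_rad)
    return [[c, -s, 0.0],
            [s, c, 0.0],
            [0.0, 0.0, 1.0]]


def project(point, R, t, fx, fy, cx, cy):
    """Map a 3D model point to a pixel under pose (R, t) and pinhole intrinsics.

    The pair (point, returned pixel) is one 2D-3D correspondence.
    """
    # Rigid transform into the camera frame: X_cam = R * X_model + t
    x_cam = [sum(R[i][j] * point[j] for j in range(3)) + t[i] for i in range(3)]
    if x_cam[2] <= 0.0:
        raise ValueError("point is behind the camera")
    # Perspective division followed by the intrinsic scaling and offset
    u = fx * x_cam[0] / x_cam[2] + cx
    v = fy * x_cam[1] / x_cam[2] + cy
    return (u, v)


# Hypothetical pose: no rotation, object placed 1 m in front of the camera
R = rotation_z(0.0)
t = [0.0, 0.0, 1.0]
pixel = project([0.1, -0.05, 0.0], R, t, fx=500.0, fy=500.0, cx=320.0, cy=240.0)
```

A PnP/RANSAC pipeline inverts this mapping from many such pairs, whereas a direct method as described in the abstract regresses the rotation and translation from the predicted correspondences without that solver.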

Pages: 12376-12385

Page count: 10
