TG-Pose: Delving Into Topology and Geometry for Category-Level Object Pose Estimation

被引：2

作者：

Zhan, Yue ^{[1
,2
]}

Wang, Xin ^{[1
,2
]}

Nie, Lang ^{[1
,2
]}

Zhao, Yang ^{[3
]}

Yang, Tangwen ^{[1
,2
]}

Ruan, Qiuqi ^{[1
,2
]}

机构：

[1] Beijing Jiaotong Univ, Inst Informat Sci, Sch Comp Sci & Technol, Beijing 100044, Peoples R China

[2] Beijing Jiaotong Univ, Sch Comp Sci & Technol, Beijing Key Lab Adv Informat Sci & Network Technol, Beijing 100044, Peoples R China

[3] City Univ Hong Kong, Dept Comp Sci, Kowloon, Hong Kong 999077, Peoples R China

来源：

IEEE TRANSACTIONS ON MULTIMEDIA | 2024年 / 26卷

基金：

中国国家自然科学基金;

关键词：

Pose estimation; Point cloud compression; Shape; Feature extraction; Solid modeling; Task analysis; Geometry; Category-level 6D object pose estimation; topological data analysis; persistent homology;

D O I：

10.1109/TMM.2024.3398291

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Category-level 6D object pose estimation aims to estimate the pose and size of unseen objects with known categories. Existing methods mainly focus on capturing geometric features to handle shape variations, and are prone to failure in occlusion and noisy environments. In this paper, we propose TG-Pose, a unified pose estimation framework that delves into topology and geometry to deal with the above issues. To exploit topological properties, we first propose a topological feature predictor and a topological label generator to dig into the underlying structural details from encoded features using persistent homology. Then, the topological and geometric features are employed to facilitate the symmetry reconstruction of the original point cloud to obtain a reliable and coherent object shape, which, in turn, guides the pose estimation. For each object category, we construct geometric and topological templates by leveraging inherent intra-class similarities. These templates enhance the reliability of pose estimation and the completeness of object structure through geometric alignment and topological guidance, especially when handling incomplete objects. Moreover, a pose-aware enhancement strategy is designed to enhance the encoder in learning pose-sensitive features and robustness to noisy point clouds. Experimental results show that TG-Pose outperforms the State-of-the-Art solutions on public benchmarks and achieves better generalization in real-world datasets.

引用

页码：9749 / 9762

页数：14

共 59 条

[1]

Adams H, 2017, J MACH LEARN RES, V18

[2]

Ba J, 2014, ACS SYM SER

[3]

Beksi WJ, 2016, IEEE INT CONF ROBOT, P5046, DOI 10.1109/ICRA.2016.7487710

[4] Topology-Aware Surface Reconstruction for Point Clouds [J].

Bruel-Gabrielsson, Rickard ;

Ganapathi-Subramanian, Vignesh ;

Skraba, Primoz ;

Guibas, Leonidas J. .

COMPUTER GRAPHICS FORUM, 2020, 39 (05) :197-207

[5] OVE6D: Object Viewpoint Encoding for Depth-based 6D Object Pose Estimation [J].

Cai, Dingding ;

Heikkia, Janne ;

Rahtu, Esa .

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :6793-6803

[6] Learning Canonical Shape Space for Category-Level 6D Object Pose and Size Estimation [J].

Chen, Dengsheng ;

Li, Jun ;

Wang, Zheng ;

Xu, Kai .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :11970-11979

[7] EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation [J].

Chen, Hansheng ;

Wang, Pichao ;

Wang, Fan ;

Tian, Wei ;

Xiong, Lu ;

Li, Hao .

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :2771-2780

[8] SGPA: Structure-Guided Prior Adaptation for Category-Level 6D Object Pose Estimation [J].

Chen, Kai ;

Dou, Qi .

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :2753-2762

[9] FS-Net: Fast Shape-based Network for Category-Level 6D Object Pose Estimation with Decoupled Rotation Mechanism [J].

Chen, Wei ;

Jia, Xi ;

Chang, Hyung Jin ;

Duan, Jinming ;

Shen, Linlin ;

Leonardis, Ales .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :1581-1590

[10] Multi-View 3D Object Detection Network for Autonomous Driving [J].

Chen, Xiaozhi ;

Ma, Huimin ;

Wan, Ji ;

Li, Bo ;

Xia, Tian .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6526-6534

← 1 2 3 4 5 6 →