Leveraging SE(3) Equivariance for Self-Supervised Category-Level Object Pose Estimation

被引：0

作者：

Li, Xiaolong ^{[1
]}

Weng, Yijia ^{[2
]}

Yi, Li ^{[3
]}

Guibas, Leonidas ^{[4
]}

Abbott, A. Lynn ^{[1
]}

Song, Shuran ^{[5
]}

Wang, He ^{[2
]}

机构：

[1] Virginia Tech, Blacksburg, VA USA

[2] Peking Univ, Beijing, Peoples R China

[3] Tsinghua Univ, Beijing, Peoples R China

[4] Stanford Univ, Stanford, CA 94305 USA

[5] Columbia Univ, New York, NY 10027 USA

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021) | 2021年 / 34卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Category-level object pose estimation aims to find 6D object poses of previously unseen object instances from known categories without access to object CAD models. To reduce the huge amount of pose annotations needed for category-level learning, we propose for the first time a self-supervised learning framework to estimate category-level 6D object pose from single 3D point clouds. During training, our method assumes no ground-truth pose annotations, no CAD models, and no multi-view supervision. The key to our method is to disentangle shape and pose through an invariant shape reconstruction module and an equivariant pose estimation module, empowered by SE(3) equivariant point cloud networks. The invariant shape reconstruction module learns to perform aligned reconstructions, yielding a category-level reference frame without using any annotations. In addition, the equivariant pose estimation module achieves category-level pose estimation accuracy that is comparable to some fully supervised methods. Extensive experiments demonstrate the effectiveness of our approach on both complete and partial depth point clouds from the ModelNet40 benchmark, and on real depth point clouds from the NOCS-REAL 275 dataset. The project page with code and visualizations can be found at: dragonlong.github.io/equi-pose.

引用

页数：12

共 27 条

[1]

[Anonymous], 2021, P 2021 C EMP METH NA, DOI DOI 10.1109/ITSC48978.2021.9564752

[2]

[Anonymous], 2015, PROC CVPR IEEE

[3]

[Anonymous], 2020, EUR C COMP VIS, DOI DOI 10.1007/978-3-030-20008-410

[4]

[Anonymous], 2017, arXiv preprint arXiv:1711.00199

[5]

Averkiou Melinos, 2016, COMPUTER GRAPHICS FO, P261

[6]

Chang A X, 2015, COMPUTER SCI, V1512, P3

[7] Alignment of 3D models [J].

Chaouch, Mohamed ;

Verroust-Blondet, Anne .

GRAPHICAL MODELS, 2009, 71 (1-6) :63-76

[8] Learning Canonical Shape Space for Category-Level 6D Object Pose and Size Estimation [J].

Chen, Dengsheng ;

Li, Jun ;

Wang, Zheng ;

Xu, Kai .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :11970-11979

[9]

Chen Wang, 2020, 2020 IEEE International Conference on Robotics and Automation (ICRA), P10059, DOI 10.1109/ICRA40945.2020.9196679

[10] Shape Completion using 3D-Encoder-Predictor CNNs and Shape Synthesis [J].

Dai, Angela ;

Qi, Charles Ruizhongtai ;

Niessner, Matthias .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6545-6554

← 1 2 3 →