Unsupervised 3D Shape Representation Learning Using Normalizing Flow

被引：0

作者：

Li, Xiang ^{[1
]}

Wen, Congcong ^{[2
]}

Huang, Hao ^{[2
]}

机构：

[1] King Abdullah Univ Sci & Technol, Thuwal, Saudi Arabia

[2] New York Univ Abu Dhabi, Abu Dhabi, U Arab Emirates

来源：

COMPUTER VISION - ACCV 2022, PT I | 2023年 / 13841卷

关键词：

Shape representation learning; Normalizing flow; Contrastive learning;

D O I：

10.1007/978-3-031-26319-4_10

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Learning robust and compact shape representation learning plays an important role in many 3D vision tasks. Existing supervised learning-based methods have achieved remarkable performance, meanwhile requiring large-scale human-annotated datasets for model training. Self-supervised/unsupervised methods provide an attractive solution to this issue that can learn shape representations without the need for ground truth labels. In this paper, we introduce a novel self-supervised method for shape representation learning using normalizing flows. Specifically, we build a model upon a variational normalizing flow framework where a sequence of normalizing flow layers are adopted to model exact posterior latent distribution and enhance the representation power of the learned latent code. To further encourage inter-shape separability and intra-shape compactness among a batch of shapes, we design a contrastive-center loss that performs metric learning on features on a hypersphere. We validate the representation learning ability of our model on downstream classification tasks. Experiments on ModelNet40/10, ScanobjectNN, and ScanNet datasets demonstrate the superior performance of our method compared with current state-of-the-art methods.

引用

页码：158 / 175

页数：18

共 50 条

[11] Temporal Representation Learning on Monocular Videos for 3D Human Pose Estimation
Honari, Sina
Constantin, Victor
Rhodin, Helge
Salzmann, Mathieu
Fua, Pascal
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (05) : 6415 - 6427
[12] Multi-Modal 3D Shape Clustering with Dual Contrastive Learning
Lin, Guoting
Zheng, Zexun
Chen, Lin
Qin, Tianyi
Song, Jiahui
APPLIED SCIENCES-BASEL, 2022, 12 (15):
[13] Self-supervised Secondary Landmark Detection via 3D Representation Learning
Bala, Praneet
Zimmermann, Jan
Park, Hyun Soo
Hayden, Benjamin Y.
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, 131 (08) : 1980 - 1994
[14] Self-supervised Secondary Landmark Detection via 3D Representation Learning
Praneet Bala
Jan Zimmermann
Hyun Soo Park
Benjamin Y. Hayden
International Journal of Computer Vision, 2023, 131 : 1980 - 1994
[15] 3D seismic Fault Detection via Contrastive-Reconstruction Representation Learning
Dou, Yimin
Li, Kewen
EXPERT SYSTEMS WITH APPLICATIONS, 2024, 249
[16] Self-Supervised 3D Behavior Representation Learning Based on Homotopic Hyperbolic Embedding
Chen, Jinghong
Jin, Zhihao
Wang, Qicong
Meng, Hongying
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 6061 - 6074
[17] Self-Supervised 3D Behavior Representation Learning Based on Homotopic Hyperbolic Embedding
Chen, Jinghong
Jin, Zhihao
Wang, Qicong
Meng, Hongying
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 6061 - 6074
[18] Prompted Contrast with Masked Motion Modeling: Towards Versatile 3D Action Representation Learning
Zhang, Jiahang
Lin, Lilang
Liu, Jiaying
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 7175 - 7183
[19] Trusted 3D self-supervised representation learning with cross-modal settings
Han, Xu
Cheng, Haozhe
Shi, Pengcheng
Zhu, Jihua
MACHINE VISION AND APPLICATIONS, 2024, 35 (04)
[20] Action-conditioned contrastive learning for 3D human pose and shape estimation in videos
Song, Inpyo
Ryu, Moonwook
Lee, Jangwon
COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 249

← 1 2 3 4 5 →