Unsupervised 3D Shape Representation Learning Using Normalizing Flow

被引:0
作者
Li, Xiang [1 ]
Wen, Congcong [2 ]
Huang, Hao [2 ]
机构
[1] King Abdullah Univ Sci & Technol, Thuwal, Saudi Arabia
[2] New York Univ Abu Dhabi, Abu Dhabi, U Arab Emirates
来源
COMPUTER VISION - ACCV 2022, PT I | 2023年 / 13841卷
关键词
Shape representation learning; Normalizing flow; Contrastive learning;
D O I
10.1007/978-3-031-26319-4_10
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning robust and compact shape representation learning plays an important role in many 3D vision tasks. Existing supervised learning-based methods have achieved remarkable performance, meanwhile requiring large-scale human-annotated datasets for model training. Self-supervised/unsupervised methods provide an attractive solution to this issue that can learn shape representations without the need for ground truth labels. In this paper, we introduce a novel self-supervised method for shape representation learning using normalizing flows. Specifically, we build a model upon a variational normalizing flow framework where a sequence of normalizing flow layers are adopted to model exact posterior latent distribution and enhance the representation power of the learned latent code. To further encourage inter-shape separability and intra-shape compactness among a batch of shapes, we design a contrastive-center loss that performs metric learning on features on a hypersphere. We validate the representation learning ability of our model on downstream classification tasks. Experiments on ModelNet40/10, ScanobjectNN, and ScanNet datasets demonstrate the superior performance of our method compared with current state-of-the-art methods.
引用
收藏
页码:158 / 175
页数:18
相关论文
共 50 条
  • [11] Temporal Representation Learning on Monocular Videos for 3D Human Pose Estimation
    Honari, Sina
    Constantin, Victor
    Rhodin, Helge
    Salzmann, Mathieu
    Fua, Pascal
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (05) : 6415 - 6427
  • [12] Multi-Modal 3D Shape Clustering with Dual Contrastive Learning
    Lin, Guoting
    Zheng, Zexun
    Chen, Lin
    Qin, Tianyi
    Song, Jiahui
    APPLIED SCIENCES-BASEL, 2022, 12 (15):
  • [13] Self-supervised Secondary Landmark Detection via 3D Representation Learning
    Bala, Praneet
    Zimmermann, Jan
    Park, Hyun Soo
    Hayden, Benjamin Y.
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, 131 (08) : 1980 - 1994
  • [14] Self-supervised Secondary Landmark Detection via 3D Representation Learning
    Praneet Bala
    Jan Zimmermann
    Hyun Soo Park
    Benjamin Y. Hayden
    International Journal of Computer Vision, 2023, 131 : 1980 - 1994
  • [15] 3D seismic Fault Detection via Contrastive-Reconstruction Representation Learning
    Dou, Yimin
    Li, Kewen
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 249
  • [16] Self-Supervised 3D Behavior Representation Learning Based on Homotopic Hyperbolic Embedding
    Chen, Jinghong
    Jin, Zhihao
    Wang, Qicong
    Meng, Hongying
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 6061 - 6074
  • [17] Self-Supervised 3D Behavior Representation Learning Based on Homotopic Hyperbolic Embedding
    Chen, Jinghong
    Jin, Zhihao
    Wang, Qicong
    Meng, Hongying
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 6061 - 6074
  • [18] Prompted Contrast with Masked Motion Modeling: Towards Versatile 3D Action Representation Learning
    Zhang, Jiahang
    Lin, Lilang
    Liu, Jiaying
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 7175 - 7183
  • [19] Trusted 3D self-supervised representation learning with cross-modal settings
    Han, Xu
    Cheng, Haozhe
    Shi, Pengcheng
    Zhu, Jihua
    MACHINE VISION AND APPLICATIONS, 2024, 35 (04)
  • [20] Action-conditioned contrastive learning for 3D human pose and shape estimation in videos
    Song, Inpyo
    Ryu, Moonwook
    Lee, Jangwon
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 249