Self-Supervised Point Cloud Understanding via Mask Transformer and Contrastive Learning

Cited by: 1
Authors
Wang, Di [1]
Yang, Zhi-Xin [1]
Affiliations
[1] Univ Macau, Dept Electromach Engn, State Key Lab Internet Things Smart City, Macau 999078, Peoples R China
Keywords
Self-supervision; point cloud understanding; mask Transformer; contrastive learning
DOI
10.1109/LRA.2022.3224370
Chinese Library Classification number
TP24 [Robotics]
Discipline classification code
080202; 1405
Abstract
Self-supervised point cloud understanding can pre-train the point cloud learning network on a large dataset, which helps boost fine-tuning performance on smaller datasets in downstream tasks. To design an efficient self-supervised pre-training strategy that captures useful and discriminative representations of 3D point clouds, we propose ContrastMPCT, a self-reconstruction scheme built on the contrastive learning principle. Specifically, two contrastive loss functions are designed for 3D point clouds to maximize the dependence between the input tokens and output tokens of the encoder and to speed up the convergence of the model. Extensive experiments show that the ContrastMPCT pre-training strategy effectively improves fine-tuning performance on downstream tasks, including object classification and part segmentation. Moreover, the results are superior to those of existing CNN-based and Transformer-based works, indicating the efficacy of the proposed method.
Pages: 184-191
Page count: 8
Related papers
50 in total
  • [1] PointCMP: Contrastive Mask Prediction for Self-supervised Learning on Point Cloud Videos
    Shen, Zhiqiang
    Sheng, Xiaoxiao
    Wang, Longguang
    Guo, Yulan
    Liu, Qiong
    Zhou, Xi
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 1212 - 1222
  • [2] Contrastive Predictive Autoencoders for Dynamic Point Cloud Self-Supervised Learning
    Sheng, Xiaoxiao
    Shen, Zhiqiang
    Xiao, Gang
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 8, 2023, : 9802 - 9810
  • [3] Self-Supervised Intra-Modal and Cross-Modal Contrastive Learning for Point Cloud Understanding
    Wu, Yue
    Liu, Jiaming
    Gong, Maoguo
    Gong, Peiran
    Fan, Xiaolong
    Qin, A. K.
    Miao, Qiguang
    Ma, Wenping
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 1626 - 1638
  • [4] Point Contrastive Prediction with Semantic Clustering for Self-Supervised Learning on Point Cloud Videos
    Sheng, Xiaoxiao
    Shen, Zhiqiang
    Xiao, Gang
    Wang, Longguang
    Guo, Yulan
    Fan, Hehe
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 16469 - 16478
  • [5] Self-Contrastive Learning with Hard Negative Sampling for Self-supervised Point Cloud Learning
    Du, Bi'an
    Gao, Xiang
    Hu, Wei
    Li, Xin
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 3133 - 3142
  • [6] Generative Variational-Contrastive Learning for Self-Supervised Point Cloud Representation
    Wang, Bohua
    Tian, Zhiqiang
    Ye, Aixue
    Wen, Feng
    Du, Shaoyi
    Gao, Yue
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (09) : 6154 - 6166
  • [7] CrossPoint: Self-Supervised Cross-Modal Contrastive Learning for 3D Point Cloud Understanding
    Afham, Mohamed
    Dissanayake, Isuru
    Dissanayake, Dinithi
    Dharmasiri, Amaya
    Thilakarathna, Kanchana
    Rodrigo, Ranga
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 9892 - 9902
  • [8] Self-Supervised Visual Representations Learning by Contrastive Mask Prediction
    Zhao, Yucheng
    Wang, Guangting
    Luo, Chong
    Zeng, Wenjun
    Zha, Zheng-Jun
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 10140 - 10149
  • [9] Self-supervised Variational Contrastive Learning with Applications to Face Understanding
    Yavuz, Mehmet Can
    Yanikoglu, Berrin
    2024 IEEE 18TH INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION, FG 2024, 2024,
  • [10] A Cross Branch Fusion-Based Contrastive Learning Framework for Point Cloud Self-supervised Learning
    Wu, Chengzhi
    Huang, Qianliang
    Jin, Kun
    Pfrommer, Julius
    Beyerer, Juergen
    2024 INTERNATIONAL CONFERENCE IN 3D VISION, 3DV 2024, 2024, : 528 - 538