Self-supervised learning of rotation-invariant 3D point set features using transformer and its self-distillation

被引:0
|
作者
Furuya, Takahiko [1 ]
Chen, Zhoujie [1 ,2 ]
Ohbuchi, Ryutarou [1 ]
Kuang, Zhenzhong [2 ]
机构
[1] Univ Yamanashi, Dept Comp Sci & Engn, 4-3-11 Takeda, Kofu, Yamanashi 4008511, Japan
[2] Hangzhou Dianzi Univ, Sch Comp Sci, Hangzhou 310000, Peoples R China
基金
日本学术振兴会;
关键词
Deep learning; Self-supervised learning; 3D point set; Feature representation; Rotation invariance;
D O I
10.1016/j.cviu.2024.104025
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Invariance against rotations of 3D objects is an important property in analyzing 3D point set data. Conventional 3D point set DNNs having rotation invariance typically obtain accurate 3D shape features via supervised learning by using labeled 3D point sets as training samples. However, due to the rapid increase in 3D point set data and the high cost of labeling, a framework to learn rotation -invariant 3D shape features from numerous unlabeled 3D point sets is required. This paper proposes a novel self -supervised learning framework for acquiring accurate and rotation -invariant 3D point set features at object -level. Our proposed lightweight DNN architecture decomposes an input 3D point set into multiple global -scale regions, called tokens, that preserve the spatial layout of partial shapes composing the 3D object. We employ a self -attention mechanism to refine the tokens and aggregate them into an expressive rotation -invariant feature per 3D point set. Our DNN is effectively trained by using pseudo -labels generated by a self -distillation framework. To facilitate the learning of accurate features, we propose to combine multi -crop and cut -mix data augmentation techniques to diversify 3D point sets for training. Through a comprehensive evaluation, we empirically demonstrate that, (1) existing rotation -invariant DNN architectures designed for supervised learning do not necessarily learn accurate 3D shape features under a self -supervised learning scenario, and (2) our proposed algorithm learns rotation -invariant 3D point set features that are more accurate than those learned by existing algorithms.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Explainable rotation-invariant self-supervised representation learning ☆
    Singh, Devansh
    Marathe, Aboli
    Roy, Sidharth
    Walambe, Rahee
    Kotecha, Ketan
    METHODSX, 2024, 13
  • [2] MaskLRF: Self-Supervised Pretraining via Masked Autoencoding of Local Reference Frames for Rotation-Invariant 3D Point Set Analysis
    Furuya, Takahiko
    IEEE ACCESS, 2024, 12 : 73340 - 73353
  • [3] Rotation-Invariant Analysis of 3D Point Set Using Self-Attention
    Karikomi, Yoshihiro
    Furuya, Takahiko
    Ohbuchi, Ryutarou
    Journal of the Institute of Image Electronics Engineers of Japan, 2023, 52 (04) : 516 - 526
  • [4] DeSD: Self-Supervised Learning with Deep Self-Distillation for 3D Medical Image Segmentation
    Ye, Yiwen
    Zhang, Jianpeng
    Chen, Ziyang
    Xia, Yong
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT IV, 2022, 13434 : 545 - 555
  • [5] Self-supervised learning with self-distillation on COVID-19 medical image classification
    Tan, Zhiyong
    Yu, Yuhai
    Meng, Jiana
    Liu, Shuang
    Li, Wei
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2024, 243
  • [6] Exploring Self-Supervised Learning for 3D Point Cloud Registration
    Yuan, Mingzhi
    Huang, Qiao
    Shen, Ao
    Huang, Xiaoshui
    Wang, Manning
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (01): : 25 - 31
  • [7] 3D Human Pose Machines with Self-Supervised Learning
    Wang, Keze
    Lin, Liang
    Jiang, Chenhan
    Qian, Chen
    Wei, Pengxu
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (05) : 1069 - 1082
  • [8] PatchMixing Masked Autoencoders for 3D Point Cloud Self-Supervised Learning
    Lin, Chengxing
    Xu, Wenju
    Zhu, Jian
    Nie, Yongwei
    Cai, Ruichu
    Xu, Xuemiao
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (10) : 9882 - 9897
  • [9] Self-supervised Adversarial Masking for 3D Point Cloud Representation Learning
    Szachniewicz, Michal
    Kozlowski, Wojciech
    Stypulkowski, Michal
    Zieba, Maciej
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, PT II, ACIIDS 2024, 2024, 14796 : 156 - 168
  • [10] Self-supervised 3D Anatomy Segmentation Using Self-distilled Masked Image Transformer (SMIT)
    Jiang, Jue
    Tyagi, Neelam
    Tringale, Kathryn
    Crane, Christopher
    Veeraraghavan, Harini
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT IV, 2022, 13434 : 556 - 566