Self-supervised learning of rotation-invariant 3D point set features using transformer and its self-distillation

被引:0
|
作者
Furuya, Takahiko [1 ]
Chen, Zhoujie [1 ,2 ]
Ohbuchi, Ryutarou [1 ]
Kuang, Zhenzhong [2 ]
机构
[1] Univ Yamanashi, Dept Comp Sci & Engn, 4-3-11 Takeda, Kofu, Yamanashi 4008511, Japan
[2] Hangzhou Dianzi Univ, Sch Comp Sci, Hangzhou 310000, Peoples R China
基金
日本学术振兴会;
关键词
Deep learning; Self-supervised learning; 3D point set; Feature representation; Rotation invariance;
D O I
10.1016/j.cviu.2024.104025
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Invariance against rotations of 3D objects is an important property in analyzing 3D point set data. Conventional 3D point set DNNs having rotation invariance typically obtain accurate 3D shape features via supervised learning by using labeled 3D point sets as training samples. However, due to the rapid increase in 3D point set data and the high cost of labeling, a framework to learn rotation -invariant 3D shape features from numerous unlabeled 3D point sets is required. This paper proposes a novel self -supervised learning framework for acquiring accurate and rotation -invariant 3D point set features at object -level. Our proposed lightweight DNN architecture decomposes an input 3D point set into multiple global -scale regions, called tokens, that preserve the spatial layout of partial shapes composing the 3D object. We employ a self -attention mechanism to refine the tokens and aggregate them into an expressive rotation -invariant feature per 3D point set. Our DNN is effectively trained by using pseudo -labels generated by a self -distillation framework. To facilitate the learning of accurate features, we propose to combine multi -crop and cut -mix data augmentation techniques to diversify 3D point sets for training. Through a comprehensive evaluation, we empirically demonstrate that, (1) existing rotation -invariant DNN architectures designed for supervised learning do not necessarily learn accurate 3D shape features under a self -supervised learning scenario, and (2) our proposed algorithm learns rotation -invariant 3D point set features that are more accurate than those learned by existing algorithms.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] Self-Supervised 3D Representation Learning of Dressed Humans From Social Media Videos
    Jafarian, Yasamin
    Park, Hyun Soo
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (07) : 8969 - 8983
  • [42] Self-supervised 3D human pose estimation from video
    Gholami, Mohsen
    Rezaei, Ahmad
    Rhodin, Helge
    Ward, Rabab
    Wang, Z. Jane
    NEUROCOMPUTING, 2022, 488 : 97 - 106
  • [43] Ssman: self-supervised masked adaptive network for 3D human pose estimation
    Yu Shi
    Tianyi Yue
    Hu Zhao
    Guoping He
    Keyan Ren
    Machine Vision and Applications, 2024, 35
  • [44] Hybrid Supervised and Self-Supervised Learning for 3D Printing Optimization: A Masked Supervised Bootstrap Your Own Latent Approach
    Nguyen, Phuong Dong
    Dao, Manh Binh
    Nguyen, Thanh Q.
    3D PRINTING AND ADDITIVE MANUFACTURING, 2025,
  • [45] Ssman: self-supervised masked adaptive network for 3D human pose estimation
    Shi, Yu
    Yue, Tianyi
    Zhao, Hu
    He, Guoping
    Ren, Keyan
    MACHINE VISION AND APPLICATIONS, 2024, 35 (03)
  • [46] Self-supervised Image-based 3D Model Retrieval
    Song, Dan
    Zhang, Chu-Meng
    Zhao, Xiao-Qian
    Wang, Teng
    Nie, Wei-Zhi
    Li, Xuan-Ya
    Liu, An-An
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (02)
  • [47] 3D SCENEFLOWNET: SELF-SUPERVISED 3D SCENE FLOWESTIMATION BASED ON GRAPH CNN
    Lu, Yawen
    Zhu, Yuhao
    Lu, Guoyu
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 3647 - 3651
  • [48] Integration of Patch Features Through Self-supervised Learning and Transformer for Survival Analysis on Whole Slide Images
    Huang, Ziwang
    Chai, Hua
    Wang, Ruoqi
    Wang, Haitao
    Yang, Yuedong
    Wu, Hejun
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT VIII, 2021, 12908 : 561 - 570
  • [49] S3L: Spectrum Transformer for Self-Supervised Learning in Hyperspectral Image Classification
    Guo, Hufeng
    Liu, Wenyi
    REMOTE SENSING, 2024, 16 (06)
  • [50] SELF-SUPERVISED LEARNING GUIDED TRANSFORMER FOR SURVIVAL PREDICTION OF LUNG CANCER USING PATHOLOGICAL IMAGES
    Zhao, Lu
    Hou, Runping
    Zhao, Wangyuan
    Qiu, Lu
    Teng, Haohua
    Han, Yuchen
    Fu, Xiaolong
    Zhao, Jun
    2023 IEEE 20TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI, 2023,