Point Clouds are Specialized Images: A Knowledge Transfer Approach for 3D Understanding

被引:0
|
作者
Kang, Jiachen [1 ]
Jia, Wenjing [1 ]
He, Xiangjian [2 ]
Lam, Kin Man [3 ]
机构
[1] Univ Technol Sydney, Sch Elect & Data Engn, Sydney, NSW 2007, Australia
[2] Univ Nottingham Ningbo, Sch Comp Sci, Ningbo 315100, Peoples R China
[3] Hong Kong Polytech Univ, Dept Elect & Elect Engn, Kowloon, Hong Kong, Peoples R China
关键词
Point cloud compression; Three-dimensional displays; Transformers; Task analysis; Data models; Image coding; Knowledge transfer; Cross-modal learning; point cloud understanding; self-supervision; transfer learning;
D O I
10.1109/TMM.2024.3412330
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Self-supervised representation learning (SSRL) has gained increasing attention in point cloud understanding, in addressing the challenges posed by 3D data scarcity and high annotation costs. This paper presents PCExpert, a novel SSRL approach that reinterprets point clouds as "specialized images". This conceptual shift allows PCExpert to leverage knowledge derived from large-scale image modality in a more direct and deeper manner, via extensively sharing the parameters with a pre-trained image encoder in a multi-way Transformer architecture. The parameter sharing strategy, combined with an additional pretext task for pre-training, i.e., transformation estimation, empowers PCExpert to outperform the state of the arts in a variety of tasks, with a remarkable reduction in the number of trainable parameters. Notably, PCExpert's performance under LINEAR fine-tuning (e.g., yielding a 90.02% overall accuracy on ScanObjectNN) has already closely approximated the results obtained with FULL model fine-tuning (92.66%), demonstrating its effective representation capability.
引用
收藏
页码:10755 / 10765
页数:11
相关论文
共 50 条
  • [11] 3D Vehicle Detection Using Multi-Level Fusion From Point Clouds and Images
    Zhao, Kun
    Ma, Lingfei
    Meng, Yu
    Liu, Li
    Wang, Junbo
    Marcato, Jose, Jr.
    Goncalves, Wesley Nunes
    Li, Jonathan
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (09) : 15146 - 15154
  • [12] ACF-Net: Asymmetric Cascade Fusion for 3D Detection With LiDAR Point Clouds and Images
    Tian, Yonglin
    Zhang, Xianjing
    Wang, Xiao
    Xu, Jintao
    Wang, Jiangong
    Ai, Rui
    Gu, Weihao
    Ding, Weiping
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (02): : 3360 - 3371
  • [13] Perceptual Quality Assessment of Colored 3D Point Clouds
    Liu, Qi
    Su, Honglei
    Duanmu, Zhengfang
    Liu, Wentao
    Wang, Zhou
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2023, 29 (08) : 3642 - 3655
  • [14] Bitstream-Based Perceptual Quality Assessment of Compressed 3D Point Clouds
    Su, Honglei
    Liu, Qi
    Liu, Yuxin
    Yuan, Hui
    Yang, Huan
    Pan, Zhenkuan
    Wang, Zhou
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 1815 - 1828
  • [15] Imperceptible Transfer Attack and Defense on 3D Point Cloud Classification
    Liu, Daizong
    Hu, Wei
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (04) : 4727 - 4746
  • [16] Unsupervised 3D Object Segmentation of Point Clouds by Geometry Consistency
    Song, Ziyang
    Yang, Bo
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (12) : 8459 - 8473
  • [17] Hierarchical Attention Learning of Scene Flow in 3D Point Clouds
    Wang, Guangming
    Wu, Xinrui
    Liu, Zhe
    Wang, Hesheng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 5168 - 5181
  • [18] Self-Supervised Learning for 3-D Point Clouds Based on a Masked Linear Autoencoder
    Yang, Hongxin
    Wang, Ruisheng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61 : 1 - 11
  • [19] PERCEPTUAL QUALITY ASSESSMENT OF 3D POINT CLOUDS
    Su, Honglei
    Duanmu, Zhengfang
    Liu, Wentao
    Liu, Qi
    Wang, Zhou
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 3182 - 3186
  • [20] A Hybrid Compression Framework for Color Attributes of Static 3D Point Clouds
    Liu, Hao
    Yuan, Hui
    Liu, Qi
    Hou, Junhui
    Zeng, Huanqiang
    Kwong, Sam
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (03) : 1564 - 1577