Point Clouds are Specialized Images: A Knowledge Transfer Approach for 3D Understanding

被引:0
|
作者
Kang, Jiachen [1 ]
Jia, Wenjing [1 ]
He, Xiangjian [2 ]
Lam, Kin Man [3 ]
机构
[1] Univ Technol Sydney, Sch Elect & Data Engn, Sydney, NSW 2007, Australia
[2] Univ Nottingham Ningbo, Sch Comp Sci, Ningbo 315100, Peoples R China
[3] Hong Kong Polytech Univ, Dept Elect & Elect Engn, Kowloon, Hong Kong, Peoples R China
关键词
Point cloud compression; Three-dimensional displays; Transformers; Task analysis; Data models; Image coding; Knowledge transfer; Cross-modal learning; point cloud understanding; self-supervision; transfer learning;
D O I
10.1109/TMM.2024.3412330
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Self-supervised representation learning (SSRL) has gained increasing attention in point cloud understanding, in addressing the challenges posed by 3D data scarcity and high annotation costs. This paper presents PCExpert, a novel SSRL approach that reinterprets point clouds as "specialized images". This conceptual shift allows PCExpert to leverage knowledge derived from large-scale image modality in a more direct and deeper manner, via extensively sharing the parameters with a pre-trained image encoder in a multi-way Transformer architecture. The parameter sharing strategy, combined with an additional pretext task for pre-training, i.e., transformation estimation, empowers PCExpert to outperform the state of the arts in a variety of tasks, with a remarkable reduction in the number of trainable parameters. Notably, PCExpert's performance under LINEAR fine-tuning (e.g., yielding a 90.02% overall accuracy on ScanObjectNN) has already closely approximated the results obtained with FULL model fine-tuning (92.66%), demonstrating its effective representation capability.
引用
收藏
页码:10755 / 10765
页数:11
相关论文
共 50 条
  • [1] Transformer for 3D Point Clouds
    Wang, Jiayun
    Chakraborty, Rudrasis
    Yu, Stella X.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (08) : 4419 - 4431
  • [2] 3D Scene Graph Generation From Point Clouds
    Wei, Wenwen
    Wei, Ping
    Qin, Jialu
    Liao, Zhimin
    Wang, Shuaijie
    Cheng, Xiang
    Liu, Meiqin
    Zheng, Nanning
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 5358 - 5368
  • [3] Learning a Task-Specific Descriptor for Robust Matching of 3D Point Clouds
    Zhang, Zhiyuan
    Dai, Yuchao
    Fan, Bin
    Sun, Jiadai
    He, Mingyi
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (12) : 8462 - 8475
  • [4] Accelerated Lloyd's Method for Resampling 3D Point Clouds
    Xiao, Yanyang
    Zhang, Tieyi
    Cao, Juan
    Chen, Zhonggui
    IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 1033 - 1046
  • [5] Unsupervised Domain Adaptation for 3D Point Clouds by Searched Transformations
    Kang, Dongmin
    Nam, Yeongwoo
    Kyung, Daeun
    Choi, Jonghyun
    IEEE ACCESS, 2022, 10 : 56901 - 56913
  • [6] Intrinsic and Isotropic Resampling for 3D Point Clouds
    Lv, Chenlei
    Lin, Weisi
    Zhao, Baoquan
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (03) : 3274 - 3291
  • [7] Structural Relation Modeling of 3D Point Clouds
    Zheng, Yu
    Lu, Jiwen
    Duan, Yueqi
    Zhou, Jie
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 4867 - 4881
  • [8] Deep Learning for 3D Point Clouds: A Survey
    Guo, Yulan
    Wang, Hanyun
    Hu, Qingyong
    Liu, Hao
    Liu, Li
    Bennamoun, Mohammed
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (12) : 4338 - 4364
  • [9] Interdimensional Knowledge Transfer for Semantic Segmentation on LiDAR Point Clouds
    Ha, Seongheon
    Kim, Yeogyeong
    Park, Jinsun
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (09): : 7501 - 7508
  • [10] 3D Cascade RCNN: High Quality Object Detection in Point Clouds
    Cai, Qi
    Pan, Yingwei
    Yao, Ting
    Mei, Tao
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 5706 - 5719