Point Clouds are Specialized Images: A Knowledge Transfer Approach for 3D Understanding

被引:0
|
作者
Kang, Jiachen [1 ]
Jia, Wenjing [1 ]
He, Xiangjian [2 ]
Lam, Kin Man [3 ]
机构
[1] Univ Technol Sydney, Sch Elect & Data Engn, Sydney, NSW 2007, Australia
[2] Univ Nottingham Ningbo, Sch Comp Sci, Ningbo 315100, Peoples R China
[3] Hong Kong Polytech Univ, Dept Elect & Elect Engn, Kowloon, Hong Kong, Peoples R China
关键词
Point cloud compression; Three-dimensional displays; Transformers; Task analysis; Data models; Image coding; Knowledge transfer; Cross-modal learning; point cloud understanding; self-supervision; transfer learning;
D O I
10.1109/TMM.2024.3412330
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Self-supervised representation learning (SSRL) has gained increasing attention in point cloud understanding, in addressing the challenges posed by 3D data scarcity and high annotation costs. This paper presents PCExpert, a novel SSRL approach that reinterprets point clouds as "specialized images". This conceptual shift allows PCExpert to leverage knowledge derived from large-scale image modality in a more direct and deeper manner, via extensively sharing the parameters with a pre-trained image encoder in a multi-way Transformer architecture. The parameter sharing strategy, combined with an additional pretext task for pre-training, i.e., transformation estimation, empowers PCExpert to outperform the state of the arts in a variety of tasks, with a remarkable reduction in the number of trainable parameters. Notably, PCExpert's performance under LINEAR fine-tuning (e.g., yielding a 90.02% overall accuracy on ScanObjectNN) has already closely approximated the results obtained with FULL model fine-tuning (92.66%), demonstrating its effective representation capability.
引用
收藏
页码:10755 / 10765
页数:11
相关论文
共 50 条
  • [31] A Lightweight and Detector-Free 3D Single Object Tracker on Point Clouds
    Xia, Yan
    Wu, Qiangqiang
    Li, Wei
    Chan, Antoni B. B.
    Stilla, Uwe
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (05) : 5543 - 5554
  • [32] MSL-Net: Sharp Feature Detection Network for 3D Point Clouds
    Jiao, Xianhe
    Lv, Chenlei
    Yi, Ran
    Zhao, Junli
    Pan, Zhenkuan
    Wu, Zhongke
    Liu, Yong-Jin
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2024, 30 (09) : 6433 - 6446
  • [33] LFS-Aware Surface Reconstruction From Unoriented 3D Point Clouds
    Fu, Rao
    Hormann, Kai
    Alliez, Pierre
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 11415 - 11427
  • [34] Exploring Point-BEV Fusion for 3D Point Cloud Object Tracking With Transformer
    Luo, Zhipeng
    Zhou, Changqing
    Pan, Liang
    Zhang, Gongjie
    Liu, Tianrui
    Luo, Yueru
    Zhao, Haiyu
    Liu, Ziwei
    Lu, Shijian
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (09) : 5921 - 5935
  • [35] Spherical Kernel for Efficient Graph Convolution on 3D Point Clouds
    Lei, Huan
    Akhtar, Naveed
    Mian, Ajmal
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (10) : 3664 - 3680
  • [36] On Active Labeling 3D Point Clouds via Contrastive Learning
    Yang G.
    Lai W.
    Huang H.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2023, 35 (11): : 1664 - 1673
  • [37] Relation Graph Network for 3D Object Detection in Point Clouds
    Feng, Mingtao
    Gilani, Syed Zulqarnain
    Wang, Yaonan
    Zhang, Liang
    Mian, Ajmal
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 92 - 107
  • [38] OctFormer: Octree-based Transformers for 3D Point Clouds
    Wang, Peng-Shuai
    ACM TRANSACTIONS ON GRAPHICS, 2023, 42 (04):
  • [39] Monitoring Critical Infrastructure Using 3D LiDAR Point Clouds
    Sharifisoraki, Z.
    Dey, A.
    Selzler, R.
    Amini, M.
    Green, J. R.
    Rajan, S.
    Kwamena, F. A.
    IEEE ACCESS, 2023, 11 : 314 - 336
  • [40] Automatic Pairwise Coarse Registration of Terrestrial Point Clouds Using 3D Line Features
    Fu, Yongjian
    Li, Zongchun
    Xiong, Feng
    He, Hua
    Deng, Yong
    Wang, Wenqi
    IEEE ACCESS, 2022, 10 : 115007 - 115024