Curriculumformer: Taming Curriculum Pre-Training for Enhanced 3-D Point Cloud Understanding

Authors
Fei, Ben [1]
Luo, Tianyue [1]
Yang, Weidong [1]
Liu, Liwen [1]
Zhang, Rui [1]
He, Ying [2]
Affiliations
[1] Fudan Univ, Sch Comp Sci, Shanghai 200433, Peoples R China
[2] Nanyang Technol Univ, Coll Comp & Data Sci, Singapore 639798, Singapore
Funding
National Natural Science Foundation of China;
Keywords
Point cloud compression; Transformers; Task analysis; Representation learning; Geometry; Data models; Accuracy; 3-D representation learning; curriculum learning; point clouds; self-supervised learning; transformer;
DOI
10.1109/TNNLS.2024.3406587
Chinese Library Classification
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Learning universal representations of 3-D point clouds is essential for reducing the need for manual annotation of large-scale and irregular point cloud datasets. The current modus operandi for representation learning is self-supervised learning, which has shown great potential for improving point cloud understanding. Nevertheless, it remains an open problem how to employ auto-encoding for learning universal 3-D representations of irregularly structured point clouds, as previous methods focus on either global shapes or local geometries. To this end, we present a cascaded self-supervised point cloud representation learning framework, dubbed Curriculumformer, aiming to tame curriculum pre-training for enhanced point cloud understanding. Our main idea lies in devising a progressive pre-training strategy, which trains the Transformer in an easy-to-hard manner. Specifically, we first pre-train the Transformer using an upsampling strategy, which allows it to learn global information. Then, we follow up with a completion strategy, which enables the Transformer to gain insight into local geometries. Finally, we propose a Multi-Modal Multi-Modality Contrastive Learning (M4CL) strategy to enhance the ability of representation learning by enriching the Transformer with semantic information. In this way, the pre-trained Transformer can be easily transferred to a wide range of downstream applications. We demonstrate the superior performance of Curriculumformer on various discriminative and generative tasks, outperforming state-of-the-art methods. Moreover, Curriculumformer can also be integrated into other off-the-shelf methods to promote their performance. Our code is available at https://github.com/Fayeben/Curriculumformer.
Pages: 1 - 15
Number of pages: 15
Related papers
50 in total
  • [21] Nonlocal Low-Rank Point Cloud Denoising for 3-D Measurement Surfaces
    Zhu, Dingkun
    Chen, Honghua
    Wang, Weiming
    Xie, Haoran
    Cheng, Gary
    Wei, Mingqiang
    Wang, Jun
    Wang, Fu Lee
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
  • [22] Adaptive Multiview Graph Convolutional Network for 3-D Point Cloud Classification and Segmentation
    Niu, Wanhao
    Wang, Haowen
    Zhuang, Chungang
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2024, 16 (06) : 2043 - 2054
  • [23] PS-Net: Point Shift Network for 3-D Point Cloud Completion
    Zhang, Yirui
    Xu, Jiabo
    Zou, Yanni
    Liu, Peter X.
    Liu, Jie
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [24] PointVST: Self-Supervised Pre-Training for 3D Point Clouds via View-Specific Point-to-Image Translation
    Zhang, Qijian
    Hou, Junhui
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2024, 30 (10) : 6900 - 6912
  • [25] TGNet: Geometric Graph CNN on 3-D Point Cloud Segmentation
    Li, Ying
    Ma, Lingfei
    Zhong, Zilong
    Cao, Dongpu
    Li, Jonathan
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2020, 58 (05) : 3588 - 3600
  • [26] Point-Cloud Transformer for 3-D Electrical Impedance Tomography
    Chen, Zhou
    Zhang, Haijing
    Hu, Delin
    Tan, Chao
    Liu, Zhe
    Yang, Yunjie
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73
  • [27] Fully Sparse Transformer 3-D Detector for LiDAR Point Cloud
    Zhang, Diankun
    Zheng, Zhijie
    Niu, Haoyu
    Wang, Xueqing
    Liu, Xiaojun
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61 : 1 - 12
  • [28] 3-D Point Cloud Map Compression for Connected Intelligent Vehicles
    Choi, Youngjoon
    Baek, Hannah
    Jeong, Jinseop
    Kim, Kanghee
    IEEE INTERNET COMPUTING, 2024, 28 (01) : 53 - 60
  • [29] PointWavelet: Learning in Spectral Domain for 3-D Point Cloud Analysis
    Wen, Cheng
    Long, Jianzhi
    Yu, Baosheng
    Tao, Dacheng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 36 (03) : 1 - 13
  • [30] Motor Diagnosis Based on 3-D Spherical Projected Point Cloud
    Long, Zhuo
    Xu, Zhiyuan
    Wu, Gongping
    Deng, Feng
    Sun, Meidi
    Wang, Ming-Hao
    Huang, Zhiwen
    Feng, Wenshan
    IEEE SENSORS JOURNAL, 2025, 25 (01) : 835 - 844