Curriculumformer: Taming Curriculum Pre-Training for Enhanced 3-D Point Cloud Understanding

被引：0

作者：

Fei, Ben ^{[1
]}

Luo, Tianyue ^{[1
]}

Yang, Weidong ^{[1
]}

Liu, Liwen ^{[1
]}

Zhang, Rui ^{[1
]}

He, Ying ^{[2
]}

机构：

[1] Fudan Univ, Sch Comp Sci, Shanghai 200433, Peoples R China

[2] Nanyang Technol Univ, Coll Comp & Data Sci, Singapore 639798, Singapore

来源：

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2024年

基金：

中国国家自然科学基金;

关键词：

Point cloud compression; Transformers; Task analysis; Representation learning; Geometry; Data models; Accuracy; 3-D representation learning; curriculum learning; point clouds; self-supervised learning; transformer;

D O I：

10.1109/TNNLS.2024.3406587

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Learning universal representations of 3-D point clouds is essential for reducing the need for manual annotation of large-scale and irregular point cloud datasets. The current modus operandi for representative learning is self-supervised learning, which has shown great potential for improving point cloud understanding. Nevertheless, it remains an open problem how to employ auto-encoding for learning universal 3-D representations of irregularly structured point clouds, as previous methods focus on either global shapes or local geometries. To this end, we present a cascaded self-supervised point cloud representation learning framework, dubbed Curriculumformer, aiming to tame curriculum pre-training for enhanced point cloud understanding. Our main idea lies in devising a progressive pre-training strategy, which trains the Transformer in an easy-to-hard manner. Specifically, we first pre-train the Transformer using an upsampling strategy, which allows it to learn global information. Then, we follow up with a completion strategy, which enables the Transformer to gain insight into local geometries. Finally, we propose a Multi-Modal Multi-Modality Contrastive Learning (M4CL) strategy to enhance the ability of representation learning by enriching the Transformer with semantic information. In this way, the pre-trained Transformer can be easily transferred to a wide range of downstream applications. We demonstrate the superior performance of Curriculumformer on various discriminant and generative tasks, outperforming state-of-the-art methods. Moreover, Curriculumformer can also be integrated into other off-the-shelf methods to promote their performance. Our code is available at https://github.com/Fayeben/Curriculumformer.

引用

页码：1 / 15

页数：15

共 50 条

[21] Nonlocal Low-Rank Point Cloud Denoising for 3-D Measurement Surfaces
Zhu, Dingkun
Chen, Honghua
Wang, Weiming
Xie, Haoran
Cheng, Gary
Wei, Mingqiang
Wang, Jun
Wang, Fu Lee
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
[22] Adaptive Multiview Graph Convolutional Network for 3-D Point Cloud Classification and Segmentation
Niu, Wanhao
Wang, Haowen
Zhuang, Chungang
IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2024, 16 (06) : 2043 - 2054
[23] PS-Net: Point Shift Network for 3-D Point Cloud Completion
Zhang, Yirui
Xu, Jiabo
Zou, Yanni
Liu, Peter X.
Liu, Jie
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[24] PointVST: Self-Supervised Pre-Training for 3D Point Clouds via View-Specific Point-to-Image Translation
Zhang, Qijian
Hou, Junhui
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2024, 30 (10) : 6900 - 6912
[25] TGNet: Geometric Graph CNN on 3-D Point Cloud Segmentation
Li, Ying
Ma, Lingfei
Zhong, Zilong
Cao, Dongpu
Li, Jonathan
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2020, 58 (05): : 3588 - 3600
[26] Point-Cloud Transformer for 3-D Electrical Impedance Tomography
Chen, Zhou
Zhang, Haijing
Hu, Delin
Tan, Chao
Liu, Zhe
Yang, Yunjie
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73
[27] Fully Sparse Transformer 3-D Detector for LiDAR Point Cloud
Zhang, Diankun
Zheng, Zhijie
Niu, Haoyu
Wang, Xueqing
Liu, Xiaojun
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61 : 1 - 12
[28] 3-D Point Cloud Map Compression for Connected Intelligent Vehicles
Choi, Youngjoon
Baek, Hannah
Jeong, Jinseop
Kim, Kanghee
IEEE INTERNET COMPUTING, 2024, 28 (01) : 53 - 60
[29] PointWavelet: Learning in Spectral Domain for 3-D Point Cloud Analysis
Wen, Cheng
Long, Jianzhi
Yu, Baosheng
Tao, Dacheng
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 36 (03) : 1 - 13
[30] Motor Diagnosis Based on 3-D Spherical Projected Point Cloud
Long, Zhuo
Xu, Zhiyuan
Wu, Gongping
Deng, Feng
Sun, Meidi
Wang, Ming-Hao
Huang, Zhiwen
Feng, Wenshan
IEEE SENSORS JOURNAL, 2025, 25 (01) : 835 - 844

← 1 2 3 4 5 →