Enhancing Few-Shot 3D Point Cloud Classification With Soft Interaction and Self-Attention

被引:2
作者
Khan, Abdullah Aman [1 ,2 ]
Shao, Jie [1 ,2 ]
Shafiq, Sidra [2 ]
Zhu, Shuyuan [2 ]
Shen, Heng Tao [1 ,2 ]
机构
[1] Sichuan Artificial Intelligence Res Inst, Yibin 644000, Peoples R China
[2] Univ Elect Sci & Technol China, Chengdu 611731, Peoples R China
基金
中国国家自然科学基金;
关键词
Point cloud compression; Three-dimensional displays; Few shot learning; Solid modeling; Data models; Shape; Feature extraction; Benchmark testing; Predictive models; Tuning; Few-shot learning; point clouds; feature embeddings; self-attention;
D O I
10.1109/TMM.2024.3521849
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Few-shot learning is a crucial aspect of modern machine learning that enables models to recognize and classify objects efficiently with limited training data. The shortage of labeled 3D point cloud data calls for innovative solutions, particularly when novel classes emerge more frequently. In this paper, we propose a novel few-shot learning method for recognizing 3D point clouds. More specifically, this paper addresses the challenges of applying few-shot learning to 3D point cloud data, which poses unique difficulties due to the unordered and irregular nature of these data. We propose two new modules for few-shot based 3D point cloud classification, i.e., the Soft Interaction Module (SIM) and Self-Attention Residual Feedforward (SARF) Module. These modules balance and enhance the feature representation by enabling more relevant feature interactions and capturing long-range dependencies between query and support features. To validate the effectiveness of the proposed method, extensive experiments are conducted on benchmark datasets, including ModelNet40, ShapeNetCore, and ScanObjectNN. Our approach demonstrates superior performance in handling abrupt feature changes occurring during the meta-learning process. The results of the experiments indicate the superiority of our proposed method by demonstrating its robust generalization ability and better classification performance for 3D point cloud data with limited training samples.
引用
收藏
页码:1127 / 1141
页数:15
相关论文
共 46 条
[1]  
Chen Guangyan, 2023, Advances in Neural Information Processing Systems
[2]   ViewNet: A Novel Projection-Based Backbone with View Pooling for Few-shot Point Cloud Classification [J].
Chen, Jiajing ;
Yang, Minmin ;
Velipasalar, Senem .
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, :17652-17660
[3]  
Chen W., 2019, P INT C LEARN REPR I
[4]  
Feng H., 2022, PROC IEEE INT C ACOU
[5]  
Garcia-Acosta Virginia., 2018, INT C LEARN REPR, V1st, P1
[6]   Meta-learning Approaches for Few-Shot Learning: A Survey of Recent Advances [J].
Gharoun, Hassan ;
Momenifar, Fereshteh ;
Chen, Fang ;
Gandomi, Amir H. .
ACM COMPUTING SURVEYS, 2024, 56 (12)
[7]   Dynamic Few-Shot Visual Learning without Forgetting [J].
Gidaris, Spyros ;
Komodakis, Nikos .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :4367-4375
[8]   Font-ProtoNet: Prototypical Network based Font Identification of Document Images in Low Data Regime [J].
Goel, Nikita ;
Sharma, Monika ;
Vig, Lovekesh .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, :2369-2376
[9]   Deep Learning for 3D Point Clouds: A Survey [J].
Guo, Yulan ;
Wang, Hanyun ;
Hu, Qingyong ;
Liu, Hao ;
Liu, Li ;
Bennamoun, Mohammed .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (12) :4338-4364
[10]   Residual learning with annularly convolutional neural networks for classification and segmentation of 3D point clouds [J].
Hassan, R. ;
Fraz, M. M. ;
Rajput, A. ;
Shahzad, M. .
NEUROCOMPUTING, 2023, 526 :96-108