Point Cloud-Based 3D Object Classification With Non Local Attention and Lightweight Convolution Neural Networks

被引:1
作者
Karthik, R. [1 ]
Inamdar, Rohan [2 ]
Sundarr, S. Kavin [2 ]
Cho, Jaehyuk [3 ]
Veerappampalayam Easwaramoorthy, Sathishkumar [4 ]
机构
[1] Vellore Inst Technol, CCPS, Chennai 600127, India
[2] Vellore Inst Technol, Sch Comp Sci & Engn SCOPE, Chennai 600127, India
[3] Jeonbuk Natl Univ, Dept Software Engn, Div Elect & Informat Engn, Jeonju Si 54896, Jeonrabug Do, South Korea
[4] Sunway Univ, Sch Engn & Technol, Selangor 47500, Malaysia
关键词
Three-dimensional displays; Feature extraction; Shape; Point cloud compression; Solid modeling; Object recognition; Convolutional neural networks; Accuracy; Adaptation models; Convolution; 3D object classification; lightweight CNN; point cloud; non-local attention; dualpath block; adaptive sampling; RECOGNITION; CNN;
D O I
10.1109/ACCESS.2024.3485906
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Three-dimensional (3D) object classification is crucial in various applications, including autonomous driving, robotics, and augmented reality, where precise detection of objects in 3D space is required. Traditional techniques for 3D object classification encompass voxel-based representations, multi-view projections, and point cloud analysis. However, these approaches can be computationally intensive, and might not capture complete structure of the underlying object. This research introduces a novel lightweight model for 3D object classification based on point clouds that achieves higher accuracy while requiring minimal computing overhead. The proposed system incorporates adaptive sampling, an Attentive-Convolution Feature block, a Dualpath feature processing block, and a non-local attention mechanism to enhance feature representation and classification performance. To our knowledge, this is the first study to propose a lightweight TnetLight module as a transformation network that aligns and converts input point clouds, capturing subtle geometric variations and highlighting differences between objects to enhance shape recognition and differentiation accuracy. Additionally, the model integrates an Attentive-Convolution Feature extraction block that combines local geometric features with global contextual information, enhancing the network's ability to capture both detailed and broader characteristics for better differentiation between objects with slight variations. The network was evaluated on the ModelNet10 dataset, achieving an accuracy of 94.4%.
引用
收藏
页码:158530 / 158545
页数:16
相关论文
共 44 条
[1]   FinerPCN: High fidelity point cloud completion network using pointwise convolution [J].
Chang, Yakun ;
Jung, Cheolkon ;
Xu, Yuanquan .
NEUROCOMPUTING, 2021, 460 :266-276
[2]   DDGCN: graph convolution network based on direction and distance for point cloud learning [J].
Chen, Lifang ;
Zhang, Qian .
VISUAL COMPUTER, 2023, 39 (03) :863-873
[3]   End-to-End Learnt Image Compression via Non-Local Attention Optimization and Improved Context Modeling [J].
Chen, Tong ;
Liu, Haojie ;
Ma, Zhan ;
Shen, Qiu ;
Cao, Xun ;
Wang, Yao .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 :3179-3191
[4]   Learning point cloud context information based on 3D transformer for more accurate and efficient classification [J].
Chen, Yiping ;
Zhang, Shuai ;
Lin, Weisheng ;
Zhang, Shuhang ;
Zhang, Wuming .
PHOTOGRAMMETRIC RECORD, 2023, 38 (184) :603-616
[5]   General-Purpose Deep Point Cloud Feature Extractor [J].
Dominguez, Miguel ;
Dhamdhere, Rohan ;
Petkar, Atir ;
Jain, Saloni ;
Sah, Shagan ;
Ptucha, Raymond .
2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, :1972-1981
[6]   A voxelized point clouds representation for object classification and segmentation on 3D data [J].
Gezawa, Abubakar Sulaiman ;
Bello, Zikirillahi A. ;
Wang, Qicong ;
Lei Yunqi .
JOURNAL OF SUPERCOMPUTING, 2022, 78 (01) :1479-1500
[7]   Adaptive Deep Learning-Based Point Cloud Geometry Coding [J].
Guarda, Andre F. R. ;
Rodrigues, Nuno M. M. ;
Pereira, Fernando .
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2021, 15 (02) :415-430
[8]   Deep Learning for 3D Point Clouds: A Survey [J].
Guo, Yulan ;
Wang, Hanyun ;
Hu, Qingyong ;
Liu, Hao ;
Liu, Li ;
Bennamoun, Mohammed .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (12) :4338-4364
[9]   FNE-PCT: An Efficient Transformer Network for 3D Classification [J].
Han, Ming ;
Sha, Jianjun ;
Wang, Yanheng ;
Ma, Chengyuan ;
Zhang, Xiang .
PROCEEDINGS OF 2022 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION (IEEE ICMA 2022), 2022, :1046-1050
[10]   M-GCN: Multi-scale Graph Convolutional Network for 3D Point Cloud Classification [J].
Hu, Jing ;
Wang, Xincheng ;
Liao, Ziheng ;
Xiao, Tingsong .
2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, :924-929