Point Cloud-Based 3D Object Classification With Non Local Attention and Lightweight Convolution Neural Networks

被引：1

作者：

Karthik, R. ^{[1
]}

Inamdar, Rohan ^{[2
]}

Sundarr, S. Kavin ^{[2
]}

Cho, Jaehyuk ^{[3
]}

Veerappampalayam Easwaramoorthy, Sathishkumar ^{[4
]}

机构：

[1] Vellore Inst Technol, CCPS, Chennai 600127, India

[2] Vellore Inst Technol, Sch Comp Sci & Engn SCOPE, Chennai 600127, India

[3] Jeonbuk Natl Univ, Dept Software Engn, Div Elect & Informat Engn, Jeonju Si 54896, Jeonrabug Do, South Korea

[4] Sunway Univ, Sch Engn & Technol, Selangor 47500, Malaysia

来源：

IEEE ACCESS | 2024年 / 12卷

关键词：

Three-dimensional displays; Feature extraction; Shape; Point cloud compression; Solid modeling; Object recognition; Convolutional neural networks; Accuracy; Adaptation models; Convolution; 3D object classification; lightweight CNN; point cloud; non-local attention; dualpath block; adaptive sampling; RECOGNITION; CNN;

D O I：

10.1109/ACCESS.2024.3485906

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Three-dimensional (3D) object classification is crucial in various applications, including autonomous driving, robotics, and augmented reality, where precise detection of objects in 3D space is required. Traditional techniques for 3D object classification encompass voxel-based representations, multi-view projections, and point cloud analysis. However, these approaches can be computationally intensive, and might not capture complete structure of the underlying object. This research introduces a novel lightweight model for 3D object classification based on point clouds that achieves higher accuracy while requiring minimal computing overhead. The proposed system incorporates adaptive sampling, an Attentive-Convolution Feature block, a Dualpath feature processing block, and a non-local attention mechanism to enhance feature representation and classification performance. To our knowledge, this is the first study to propose a lightweight TnetLight module as a transformation network that aligns and converts input point clouds, capturing subtle geometric variations and highlighting differences between objects to enhance shape recognition and differentiation accuracy. Additionally, the model integrates an Attentive-Convolution Feature extraction block that combines local geometric features with global contextual information, enhancing the network's ability to capture both detailed and broader characteristics for better differentiation between objects with slight variations. The network was evaluated on the ModelNet10 dataset, achieving an accuracy of 94.4%.

引用

页码：158530 / 158545

页数：16

共 44 条

[1] FinerPCN: High fidelity point cloud completion network using pointwise convolution [J].