3D Directional Encoding for Point Cloud Analysis

Cited by: 0

Authors:

Jung, Yoonjae [1]

Lee, Sang-Hyun [2]

Seo, Seung-Woo [1]

Affiliations:

[1] Seoul Natl Univ, Dept Elect & Comp Engn, Seoul 08826, South Korea

[2] Ajou Univ, Dept AI Mobil Engn, Suwon 16499, South Korea

Source:

IEEE ACCESS | 2024 / Vol. 12

Keywords:

Feature extraction; Vectors; Point cloud compression; Three-dimensional displays; Encoding; Transformers; Network architecture; Data mining; Computer architecture; Neural networks; Information retrieval; Classification; deep learning; directional feature extraction; efficient neural network; point cloud; segmentation;

DOI:

10.1109/ACCESS.2024.3472301

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

Extracting informative local features in point clouds is crucial for accurately understanding spatial information inside 3D point data. Previous works utilize either complex network designs or simple multi-layer perceptrons (MLPs) to extract the local features. However, complex networks often incur high computational cost, whereas simple MLPs may struggle to capture the spatial relations among local points effectively. These challenges limit their scalability to delicate and real-time tasks, such as autonomous driving and robot navigation. To address these challenges, we propose a novel 3D Directional Encoding Network (3D-DENet) capable of effectively encoding spatial relations with low computational cost. 3D-DENet extracts spatial and point features separately. The key component of 3D-DENet for spatial feature extraction is Directional Encoding (DE), which encodes the cosine similarity between direction vectors of local points and trainable direction vectors. To extract point features, we also propose Local Point Feature Multi-Aggregation (LPFMA), which integrates various aspects of local point features using diverse aggregation functions. By leveraging DE and LPFMA in a hierarchical structure, 3D-DENet efficiently captures both detailed spatial and high-level semantic features from point clouds. Experiments show that 3D-DENet is effective and efficient in classification and segmentation tasks. In particular, 3D-DENet achieves an overall accuracy of 90.7% and a mean accuracy of 90.1% on ScanObjectNN, outperforming the current state-of-the-art method while using only 47% of its floating-point operations.
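
The two ideas named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `directional_encoding` and `multi_aggregate` are hypothetical, the reference directions are fixed constants here (in the paper they are trainable), and the choice of max and mean as the "diverse aggregation functions" is an assumption, since the abstract does not list them.

```python
import math


def directional_encoding(center, neighbors, directions, eps=1e-9):
    """Cosine similarity between each neighbor's direction vector
    (neighbor minus center) and each reference direction.

    Returns one row per neighbor and one column per reference direction.
    The reference directions are plain constants in this sketch; treating
    them as learnable parameters is left out (assumption).
    """
    encoded = []
    for point in neighbors:
        offset = [p - c for p, c in zip(point, center)]
        offset_norm = math.sqrt(sum(o * o for o in offset))
        row = []
        for direction in directions:
            dir_norm = math.sqrt(sum(d * d for d in direction))
            dot = sum(o * d for o, d in zip(offset, direction))
            # eps guards against division by zero for degenerate vectors
            row.append(dot / (offset_norm * dir_norm + eps))
        encoded.append(row)
    return encoded


def multi_aggregate(features):
    """Aggregate per-neighbor features channel-wise with two functions
    (max and mean, both assumed) and concatenate the results."""
    columns = list(zip(*features))
    maxima = [max(col) for col in columns]
    means = [sum(col) / len(col) for col in columns]
    return maxima + means


# Toy neighborhood: two neighbors around the origin, two reference directions.
center = (0.0, 0.0, 0.0)
neighbors = [(1.0, 0.0, 0.0), (0.0, 2.0, 0.0)]
directions = [(1.0, 0.0, 0.0), (0.0, 0.0, 1.0)]

encoded = directional_encoding(center, neighbors, directions)
pooled = multi_aggregate(encoded)
```

In this toy case the first neighbor lies along the first reference direction, so its cosine similarity is close to 1, while the second neighbor is orthogonal to both reference directions and scores 0. Because cosine similarity ignores vector length, the encoding depends only on where neighbors lie relative to the center, not on how far away they are.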

Pages: 144533-144543

Page count: 11
