3D Directional Encoding for Point Cloud Analysis

被引:0
|
作者
Jung, Yoonjae [1 ]
Lee, Sang-Hyun [2 ]
Seo, Seung-Woo [1 ]
机构
[1] Seoul Natl Univ, Dept Elect & Comp Engn, Seoul 08826, South Korea
[2] Ajou Univ, Dept AI Mobil Engn, Suwon 16499, South Korea
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Feature extraction; Vectors; Point cloud compression; Three-dimensional displays; Encoding; Transformers; Network architecture; Data mining; Computer architecture; Neural networks; Information retrieval; Classification; deep learning; directional feature extraction; efficient neural network; point cloud; segmentation;
D O I
10.1109/ACCESS.2024.3472301
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Extracting informative local features in point clouds is crucial for accurately understanding spatial information inside 3D point data. Previous works utilize either complex network designs or simple multi-layer perceptrons (MLP) to extract the local features. However, complex networks often incur high computational cost, whereas simple MLP may struggle to capture the spatial relations among local points effectively. These challenges limit their scalability to delicate and real-time tasks, such as autonomous driving and robot navigation. To address these challenges, we propose a novel 3D Directional Encoding Network (3D-DENet) capable of effectively encoding spatial relations with low computational cost. 3D-DENet extracts spatial and point features separately. The key component of 3D-DENet for spatial feature extraction is Directional Encoding (DE), which encodes the cosine similarity between direction vectors of local points and trainable direction vectors. To extract point features, we also propose Local Point Feature Multi-Aggregation (LPFMA), which integrates various aspects of local point features using diverse aggregation functions. By leveraging DE and LPFMA in a hierarchical structure, 3D-DENet efficiently captures both detailed spatial and high-level semantic features from point clouds. Experiments show that 3D-DENet is effective and efficient in classification and segmentation tasks. In particular, 3D-DENet achieves an overall accuracy of 90.7% and a mean accuracy of 90.1% on ScanObjectNN, outperforming the current state-of-the-art method while using only 47% floating point operations.
引用
收藏
页码:144533 / 144543
页数:11
相关论文
共 50 条
  • [41] Mutual Information Maximization Based Similarity Operation for 3D Point Cloud Completion Network
    Wang, Di
    Tang, Lulu
    Zhu, Lei
    Yang, Zhi-Xin
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 1217 - 1221
  • [42] TransMRE: Multiple Observation Planes Representation Encoding With Fully Sparse Voxel Transformers for 3-D Object Detection
    Zhu, Ziming
    Zhu, Yu
    Zhang, Kezhi
    Li, Hangyu
    Ling, Xiaofeng
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73
  • [43] Transformer for 3D Point Clouds
    Wang, Jiayun
    Chakraborty, Rudrasis
    Yu, Stella X.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (08) : 4419 - 4431
  • [44] An Effective Encoding Method Based on Local Information for 3D Point Cloud Classification
    Song, Yanan
    Gao, Liang
    Li, Xinyu
    Pan, Quan-Ke
    IEEE ACCESS, 2019, 7 : 39369 - 39377
  • [45] Background-Aware 3-D Point Cloud Segmentation With Dynamic Point Feature Aggregation
    Chen, Jiajing
    Kakillioglu, Burak
    Velipasalar, Senem
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [46] Revisiting 3D point cloud analysis with Markov process
    Jiang, Chenru
    Ma, Wuwei
    Huang, Kaizhu
    Wang, Qiufeng
    Yang, Xi
    Zhao, Weiguang
    Wu, Junwei
    Wang, Xinheng
    Xiao, Jimin
    Niu, Zhenxing
    PATTERN RECOGNITION, 2025, 158
  • [47] Hypergraph Spectral Analysis and Processing in 3D Point Cloud
    Zhang, Songyang
    Cui, Shuguang
    Ding, Zhi
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 1193 - 1206
  • [48] Automatic Stockpile Extraction and Measurement Using 3D Point Cloud and Multi-Scale Directional Curvature
    Yang, Xingyu
    Huang, Yuchun
    Zhang, Qiulan
    REMOTE SENSING, 2020, 12 (06)
  • [49] 3D Target Detection Incorporating Point Cloud Columnarization and Attention Mechanisms in Intelligent Driving Systems
    Wang, Hongliang
    Zhang, Jingzhu
    IEEE ACCESS, 2024, 12 : 75124 - 75135
  • [50] Progressive Framework of Learning 3D Object Classes and Orientations from Deep Point Cloud Representation
    Lee, Sukhan
    Cheng, Wencan
    PROCEEDINGS OF THE 2020 14TH INTERNATIONAL CONFERENCE ON UBIQUITOUS INFORMATION MANAGEMENT AND COMMUNICATION (IMCOM), 2020,