An improved fused feature residual network for 3D point cloud data

被引:0
作者
Gezawa, Abubakar Sulaiman [1 ]
Liu, Chibiao [1 ]
Jia, Heming [1 ]
Nanehkaran, Y. A. [2 ]
Almutairi, Mubarak S. [3 ]
Chiroma, Haruna [4 ]
机构
[1] Sanming Univ, Coll Informat Engn, Fujian Key Lab Agr IOT Applicat, Sanming, Fujian, Peoples R China
[2] Yancheng Teachers Univ, Sch Informat Engn, Dept Software Engn, Yancheng, Jiangsu, Peoples R China
[3] Univ Hafr Al Batin, Coll Comp Sci & Engn, Hafar al Batin, Saudi Arabia
[4] Univ Hafr Al Batin, Coll Comp Sci & Engn Technol, Appl Coll, Hafar Al Batin, Saudi Arabia
关键词
point clouds; part segmentation; classification; shape features; 3D objects recognition; CLASSIFICATION; SEGMENTATION;
D O I
10.3389/fncom.2023.1204445
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Point clouds have evolved into one of the most important data formats for 3D representation. It is becoming more popular as a result of the increasing affordability of acquisition equipment and growing usage in a variety of fields. Volumetric grid-based approaches are among the most successful models for processing point clouds because they fully preserve data granularity while additionally making use of point dependency. However, using lower order local estimate functions to close 3D objects, such as the piece-wise constant function, necessitated the use of a high-resolution grid in order to capture detailed features that demanded vast computational resources. This study proposes an improved fused feature network as well as a comprehensive framework for solving shape classification and segmentation tasks using a two-branch technique and feature learning. We begin by designing a feature encoding network with two distinct building blocks: layer skips within, batch normalization (BN), and rectified linear units (ReLU) in between. The purpose of using layer skips is to have fewer layers to propagate across, which will speed up the learning process and lower the effect of gradients vanishing. Furthermore, we develop a robust grid feature extraction module that consists of multiple convolution blocks accompanied by max-pooling to represent a hierarchical representation and extract features from an input grid. We overcome the grid size constraints by sampling a constant number of points in each grid using a simple K-points nearest neighbor (KNN) search, which aids in learning approximation functions in higher order. The proposed method outperforms or is comparable to state-of-the-art approaches in point cloud segmentation and classification tasks. In addition, a study of ablation is presented to show the effectiveness of the proposed method.
引用
收藏
页数:16
相关论文
共 75 条
  • [41] 3DCTN: 3D Convolution-Transformer Network for Point Cloud Classification
    Lu, Dening
    Xie, Qian
    Gao, Kyle
    Xu, Linlin
    Li, Jonathan
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (12) : 24854 - 24865
  • [42] Ma C., 2017, British Machine Vision Conference 2017, BMVC 2017
  • [43] Maturana D, 2015, IEEE INT C INT ROBOT, P922, DOI 10.1109/IROS.2015.7353481
  • [44] Nair V., 2010, P 27 INT C MACH LEAR, P807
  • [45] Qi C R, 2017, P 31 INT C NEUR INF, V2017, P5099
  • [46] Qi ZK, 2023, Arxiv, DOI arXiv:2302.02318
  • [47] OctNet: Learning Deep 3D Representations at High Resolutions
    Riegler, Gernot
    Ulusoy, Ali Osman
    Geiger, Andreas
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6620 - 6629
  • [48] Sfikas K., 2017, EUR WORKSH 3D OBJ RE, DOI [10.2312/3dor.20171045, DOI 10.2312/3DOR.20171045]
  • [49] DeepPano: Deep Panoramic Representation for 3-D Shape Recognition
    Shi, Baoguang
    Bai, Song
    Zhou, Zhichao
    Bai, Xiang
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2015, 22 (12) : 2339 - 2343
  • [50] Dynamic Edge-Conditioned Filters in Convolutional Neural Networks on Graphs
    Simonovsky, Martin
    Komodakis, Nikos
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 29 - 38