An improved fused feature residual network for 3D point cloud data

被引：0

作者：

Gezawa, Abubakar Sulaiman ^{[1
]}

Liu, Chibiao ^{[1
]}

Jia, Heming ^{[1
]}

Nanehkaran, Y. A. ^{[2
]}

Almutairi, Mubarak S. ^{[3
]}

Chiroma, Haruna ^{[4
]}

机构：

[1] Sanming Univ, Coll Informat Engn, Fujian Key Lab Agr IOT Applicat, Sanming, Fujian, Peoples R China

[2] Yancheng Teachers Univ, Sch Informat Engn, Dept Software Engn, Yancheng, Jiangsu, Peoples R China

[3] Univ Hafr Al Batin, Coll Comp Sci & Engn, Hafar al Batin, Saudi Arabia

[4] Univ Hafr Al Batin, Coll Comp Sci & Engn Technol, Appl Coll, Hafar Al Batin, Saudi Arabia

来源：

FRONTIERS IN COMPUTATIONAL NEUROSCIENCE | 2023年 / 17卷

关键词：

point clouds; part segmentation; classification; shape features; 3D objects recognition; CLASSIFICATION; SEGMENTATION;

D O I：

10.3389/fncom.2023.1204445

中图分类号：

Q [生物科学];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Point clouds have evolved into one of the most important data formats for 3D representation. It is becoming more popular as a result of the increasing affordability of acquisition equipment and growing usage in a variety of fields. Volumetric grid-based approaches are among the most successful models for processing point clouds because they fully preserve data granularity while additionally making use of point dependency. However, using lower order local estimate functions to close 3D objects, such as the piece-wise constant function, necessitated the use of a high-resolution grid in order to capture detailed features that demanded vast computational resources. This study proposes an improved fused feature network as well as a comprehensive framework for solving shape classification and segmentation tasks using a two-branch technique and feature learning. We begin by designing a feature encoding network with two distinct building blocks: layer skips within, batch normalization (BN), and rectified linear units (ReLU) in between. The purpose of using layer skips is to have fewer layers to propagate across, which will speed up the learning process and lower the effect of gradients vanishing. Furthermore, we develop a robust grid feature extraction module that consists of multiple convolution blocks accompanied by max-pooling to represent a hierarchical representation and extract features from an input grid. We overcome the grid size constraints by sampling a constant number of points in each grid using a simple K-points nearest neighbor (KNN) search, which aids in learning approximation functions in higher order. The proposed method outperforms or is comparable to state-of-the-art approaches in point cloud segmentation and classification tasks. In addition, a study of ablation is presented to show the effectiveness of the proposed method.

引用

页数：16

共 75 条

[41] 3DCTN: 3D Convolution-Transformer Network for Point Cloud Classification
Lu, Dening
Xie, Qian
Gao, Kyle
Xu, Linlin
Li, Jonathan
[J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (12) : 24854 - 24865
[42] Ma C., 2017, British Machine Vision Conference 2017, BMVC 2017
[43] Maturana D, 2015, IEEE INT C INT ROBOT, P922, DOI 10.1109/IROS.2015.7353481
[44] Nair V., 2010, P 27 INT C MACH LEAR, P807
[45] Qi C R, 2017, P 31 INT C NEUR INF, V2017, P5099
[46] Qi ZK, 2023, Arxiv, DOI arXiv:2302.02318
[47] OctNet: Learning Deep 3D Representations at High Resolutions
Riegler, Gernot
Ulusoy, Ali Osman
Geiger, Andreas
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6620 - 6629
[48] Sfikas K., 2017, EUR WORKSH 3D OBJ RE, DOI [10.2312/3dor.20171045, DOI 10.2312/3DOR.20171045]
[49] DeepPano: Deep Panoramic Representation for 3-D Shape Recognition
Shi, Baoguang
Bai, Song
Zhou, Zhichao
Bai, Xiang
[J]. IEEE SIGNAL PROCESSING LETTERS, 2015, 22 (12) : 2339 - 2343
[50] Dynamic Edge-Conditioned Filters in Convolutional Neural Networks on Graphs
Simonovsky, Martin
Komodakis, Nikos
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 29 - 38

← 1 2 3 4 5 6 7 8 →