EDGCNet: Joint dynamic hyperbolic graph convolution and dual squeeze-and-attention for 3D point cloud segmentation

被引：13

作者：

Cheng, Haozhe ^{[1
,2
]}

Zhu, Jihua ^{[1
,2
]}

Lu, Jian ^{[3
]}

Han, Xu ^{[1
,2
]}

机构：

[1] Xi An Jiao Tong Univ, Sch Software Engn, Xian, Peoples R China

[2] Shaanxi Joint Key Lab Artifact Intelligence, Xian, Peoples R China

[3] Xian Polytech Univ, Sch Elect & Informat, Xian, Peoples R China

来源：

EXPERT SYSTEMS WITH APPLICATIONS | 2024年 / 237卷

关键词：

3D point cloud segmentation; Hyperbolic graph convolution; Channel attention mechanism; NETWORKS;

D O I：

10.1016/j.eswa.2023.121551

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper proposes a novel 3D point cloud segmentation network called EDGCNet. Structurally, the network combines the encoder-decoder structure and graph convolution to improve the processing efficiency and the exploration of point-to-point correlation. In addition, it contains two sub-modules: the dynamic hyperbaric graph convolution module and the dual squeeze-and-attention module. To mitigate the issue of incorrect inference in existing graph convolution networks for multi-category hybrid regions, the dynamic hyperbolic graph convolution module maps the features captured in the Euclidean space to the hyperbolic space to aggregate the neighborhoods and dynamically update the edge weights. Compared to Euclidean geometry, hyperbolic embedding with learnable curvature can deeply fit the geometric topology of point cloud by transforming the manifold shape to enrich feature representation. The dual squeeze-and-attention module recalibrates the original features by modeling the correlation between channels from both global and local perspectives to improve the feature shift caused by the deepening of the convolution. Empirical experiments on three public datasets verify that EDGCNet has an excellent segmentation effect on both artificial shapes and real scenes. Furthermore, ablation studies and confirmatory experiments show that the modules in EDGCNet are contributing, low-complexity and robust.

引用

页数：16

共 86 条

[1] 3D Semantic Parsing of Large-Scale Indoor Spaces [J].

Armeni, Iro ;

Sener, Ozan ;

Zamir, Amir R. ;

Jiang, Helen ;

Brilakis, Ioannis ;

Fischer, Martin ;

Savarese, Silvio .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :1534-1543

[2] Point Convolutional Neural Networks by Extension Operators [J].

Atzmon, Matan ;

Maron, Haggai ;

Lipman, Yaron .

ACM TRANSACTIONS ON GRAPHICS, 2018, 37 (04)

[3]

Bachmann G, 2020, PR MACH LEARN RES, V119

[4] Micro-expression recognition using 3D DenseNet fused Squeeze-and-Excitation Networks [J].

Cai, Linqin ;

Li, Hao ;

Dong, Wei ;

Fang, Haodu .

APPLIED SOFT COMPUTING, 2022, 119

[5]

Chami I, 2019, ADV NEUR IN, V32

[6]

Chen C, 2019, Arxiv, DOI arXiv:1905.08705

[7] HAPGN: Hierarchical Attentive Pooling Graph Network for Point Cloud Segmentation [J].

Chen, Chaofan ;

Qian, Shengsheng ;

Fang, Quan ;

Xu, Changsheng .

IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 :2335-2346

[8]

Chen LZ, 2019, Arxiv, DOI arXiv:1905.05442

[9] PTANet: Triple Attention Network for point cloud semantic segmentation [J].

Cheng, Haozhe ;

Lu, Jian ;

Luo, Maoxin ;

Liu, Wei ;

Zhang, Kaibing .

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 102

[10] TransRVNet: LiDAR Semantic Segmentation With Transformer [J].

Cheng, Hui-Xian ;

Han, Xian-Feng ;

Xiao, Guo-Qiang .

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (06) :5895-5907

← 1 2 3 4 5 6 7 8 9 →