DeepGCNs-Att: Point cloud semantic segmentation with contextual point representations

被引：1

作者：

Jiang, Bin

Wang, Xinyu

Huang, Li

Xiao, Jian ^{[1
,2
]}

机构：

[1] Nanjing Univ Posts & Telecommun, Coll Elect & Opt Engn, Nanjing, Peoples R China

[2] Nanjing Univ Posts & Telecommun, Coll Microelect, Nanjing, Peoples R China

来源：

JOURNAL OF INTELLIGENT & FUZZY SYSTEMS | 2022年 / 42卷 / 04期

基金：

中国国家自然科学基金;

关键词：

Point cloud processing; semantic segmentation; graph convolutional network; attention module; deep learning; CLASSIFICATION;

D O I：

10.3233/JIFS-212030

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Graph Convolutional Networks are able to characterize non-Euclidean spaces effectively compared with traditional Convolutional Neural Networks, which can extract the local features of the point cloud using deep neural networks, but it cannot make full use of the global features of the point cloud for semantic segmentation. To solve this problem, this paper proposes a novel network structure called DeepGCNs-Att that enables deep Graph Convolutional Network to aggregate global context features efficiently. Moreover, to speed up the computation, we add an Attention layer after the Graph Convolutional Network Backbone Block to mutually enhance the connection between the distant points of the non-Euclidean space. Our model is tested on the standard benchmark S3DIS. By comparing with other deep Graph Convolutional Networks, our DeepGCNs-Att's mIoU has at least two percent higher than that of all other models and even shows excellent results in space complexity and computational complexity under the same number of Graph Convolutional Network layers.

引用

页码：3827 / 3836

页数：10

共 47 条

[1] [Anonymous], 4 INT C LEARNING REP
[2] Armeni S., 2017, ARXIV170201105
[3] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
Badrinarayanan, Vijay
Kendall, Alex
Cipolla, Roberto
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
[4] Boulch A., 2017, 3dor@eurographics, P17, DOI [DOI 10.2312/3DOR.20171047, 10.2312/3dor.20171047]
[5] Caesar H., 2020, IEEE CVF C COMP VIS
[6] Shape Completion using 3D-Encoder-Predictor CNNs and Shape Synthesis
Dai, Angela
Qi, Charles Ruizhongtai
Niessner, Matthias
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6545 - 6554
[7] Exploring Spatial Context for 3D Semantic Segmentation of Point Clouds
Engelmann, Francis
Kontogianni, Theodora
Hermans, Alexander
Leibe, Bastian
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, : 716 - 724
[8] Dual Attention Network for Scene Segmentation
Fu, Jun
Liu, Jing
Tian, Haijie
Li, Yong
Bao, Yongjun
Fang, Zhiwei
Lu, Hanqing
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3141 - 3149
[9] Fast R-CNN
Girshick, Ross
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1440 - 1448
[10] Rich feature hierarchies for accurate object detection and semantic segmentation
Girshick, Ross
Donahue, Jeff
Darrell, Trevor
Malik, Jitendra
[J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 580 - 587

← 1 2 3 4 5 →