DeepGCNs-Att: Point cloud semantic segmentation with contextual point representations

Cited by: 1
Authors
Jiang, Bin
Wang, Xinyu
Huang, Li
Xiao, Jian [1, 2]
Affiliations
[1] Nanjing Univ Posts & Telecommun, Coll Elect & Opt Engn, Nanjing, Peoples R China
[2] Nanjing Univ Posts & Telecommun, Coll Microelect, Nanjing, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Point cloud processing; semantic segmentation; graph convolutional network; attention module; deep learning; CLASSIFICATION;
DOI
10.3233/JIFS-212030
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Compared with traditional Convolutional Neural Networks, Graph Convolutional Networks can characterize non-Euclidean spaces effectively and can extract the local features of a point cloud using deep neural networks, but they cannot make full use of the point cloud's global features for semantic segmentation. To solve this problem, this paper proposes a novel network structure called DeepGCNs-Att that enables a deep Graph Convolutional Network to aggregate global context features efficiently. Moreover, to speed up the computation, we add an Attention layer after the Graph Convolutional Network Backbone Block to mutually enhance the connection between distant points in the non-Euclidean space. Our model is tested on the standard benchmark S3DIS. Compared with other deep Graph Convolutional Networks, DeepGCNs-Att achieves an mIoU at least two percent higher than all other models and also shows excellent results in space complexity and computational complexity under the same number of Graph Convolutional Network layers.
Pages: 3827-3836
Number of pages: 10