DeepGCNs-Att: Point cloud semantic segmentation with contextual point representations

被引:1
作者
Jiang, Bin
Wang, Xinyu
Huang, Li
Xiao, Jian [1 ,2 ]
机构
[1] Nanjing Univ Posts & Telecommun, Coll Elect & Opt Engn, Nanjing, Peoples R China
[2] Nanjing Univ Posts & Telecommun, Coll Microelect, Nanjing, Peoples R China
基金
中国国家自然科学基金;
关键词
Point cloud processing; semantic segmentation; graph convolutional network; attention module; deep learning; CLASSIFICATION;
D O I
10.3233/JIFS-212030
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Graph Convolutional Networks are able to characterize non-Euclidean spaces effectively compared with traditional Convolutional Neural Networks, which can extract the local features of the point cloud using deep neural networks, but it cannot make full use of the global features of the point cloud for semantic segmentation. To solve this problem, this paper proposes a novel network structure called DeepGCNs-Att that enables deep Graph Convolutional Network to aggregate global context features efficiently. Moreover, to speed up the computation, we add an Attention layer after the Graph Convolutional Network Backbone Block to mutually enhance the connection between the distant points of the non-Euclidean space. Our model is tested on the standard benchmark S3DIS. By comparing with other deep Graph Convolutional Networks, our DeepGCNs-Att's mIoU has at least two percent higher than that of all other models and even shows excellent results in space complexity and computational complexity under the same number of Graph Convolutional Network layers.
引用
收藏
页码:3827 / 3836
页数:10
相关论文
共 47 条
  • [11] Rotational Projection Statistics for 3D Local Surface Description and Object Recognition
    Guo, Yulan
    Sohel, Ferdous
    Bennamoun, Mohammed
    Lu, Min
    Wan, Jianwei
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2013, 105 (01) : 63 - 86
  • [12] Hamid M., 2020, Journal of King Saud University-Computer and Information Sciences
  • [13] He J., 2019, ARXIV191105277
  • [14] He KM, 2017, IEEE I CONF COMP VIS, P2980, DOI [10.1109/TPAMI.2018.2844175, 10.1109/ICCV.2017.322]
  • [15] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
  • [16] Recurrent Slice Networks for 3D Segmentation of Point Clouds
    Huang, Qiangui
    Wang, Weiyue
    Neumann, Ulrich
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 2626 - 2635
  • [17] Multi-view PointNet for 3D Scene Understanding
    Jaritz, Maximilian
    Gu, Jiayuan
    Su, Hao
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 3995 - 4003
  • [18] 3D Convolutional Neural Networks for Human Action Recognition
    Ji, Shuiwang
    Xu, Wei
    Yang, Ming
    Yu, Kai
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (01) : 221 - 231
  • [19] Large-scale Point Cloud Semantic Segmentation with Superpoint Graphs
    Landrieu, Loic
    Simonovsky, Martin
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 4558 - 4567
  • [20] Survey on semantic segmentation using deep learning techniques
    Lateef, Fahad
    Ruichek, Yassine
    [J]. NEUROCOMPUTING, 2019, 338 : 321 - 348