Graph Transformer for 3D point clouds classification and semantic segmentation

被引:4
|
作者
Zhou, Wei [1 ]
Wang, Qian [1 ]
Jin, Weiwei [1 ]
Shi, Xinzhe [1 ]
He, Ying [2 ]
机构
[1] Northwest Univ, Sch Informat Sci & Technol, Xian 710127, Peoples R China
[2] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 639798, Singapore
来源
COMPUTERS & GRAPHICS-UK | 2024年 / 124卷
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Point cloud; Graph transformer; Shape classification; Semantic segmentation; Deep learning; NETWORK; CONVOLUTION;
D O I
10.1016/j.cag.2024.104050
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Recently, graph-based and Transformer-based deep learning have demonstrated excellent performances on various point cloud tasks. Most of the existing graph-based methods rely on static graph, which take a fixed input to establish graph relations. Moreover, many graph-based methods apply maximizing and averaging to aggregate neighboring features, so that only a single neighboring point affects the feature of centroid or different neighboring points own the same influence on the centroid's feature, which ignoring the correlation and difference between points. Most Transformer-based approaches extract point cloud features based on global attention and lack the feature learning on local neighbors. To solve the above issues of graph-based and Transformer-based models, we propose anew feature extraction block named Graph Transformer and construct a 3D point cloud learning network called GTNet to learn features of point clouds on local and global patterns. Graph Transformer integrates the advantages of graph-based and Transformer-based methods, and consists of Local Transformer that use intra-domain cross-attention and Global Transformer that use global self-attention. Finally, we use GTNet for shape classification, part segmentation and semantic segmentation tasks in this paper. The experimental results show that our model achieves good learning and prediction ability on most tasks. The source code and pre-trained model of GTNet will be released on https://github.com/NWUzhouwei/GTNet.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] MATNet: Semantic segmentation of 3D point clouds with multiscale adaptive transformer
    Zheng, Yufei
    Lu, Jian
    Chen, Xiaogai
    Zhang, Kaibing
    Zhou, Jian
    COMPUTERS & ELECTRICAL ENGINEERING, 2024, 119
  • [2] SEGCloud: Semantic Segmentation of 3D Point Clouds
    Tchapmi, Lyne P.
    Choy, Christopher B.
    Armeni, Iro
    Gwak, JunYoung
    Savarese, Silvio
    PROCEEDINGS 2017 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2017, : 537 - 547
  • [3] Augmented Edge Graph Convolutional Networks for Semantic Segmentation of 3D Point Clouds
    Zhang Lujian
    Bi Yuanwei
    Liu Yaowen
    Huang Yansen
    LASER & OPTOELECTRONICS PROGRESS, 2024, 61 (08)
  • [4] DPRNet: Deep 3D Point Based Residual Network for Semantic Segmentation and Classification of 3D Point Clouds
    Arshad, Saira
    Shahzad, Muhammad
    Riaz, Qaiser
    Fraz, Muhammad Moazam
    IEEE ACCESS, 2019, 7 : 68892 - 68904
  • [5] U-shaped network based on Transformer for 3D point clouds semantic segmentation
    Zhang, Jiazhe
    Li, Xingwei
    Zhao, Xianfa
    Ge, Yizhi
    Zhang, Zheng
    2021 THE 5TH INTERNATIONAL CONFERENCE ON VIDEO AND IMAGE PROCESSING, ICVIP 2021, 2021, : 170 - 176
  • [6] MeT: A graph transformer for semantic segmentation of 3D meshes
    Vecchio, Giuseppe
    Prezzavento, Luca
    Pino, Carmelo
    Rundo, Francesco
    Palazzo, Simone
    Spampinato, Concetto
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 235
  • [7] MeT: A Graph Transformer for Semantic Segmentation of 3D Meshes
    Department of Computer Engineering, University of Catania, Italy
    不详
    arXiv, 1600,
  • [8] Point attention network for semantic segmentation of 3D point clouds
    Feng, Mingtao
    Zhang, Liang
    Lin, Xuefei
    Gilani, Syed Zulqarnain
    Mian, Ajmal
    PATTERN RECOGNITION, 2020, 107 (107)
  • [9] Hierarchical Depthwise Graph Convolutional Neural Network for 3D Semantic Segmentation of Point Clouds
    Liang, Zhidong
    Yang, Ming
    Deng, Liuyuan
    Wang, Chunxiang
    Wang, Bing
    2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 8152 - 8158
  • [10] GrowSP: Unsupervised Semantic Segmentation of 3D Point Clouds
    Zhang, Zihui
    Yang, Bo
    Wang, Bing
    Li, Bo
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 17619 - 17629