Graph Transformer for 3D point clouds classification and semantic segmentation

被引：4

作者：

Zhou, Wei ^{[1
]}

Wang, Qian ^{[1
]}

Jin, Weiwei ^{[1
]}

Shi, Xinzhe ^{[1
]}

He, Ying ^{[2
]}

机构：

[1] Northwest Univ, Sch Informat Sci & Technol, Xian 710127, Peoples R China

[2] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 639798, Singapore

来源：

COMPUTERS & GRAPHICS-UK | 2024年 / 124卷

基金：

中国国家自然科学基金; 中国博士后科学基金;

关键词：

Point cloud; Graph transformer; Shape classification; Semantic segmentation; Deep learning; NETWORK; CONVOLUTION;

D O I：

10.1016/j.cag.2024.104050

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Recently, graph-based and Transformer-based deep learning have demonstrated excellent performances on various point cloud tasks. Most of the existing graph-based methods rely on static graph, which take a fixed input to establish graph relations. Moreover, many graph-based methods apply maximizing and averaging to aggregate neighboring features, so that only a single neighboring point affects the feature of centroid or different neighboring points own the same influence on the centroid's feature, which ignoring the correlation and difference between points. Most Transformer-based approaches extract point cloud features based on global attention and lack the feature learning on local neighbors. To solve the above issues of graph-based and Transformer-based models, we propose anew feature extraction block named Graph Transformer and construct a 3D point cloud learning network called GTNet to learn features of point clouds on local and global patterns. Graph Transformer integrates the advantages of graph-based and Transformer-based methods, and consists of Local Transformer that use intra-domain cross-attention and Global Transformer that use global self-attention. Finally, we use GTNet for shape classification, part segmentation and semantic segmentation tasks in this paper. The experimental results show that our model achieves good learning and prediction ability on most tasks. The source code and pre-trained model of GTNet will be released on https://github.com/NWUzhouwei/GTNet.

引用

页数：10

共 50 条

[1] MATNet: Semantic segmentation of 3D point clouds with multiscale adaptive transformer
Zheng, Yufei
Lu, Jian
Chen, Xiaogai
Zhang, Kaibing
Zhou, Jian
COMPUTERS & ELECTRICAL ENGINEERING, 2024, 119
[2] SEGCloud: Semantic Segmentation of 3D Point Clouds
Tchapmi, Lyne P.
Choy, Christopher B.
Armeni, Iro
Gwak, JunYoung
Savarese, Silvio
PROCEEDINGS 2017 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2017, : 537 - 547
[3] Augmented Edge Graph Convolutional Networks for Semantic Segmentation of 3D Point Clouds
Zhang Lujian
Bi Yuanwei
Liu Yaowen
Huang Yansen
LASER & OPTOELECTRONICS PROGRESS, 2024, 61 (08)
[4] DPRNet: Deep 3D Point Based Residual Network for Semantic Segmentation and Classification of 3D Point Clouds
Arshad, Saira
Shahzad, Muhammad
Riaz, Qaiser
Fraz, Muhammad Moazam
IEEE ACCESS, 2019, 7 : 68892 - 68904
[5] U-shaped network based on Transformer for 3D point clouds semantic segmentation
Zhang, Jiazhe
Li, Xingwei
Zhao, Xianfa
Ge, Yizhi
Zhang, Zheng
2021 THE 5TH INTERNATIONAL CONFERENCE ON VIDEO AND IMAGE PROCESSING, ICVIP 2021, 2021, : 170 - 176
[6] MeT: A graph transformer for semantic segmentation of 3D meshes
Vecchio, Giuseppe
Prezzavento, Luca
Pino, Carmelo
Rundo, Francesco
Palazzo, Simone
Spampinato, Concetto
COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 235
[7] MeT: A Graph Transformer for Semantic Segmentation of 3D Meshes
Department of Computer Engineering, University of Catania, Italy
不详
arXiv, 1600,
[8] Point attention network for semantic segmentation of 3D point clouds
Feng, Mingtao
Zhang, Liang
Lin, Xuefei
Gilani, Syed Zulqarnain
Mian, Ajmal
PATTERN RECOGNITION, 2020, 107 (107)
[9] Hierarchical Depthwise Graph Convolutional Neural Network for 3D Semantic Segmentation of Point Clouds
Liang, Zhidong
Yang, Ming
Deng, Liuyuan
Wang, Chunxiang
Wang, Bing
2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 8152 - 8158
[10] GrowSP: Unsupervised Semantic Segmentation of 3D Point Clouds
Zhang, Zihui
Yang, Bo
Wang, Bing
Li, Bo
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 17619 - 17629

← 1 2 3 4 5 →