SGT-Net: A Transformer-Based Stratified Graph Convolutional Network for 3D Point Cloud Semantic Segmentation

Cited by: 2
Authors
Liu, Suyi [1]
Chi, Jianning [1]
Wu, Chengdong [1]
Xu, Fang [2, 3, 4]
Yu, Xiaosheng [1]
Affiliations
[1] Northeastern Univ, Fac Robot Sci & Engn, Shenyang 110167, Peoples R China
[2] Chinese Acad Sci, Shenyang Inst Automat, State Key Lab Robot, Shenyang 110016, Peoples R China
[3] Chinese Acad Sci, Inst Robot & Intelligent Mfg, Shenyang 110169, Peoples R China
[4] SIASUN Robot & Automat Co Ltd, Shenyang 110169, Peoples R China
Source
CMC-COMPUTERS MATERIALS & CONTINUA | 2024 / Vol. 79 / No. 03
Funding
National Natural Science Foundation of China
Keywords
3D point cloud; semantic segmentation; long-range contexts; global-local feature; graph convolutional network; dense-sparse sampling strategy;
DOI
10.32604/cmc.2024.049450
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
In recent years, semantic segmentation of 3D point cloud data has attracted much attention. Unlike 2D images, where pixels are distributed regularly in the image domain, 3D point clouds in non-Euclidean space are irregular and inherently sparse. It is therefore very difficult to extract long-range contexts and effectively aggregate local features for semantic segmentation in 3D point cloud space. Most current methods focus either on local feature aggregation or on long-range context dependency, but fail to directly establish a global-local feature extractor for point cloud semantic segmentation. In this paper, we propose a Transformer-based stratified graph convolutional network (SGT-Net), which enlarges the effective receptive field and builds direct long-range dependency. Specifically, we first propose a novel dense-sparse sampling strategy that provides dense local vertices and sparse long-distance vertices for the subsequent graph convolutional network (GCN). Second, we propose a multi-key self-attention mechanism based on the Transformer to further strengthen the weights of crucial neighboring relationships and enlarge the effective receptive field. In addition, to further improve the efficiency of the network, we propose a similarity measurement module to determine whether the neighborhood near the center point is effective. We demonstrate the validity and superiority of our method on the S3DIS and ShapeNet datasets. Through ablation experiments and segmentation visualization, we verify that the SGT model can improve the performance of point cloud semantic segmentation.
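As a rough illustration of the dense-sparse sampling idea mentioned in the abstract, the sketch below picks dense nearby neighbors and sparse farther neighbors for one center point. The abstract does not give the actual selection rule, so everything here is an assumption made for illustration: the function name `dense_sparse_neighbors`, the parameters `k_dense`, `k_sparse`, and `stride`, and the choice of "nearest k" for the dense set and "every stride-th farther point" for the sparse set. It is not the authors' implementation.

```python
import math


def dense_sparse_neighbors(points, center_idx, k_dense=4, k_sparse=2, stride=3):
    """Return (dense, sparse) neighbor index lists for one center point.

    Assumption (not from the paper): 'dense' is the k_dense nearest points,
    and 'sparse' is every stride-th point among the remaining, farther points.
    """
    center = points[center_idx]
    # Sort all other points by Euclidean distance to the center.
    order = sorted(
        (i for i in range(len(points)) if i != center_idx),
        key=lambda i: math.dist(points[i], center),
    )
    # Dense local vertices: the closest k_dense points.
    dense = order[:k_dense]
    # Sparse long-distance vertices: stride through the farther points.
    sparse = order[k_dense::stride][:k_sparse]
    return dense, sparse


# Toy cloud: ten points spaced one unit apart along the x-axis.
cloud = [(float(x), 0.0, 0.0) for x in range(10)]
dense, sparse = dense_sparse_neighbors(cloud, center_idx=0)
print(dense, sparse)
```

In this toy setup the combined neighbor set covers both the immediate neighborhood and a few distant points, which is the kind of vertex set a graph convolution could then aggregate over.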
Pages: 4471-4489
Page count: 19