Multiview Fusion Driven 3-D Point Cloud Semantic Segmentation Based on Hierarchical Transformer

被引:8
|
作者
Xu, Wang [1 ]
Li, Xu [1 ]
Ni, Peizhou [1 ]
Guang, Xingxing [2 ,3 ]
Luo, Hang [2 ,3 ]
Zhao, Xijun [2 ,3 ]
机构
[1] Southeast Univ, Sch Instrument Sci & Engn, Nanjing 210096, Peoples R China
[2] China North Artificial Intelligence & Innovat Res, Beijing 100072, Peoples R China
[3] Collective Intelligence & Collaborat Lab CIC, Beijing 100072, Peoples R China
关键词
3-D point cloud; multihead attention; multiview fusion; semantic segmentation;
D O I
10.1109/JSEN.2023.3328603
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Three-dimensional semantic segmentation is a key task of environment understanding in various outdoor scenes. Due to the sparsity and varying density of point clouds, it becomes challenging to obtain fine-gained segmentation results. Previous point-based and voxel-based methods suffer from the expensive computational cost. Recent 2-D projection-based methods, including range-view (RV), bird-eye-view (BEV), and multiview fusion methods, can run in real time, but the information loss during the projection leads to the low accuracy. Also, we find that the occlusion and interlacing problems exist in single projection-based methods and most multiview fusion networks only focus on the output-level fusion. Considering the above issues, we propose a multilevel multiview fusion network using attention modules and hierarchical transformer, which ensures the effectiveness and efficiency mainly by the following three aspects: 1) the spatial-channel attention module (SCAM) integrates contextual information between points and learn differences of each channel's features; 2) the proposed geometry-based multiprojection fusion module (GMFM) achieves the geometric feature alignment between RV and BEV and fuses the features of the two views at both feature level and output level; and 3) we introduce KPConv to replace KNN, which can reduce the information loss during the postprocessing. Experiments are conducted on both structured and unstructured datasets, including urban dataset SemanticKITTI and off-road dataset Rellis3D. Our results achieve a better performance compared to other projection-based methods and are comparable with the state-of-the-art Cylinder3D.
引用
收藏
页码:31461 / 31470
页数:10
相关论文
共 50 条
  • [21] Graph Transformer for 3D point clouds classification and semantic segmentation
    Zhou, Wei
    Wang, Qian
    Jin, Weiwei
    Shi, Xinzhe
    He, Ying
    COMPUTERS & GRAPHICS-UK, 2024, 124
  • [22] Point cloud semantic segmentation with adaptive spatial structure graph transformer
    Han, Ting
    Chen, Yiping
    Ma, Jin
    Liu, Xiaoxue
    Zhang, Wuming
    Zhang, Xinchang
    Wang, Huajuan
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2024, 133
  • [23] Semantic segmentation feature fusion network based on transformer
    Li, Tianping
    Cui, Zhaotong
    Zhang, Hua
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [24] TGNet: Geometric Graph CNN on 3-D Point Cloud Segmentation
    Li, Ying
    Ma, Lingfei
    Zhong, Zilong
    Cao, Dongpu
    Li, Jonathan
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2020, 58 (05): : 3588 - 3600
  • [25] PReFormer: A memory-efficient transformer for point cloud semantic segmentation
    Akwensi, Perpetual Hope
    Wang, Ruisheng
    Guo, Bo
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2024, 128
  • [26] Transformer fusion for indoor RGB-D semantic segmentation
    Wu, Zongwei
    Zhou, Zhuyun
    Allibert, Guillaume
    Stolz, Christophe
    Demonceaux, Cedric
    Ma, Chao
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 249
  • [27] U-shaped network based on Transformer for 3D point clouds semantic segmentation
    Zhang, Jiazhe
    Li, Xingwei
    Zhao, Xianfa
    Ge, Yizhi
    Zhang, Zheng
    2021 THE 5TH INTERNATIONAL CONFERENCE ON VIDEO AND IMAGE PROCESSING, ICVIP 2021, 2021, : 170 - 176
  • [28] Fuzzy Neighborhood Learning for Deep 3-D Segmentation of Point Cloud
    Zhong, Mingyang
    Li, Chaojie
    Liu, Liangchen
    Wen, Jiahui
    Ma, Jingwei
    Yu, Xinghuo
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2020, 28 (12) : 3181 - 3192
  • [29] Semantic segmentation of 3D point cloud based on self-attention feature fusion group convolutional neural network
    Yang J.
    Li B.
    Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2022, 30 (07): : 840 - 853
  • [30] Point Cloud Semantic Segmentation Network Based on Multi-Scale Feature Fusion
    Du, Jing
    Jiang, Zuning
    Huang, Shangfeng
    Wang, Zongyue
    Su, Jinhe
    Su, Songjian
    Wu, Yundong
    Cai, Guorong
    SENSORS, 2021, 21 (05) : 1 - 20