Multiview Fusion Driven 3-D Point Cloud Semantic Segmentation Based on Hierarchical Transformer

被引:8
|
作者
Xu, Wang [1 ]
Li, Xu [1 ]
Ni, Peizhou [1 ]
Guang, Xingxing [2 ,3 ]
Luo, Hang [2 ,3 ]
Zhao, Xijun [2 ,3 ]
机构
[1] Southeast Univ, Sch Instrument Sci & Engn, Nanjing 210096, Peoples R China
[2] China North Artificial Intelligence & Innovat Res, Beijing 100072, Peoples R China
[3] Collective Intelligence & Collaborat Lab CIC, Beijing 100072, Peoples R China
关键词
3-D point cloud; multihead attention; multiview fusion; semantic segmentation;
D O I
10.1109/JSEN.2023.3328603
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Three-dimensional semantic segmentation is a key task of environment understanding in various outdoor scenes. Due to the sparsity and varying density of point clouds, it becomes challenging to obtain fine-gained segmentation results. Previous point-based and voxel-based methods suffer from the expensive computational cost. Recent 2-D projection-based methods, including range-view (RV), bird-eye-view (BEV), and multiview fusion methods, can run in real time, but the information loss during the projection leads to the low accuracy. Also, we find that the occlusion and interlacing problems exist in single projection-based methods and most multiview fusion networks only focus on the output-level fusion. Considering the above issues, we propose a multilevel multiview fusion network using attention modules and hierarchical transformer, which ensures the effectiveness and efficiency mainly by the following three aspects: 1) the spatial-channel attention module (SCAM) integrates contextual information between points and learn differences of each channel's features; 2) the proposed geometry-based multiprojection fusion module (GMFM) achieves the geometric feature alignment between RV and BEV and fuses the features of the two views at both feature level and output level; and 3) we introduce KPConv to replace KNN, which can reduce the information loss during the postprocessing. Experiments are conducted on both structured and unstructured datasets, including urban dataset SemanticKITTI and off-road dataset Rellis3D. Our results achieve a better performance compared to other projection-based methods and are comparable with the state-of-the-art Cylinder3D.
引用
收藏
页码:31461 / 31470
页数:10
相关论文
共 50 条
  • [31] A Random Fusion of Mix3D and PolarMix to Improve Semantic Segmentation Performance in 3D Lidar Point Cloud
    Liu, Bo
    Feng, Li
    Chen, Yufeng
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2024, 140 (01): : 845 - 862
  • [32] Semantic Context Encoding for Accurate 3D Point Cloud Segmentation
    Liu, Hao
    Guo, Yulan
    Ma, Yanni
    Lei, Yinjie
    Wen, Gongjian
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 2045 - 2055
  • [33] 3D point cloud semantic segmentation: state of the art and challenges
    Wang Y.
    Hu Y.
    Kong Q.
    Zeng H.
    Zhang L.
    Fan B.
    Gongcheng Kexue Xuebao/Chinese Journal of Engineering, 2023, 45 (10): : 1653 - 1664
  • [34] Subdivision of Adjacent Areas for 3D Point Cloud Semantic Segmentation
    Xu, Haixia
    Hu, Kaiyu
    Xu, Yuting
    Zhu, Jiang
    SIGNAL IMAGE AND VIDEO PROCESSING, 2025, 19 (01)
  • [35] Regional-to-Local Point-Voxel Transformer for Large-Scale Indoor 3D Point Cloud Semantic Segmentation
    Li, Shuai
    Li, Hongjun
    REMOTE SENSING, 2023, 15 (19)
  • [36] MATNet: Semantic segmentation of 3D point clouds with multiscale adaptive transformer
    Zheng, Yufei
    Lu, Jian
    Chen, Xiaogai
    Zhang, Kaibing
    Zhou, Jian
    COMPUTERS & ELECTRICAL ENGINEERING, 2024, 119
  • [37] Transfer Learning Based Semantic Segmentation for 3D Object Detection from Point Cloud
    Imad, Muhammad
    Doukhi, Oualid
    Lee, Deok-Jin
    SENSORS, 2021, 21 (12)
  • [38] Fast Semantic Segmentation of 3D Lidar Point Cloud Based on Random Forest Method
    Jiang, Songdi
    Guo, Wei
    Fan, Yuzhi
    Fu, Haiyang
    CHINA SATELLITE NAVIGATION CONFERENCE PROCEEDINGS, CSNC 2022, VOL II, 2022, 909 : 415 - 424
  • [39] Semantic segmentation of 3D point cloud based on boundary point estimation and sparse convolution neural network
    Yang J.
    Zhang C.
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2024, 58 (06): : 1121 - 1132
  • [40] FusionFormer: An Off-Road Sence Semantic Segmentation Network Based on Data Fusion and Hierarchical Transformer
    Duan, AnZhi
    Ma, Yue
    Wang, YunFeng
    PROCEEDINGS OF 2024 CHINESE INTELLIGENT SYSTEMS CONFERENCE, VOL 3, CISC 2024, 2024, 1285 : 75 - 83