Multiview Fusion Driven 3-D Point Cloud Semantic Segmentation Based on Hierarchical Transformer

被引:8
|
作者
Xu, Wang [1 ]
Li, Xu [1 ]
Ni, Peizhou [1 ]
Guang, Xingxing [2 ,3 ]
Luo, Hang [2 ,3 ]
Zhao, Xijun [2 ,3 ]
机构
[1] Southeast Univ, Sch Instrument Sci & Engn, Nanjing 210096, Peoples R China
[2] China North Artificial Intelligence & Innovat Res, Beijing 100072, Peoples R China
[3] Collective Intelligence & Collaborat Lab CIC, Beijing 100072, Peoples R China
关键词
3-D point cloud; multihead attention; multiview fusion; semantic segmentation;
D O I
10.1109/JSEN.2023.3328603
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Three-dimensional semantic segmentation is a key task of environment understanding in various outdoor scenes. Due to the sparsity and varying density of point clouds, it becomes challenging to obtain fine-gained segmentation results. Previous point-based and voxel-based methods suffer from the expensive computational cost. Recent 2-D projection-based methods, including range-view (RV), bird-eye-view (BEV), and multiview fusion methods, can run in real time, but the information loss during the projection leads to the low accuracy. Also, we find that the occlusion and interlacing problems exist in single projection-based methods and most multiview fusion networks only focus on the output-level fusion. Considering the above issues, we propose a multilevel multiview fusion network using attention modules and hierarchical transformer, which ensures the effectiveness and efficiency mainly by the following three aspects: 1) the spatial-channel attention module (SCAM) integrates contextual information between points and learn differences of each channel's features; 2) the proposed geometry-based multiprojection fusion module (GMFM) achieves the geometric feature alignment between RV and BEV and fuses the features of the two views at both feature level and output level; and 3) we introduce KPConv to replace KNN, which can reduce the information loss during the postprocessing. Experiments are conducted on both structured and unstructured datasets, including urban dataset SemanticKITTI and off-road dataset Rellis3D. Our results achieve a better performance compared to other projection-based methods and are comparable with the state-of-the-art Cylinder3D.
引用
收藏
页码:31461 / 31470
页数:10
相关论文
共 50 条
  • [41] Tinto: Multisensor Benchmark for 3-D Hyperspectral Point Cloud Segmentation in the Geosciences
    Afifi, Ahmed J.
    Thiele, Samuel T.
    Rizaldy, Aldino
    Lorenz, Sandra
    Ghamisi, Pedram
    Tolosana-Delgado, Raimon
    Kirsch, Moritz
    Gloaguen, Richard
    Heizmann, Michael
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 15
  • [42] DLAFNET: A DIRECT FUSION METHOD OF 2D AERIAL IMAGE AND 3D LIDAR POINT CLOUD FOR SEMANTIC SEGMENTATION
    Liu, Wei
    Wang, He
    Qiao, Yicheng
    Liang, Bin
    Yang, Junli
    Zhang, Haopeng
    IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 5922 - 5925
  • [43] Hierarchical SVM for Semantic Segmentation of 3D Point Clouds for Infrastructure Scenes
    Mansour, Mohamed
    Martens, Jan
    Blankenbach, Joerg
    INFRASTRUCTURES, 2024, 9 (05)
  • [44] Win-Former: Window-Based Transformer for Maize Plant Point Cloud Semantic Segmentation
    Sun, Yu
    Guo, Xindong
    Yang, Hua
    AGRONOMY-BASEL, 2023, 13 (11):
  • [45] Crossmodal Few-shot 3D Point Cloud Semantic Segmentation
    Zhao, Ziyu
    Wu, Zhenyao
    Wu, Xinyi
    Zhang, Canyu
    Wang, Song
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 4760 - 4768
  • [46] Annotation Tool and Urban Dataset for 3D Point Cloud Semantic Segmentation
    Ibrahim, Muhammad
    Akhtar, Naveed
    Wise, Michael
    Mian, Ajmal
    IEEE ACCESS, 2021, 9 : 35984 - 35996
  • [47] 3d indoor point cloud semantic segmentation using image and voxel
    Yeom S.-S.
    Ha J.-E.
    Ha, Jong-Eun (jeha@seoultech.ac.kr), 1600, Institute of Control, Robotics and Systems (27): : 1000 - 1007
  • [48] SHREC 2020: 3D point cloud semantic segmentation for street scenes
    Ku, Tao
    Veltkamp, Remco C.
    Boom, Bas
    Duque-Arias, David
    Velasco-Forero, Santiago
    Deschaud, Jean-Emmanuel
    Goulette, Francois
    Marcotegui, Beatriz
    Ortega, Sebastian
    Trujillo, Agustin
    Pablo Suarez, Jose
    Miguel Santana, Jose
    Ramirez, Cristian
    Akadas, Kiran
    Gangisetty, Shankar
    COMPUTERS & GRAPHICS-UK, 2020, 93 : 13 - 24
  • [49] A LiDAR point cloud hierarchical semantic segmentation method combining CNN and MRF
    Jiang T.
    Wang Y.
    Zhang L.
    Liang C.
    Sun J.
    Cehui Xuebao/Acta Geodaetica et Cartographica Sinica, 2021, 50 (02): : 215 - 225
  • [50] Transformer based 3D semantic segmentation of urban bicycle infrastructure
    Niedermueller, Armin
    Beeking, Moritz
    JOURNAL OF LOCATION BASED SERVICES, 2024,