Semantic segmentation via fusing 2D image and 3D point cloud data with shared multi-layer perceptron

被引:0
作者
Zhao, Xueqiang [1 ,2 ]
Wang, Jiancheng [2 ]
Wu, Zheng [2 ]
Chen, Yangbo [1 ]
机构
[1] Sun Yat Sen Univ, Sch Geog & Planning, 132 Outer Ring East Rd, Guangzhou 510275, Peoples R China
[2] China Water Resources Pearl River Planning Surveyi, Guangzhou, Peoples R China
关键词
3D point cloud; semantic segmentation; autonomous driving; point cloud classification; knowledge distillation;
D O I
10.1080/01431161.2024.2439082
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
Three-dimensional (3D) semantic segmentation based on point clouds has become a critical technology in the field of intelligent spatial perception and scene understanding. However, most existing methods cannot fully exploit the spatial feature of 3D point cloud data and the orderliness of two-dimensional (2D) image information, resulting in an inability to effectively improve segmentation accuracy. This study proposes a novel 2D image and 3D point cloud fusion method named shared multi-layer perceptron fusion semantic segmentation (SMFusionSeg) for semantic segmentation. The rich features of 2D image segmentation and spatial information of 3D point clouds are integrated by designing a fusion architecture that facilitates the transfer of 2D semantics to 3D semantics. Specifically, the fusion architecture concatenates the features from both 2D and 3D domains, which are then further integrated through a shared multi-layer perceptron (MLP) and optimized using an attention mechanism to enhance feature effectiveness and distinctiveness. Moreover, a knowledge distillation strategy is introduced to improve the learning capability and segmentation accuracy in parsing complex scenes. Experimental results on the SemanticKITTI dataset demonstrate that the proposed SMFusionSeg achieves a mean Intersection over Union ($mIoU$mIoU) of 65.4%, significantly outperforming traditional single-modal methods and many existing segmentation techniques, thereby effectively enhancing the precision and robustness of geographic feature recognition.
引用
收藏
页码:1720 / 1741
页数:22
相关论文
共 39 条
  • [1] Colistin and its role in the Era of antibiotic resistance: an extended review (2000-2019)
    Ahmed, Mohamed Abd El-Gawad El-Sayed
    Zhong, Lan-Lan
    Shen, Cong
    Yang, Yongqiang
    Doi, Yohei
    Tian, Guo-Bao
    [J]. EMERGING MICROBES & INFECTIONS, 2020, 9 (01) : 868 - 885
  • [2] LiDAR-BIND: Multi-Modal Sensor Fusion Through Shared Latent Embeddings
    Balemans, Niels
    Anwar, Ali
    Steckel, Jan
    Mercelis, Siegfried
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (11): : 9159 - 9166
  • [3] SemanticKITTI: A Dataset for Semantic Scene Understanding of LiDAR Sequences
    Behley, Jens
    Garbade, Martin
    Milioto, Andres
    Quenzel, Jan
    Behnke, Sven
    Stachniss, Cyrill
    Gall, Juergen
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 9296 - 9306
  • [4] Multi-View 3D Object Detection Network for Autonomous Driving
    Chen, Xiaozhi
    Ma, Huimin
    Wan, Ji
    Li, Bo
    Xia, Tian
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6526 - 6534
  • [5] DSConv: Efficient Convolution Operator
    do Nascimento, Marcelo Gennari
    Fawcett, Roger
    Prisacariu, Victor Adrian
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 5147 - 5156
  • [6] Large-scale estimation of change in aboveground biomass in miombo woodlands using airborne laser scanning and national forest inventory data
    Ene, Liviu Theodor
    Naesset, Erik
    Gobakken, Terje
    Bollandsas, Ole Martin
    Mauya, Ernest William
    Zahabu, Eliakimu
    [J]. REMOTE SENSING OF ENVIRONMENT, 2017, 188 : 106 - 117
  • [7] Guo YH, 2019, AAAI CONF ARTIF INTE, P8368
  • [8] FAST SEMANTIC SEGMENTATION OF 3D POINT CLOUDS WITH STRONGLY VARYING DENSITY
    Hackel, Timo
    Wegner, Jan D.
    Schindler, Konrad
    [J]. XXIII ISPRS CONGRESS, COMMISSION III, 2016, 3 (03): : 177 - 184
  • [9] Occlusion Is Underrated: An Occlusion Attention Strategy Assembled in 3-D Object Detectors
    He, Yufei
    Wu, Yan
    Mo, Yujian
    Hu, Yinghao
    Zhang, Yuwei
    Wang, Jijun
    [J]. IEEE SENSORS JOURNAL, 2024, 24 (10) : 16502 - 16509
  • [10] Hu J, 2018, PROC CVPR IEEE, P7132, DOI [10.1109/CVPR.2018.00745, 10.1109/TPAMI.2019.2913372]