Semantic segmentation via fusing 2D image and 3D point cloud data with shared multi-layer perceptron

被引：0

作者：

Zhao, Xueqiang ^{[1
,2
]}

Wang, Jiancheng ^{[2
]}

Wu, Zheng ^{[2
]}

Chen, Yangbo ^{[1
]}

机构：

[1] Sun Yat Sen Univ, Sch Geog & Planning, 132 Outer Ring East Rd, Guangzhou 510275, Peoples R China

[2] China Water Resources Pearl River Planning Surveyi, Guangzhou, Peoples R China

来源：

INTERNATIONAL JOURNAL OF REMOTE SENSING | 2025年 / 46卷 / 04期

关键词：

3D point cloud; semantic segmentation; autonomous driving; point cloud classification; knowledge distillation;

D O I：

10.1080/01431161.2024.2439082

中图分类号：

TP7 [遥感技术];

学科分类号：

081102 ; 0816 ; 081602 ; 083002 ; 1404 ;

摘要：

Three-dimensional (3D) semantic segmentation based on point clouds has become a critical technology in the field of intelligent spatial perception and scene understanding. However, most existing methods cannot fully exploit the spatial feature of 3D point cloud data and the orderliness of two-dimensional (2D) image information, resulting in an inability to effectively improve segmentation accuracy. This study proposes a novel 2D image and 3D point cloud fusion method named shared multi-layer perceptron fusion semantic segmentation (SMFusionSeg) for semantic segmentation. The rich features of 2D image segmentation and spatial information of 3D point clouds are integrated by designing a fusion architecture that facilitates the transfer of 2D semantics to 3D semantics. Specifically, the fusion architecture concatenates the features from both 2D and 3D domains, which are then further integrated through a shared multi-layer perceptron (MLP) and optimized using an attention mechanism to enhance feature effectiveness and distinctiveness. Moreover, a knowledge distillation strategy is introduced to improve the learning capability and segmentation accuracy in parsing complex scenes. Experimental results on the SemanticKITTI dataset demonstrate that the proposed SMFusionSeg achieves a mean Intersection over Union ($mIoU$mIoU) of 65.4%, significantly outperforming traditional single-modal methods and many existing segmentation techniques, thereby effectively enhancing the precision and robustness of geographic feature recognition.

引用

页码：1720 / 1741

页数：22

共 39 条

[1] Colistin and its role in the Era of antibiotic resistance: an extended review (2000-2019)
Ahmed, Mohamed Abd El-Gawad El-Sayed
Zhong, Lan-Lan
Shen, Cong
Yang, Yongqiang
Doi, Yohei
Tian, Guo-Bao
[J]. EMERGING MICROBES & INFECTIONS, 2020, 9 (01) : 868 - 885
[2] LiDAR-BIND: Multi-Modal Sensor Fusion Through Shared Latent Embeddings
Balemans, Niels
Anwar, Ali
Steckel, Jan
Mercelis, Siegfried
[J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (11): : 9159 - 9166
[3] SemanticKITTI: A Dataset for Semantic Scene Understanding of LiDAR Sequences
Behley, Jens
Garbade, Martin
Milioto, Andres
Quenzel, Jan
Behnke, Sven
Stachniss, Cyrill
Gall, Juergen
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 9296 - 9306
[4] Multi-View 3D Object Detection Network for Autonomous Driving
Chen, Xiaozhi
Ma, Huimin
Wan, Ji
Li, Bo
Xia, Tian
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6526 - 6534
[5] DSConv: Efficient Convolution Operator
do Nascimento, Marcelo Gennari
Fawcett, Roger
Prisacariu, Victor Adrian
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 5147 - 5156
[6] Large-scale estimation of change in aboveground biomass in miombo woodlands using airborne laser scanning and national forest inventory data
Ene, Liviu Theodor
Naesset, Erik
Gobakken, Terje
Bollandsas, Ole Martin
Mauya, Ernest William
Zahabu, Eliakimu
[J]. REMOTE SENSING OF ENVIRONMENT, 2017, 188 : 106 - 117
[7] Guo YH, 2019, AAAI CONF ARTIF INTE, P8368
[8] FAST SEMANTIC SEGMENTATION OF 3D POINT CLOUDS WITH STRONGLY VARYING DENSITY
Hackel, Timo
Wegner, Jan D.
Schindler, Konrad
[J]. XXIII ISPRS CONGRESS, COMMISSION III, 2016, 3 (03): : 177 - 184
[9] Occlusion Is Underrated: An Occlusion Attention Strategy Assembled in 3-D Object Detectors
He, Yufei
Wu, Yan
Mo, Yujian
Hu, Yinghao
Zhang, Yuwei
Wang, Jijun
[J]. IEEE SENSORS JOURNAL, 2024, 24 (10) : 16502 - 16509
[10] Hu J, 2018, PROC CVPR IEEE, P7132, DOI [10.1109/CVPR.2018.00745, 10.1109/TPAMI.2019.2913372]

← 1 2 3 4 →