DFAMNet: dual fusion attention multi-modal network for semantic segmentation on LiDAR point clouds

被引：0

作者：

Mingjie Li

Gaihua Wang

Minghao Zhu

Chunzheng Li

Hong Liu

Xuran Pan

Qian Long

机构：

[1] Hubei University of Technology,School of Electrical and Elctronic Engineering

[2] Tianjin University of Science and Technology,College of Artificial Intelligence

来源：

Applied Intelligence | 2024年 / 54卷

关键词：

Semantic segmentation; Multi-modal; Pseudo point cloud; Point cloud;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Semantic segmentation of outdoor point clouds is an important task in the field of computer vision, aiming to classify outdoor point cloud data into different semantic categories. The methods based on pure point cloud have some shortcomings, such as incomplete information and difficulty in processing incomplete data. In the paper, it proposes pseudo point cloud method to align image with point cloud. The image features are extracted through a 2D network, and then the point cloud is mapped onto the image to obtain the corresponding pixel features, forming the pseudo point cloud. Then the dual fusion attention mechanism is designed to fuse the features of point cloud and pseudo point cloud. It improves the efficiency of the fusion network. The experimental results show that this method outperforms existing methods on the large-scale SemanticKITTI benchmark and achieves third place performance on the NuScenes benchmark. Code is available at https://github.com/Pdsn5/DFAMNet.

引用

页码：3169 / 3180

页数：11

共 50 条

[1] DFAMNet: dual fusion attention multi-modal network for semantic segmentation on LiDAR point clouds
Li, Mingjie
Wang, Gaihua
Zhu, Minghao
Li, Chunzheng
Liu, Hong
Pan, Xuran
Long, Qian
APPLIED INTELLIGENCE, 2024, 54 (04) : 3169 - 3180
[2] Dual fusion network for semantic segmentation of point clouds *
Lu, Jian
Guo, Huihui
Jia, Xurui
Wu, Jiatong
Chen, Xiaogai
OPTICS AND LASERS IN ENGINEERING, 2024, 177
[3] Application of Multi-modal Fusion Attention Mechanism in Semantic Segmentation
Liu, Yunlong
Yoshie, Osamu
Watanabe, Hiroshi
COMPUTER VISION - ACCV 2022, PT VII, 2023, 13847 : 378 - 397
[4] Dual-Attention Deep Fusion Network for Multi-modal Medical Image Segmentation
Zheng, Shenhai
Ye, Xin
Tan, Jiaxin
Yang, Yifei
Li, Laquan
FOURTEENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING, ICGIP 2022, 2022, 12705
[5] TAG-fusion: Two-stage attention guided multi-modal fusion network for semantic segmentation
Zhang, Zhizhou
Wang, Wenwu
Zhu, Lei
Tang, Zhibin
DIGITAL SIGNAL PROCESSING, 2025, 156
[6] Pseudo Multi-Modal Approach to LiDAR Semantic Segmentation
Kim, Kyungmin
SENSORS, 2024, 24 (23)
[7] EISNet: A Multi-Modal Fusion Network for Semantic Segmentation With Events and Images
Xie, Bochen
Deng, Yongjian
Shao, Zhanpeng
Li, Youfu
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 8639 - 8650
[8] A Tri-Attention fusion guided multi-modal segmentation network
Zhou, Tongxue
Ruan, Su
Vera, Pierre
Canu, Stephane
PATTERN RECOGNITION, 2022, 124
[9] Imbalance knowledge-driven multi-modal network for land-cover semantic segmentation using aerial images and LiDAR point clouds
Wang, Yameng
Wan, Yi
Zhang, Yongjun
Zhang, Bin
Gao, Zhi
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2023, 202 : 385 - 404
[10] Local Fusion Attention Network for Semantic Segmentation of Building Facade Point Clouds
Su, Yanfei
Liu, Weiquan
Cheng, Ming
Yuan, Zhimin
Wang, Cheng
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19

← 1 2 3 4 5 →