DFAMNet: dual fusion attention multi-modal network for semantic segmentation on LiDAR point clouds

被引:0
|
作者
Mingjie Li
Gaihua Wang
Minghao Zhu
Chunzheng Li
Hong Liu
Xuran Pan
Qian Long
机构
[1] Hubei University of Technology,School of Electrical and Elctronic Engineering
[2] Tianjin University of Science and Technology,College of Artificial Intelligence
来源
Applied Intelligence | 2024年 / 54卷
关键词
Semantic segmentation; Multi-modal; Pseudo point cloud; Point cloud;
D O I
暂无
中图分类号
学科分类号
摘要
Semantic segmentation of outdoor point clouds is an important task in the field of computer vision, aiming to classify outdoor point cloud data into different semantic categories. The methods based on pure point cloud have some shortcomings, such as incomplete information and difficulty in processing incomplete data. In the paper, it proposes pseudo point cloud method to align image with point cloud. The image features are extracted through a 2D network, and then the point cloud is mapped onto the image to obtain the corresponding pixel features, forming the pseudo point cloud. Then the dual fusion attention mechanism is designed to fuse the features of point cloud and pseudo point cloud. It improves the efficiency of the fusion network. The experimental results show that this method outperforms existing methods on the large-scale SemanticKITTI benchmark and achieves third place performance on the NuScenes benchmark. Code is available at https://github.com/Pdsn5/DFAMNet.
引用
收藏
页码:3169 / 3180
页数:11
相关论文
共 50 条
  • [1] DFAMNet: dual fusion attention multi-modal network for semantic segmentation on LiDAR point clouds
    Li, Mingjie
    Wang, Gaihua
    Zhu, Minghao
    Li, Chunzheng
    Liu, Hong
    Pan, Xuran
    Long, Qian
    APPLIED INTELLIGENCE, 2024, 54 (04) : 3169 - 3180
  • [2] Dual fusion network for semantic segmentation of point clouds *
    Lu, Jian
    Guo, Huihui
    Jia, Xurui
    Wu, Jiatong
    Chen, Xiaogai
    OPTICS AND LASERS IN ENGINEERING, 2024, 177
  • [3] Application of Multi-modal Fusion Attention Mechanism in Semantic Segmentation
    Liu, Yunlong
    Yoshie, Osamu
    Watanabe, Hiroshi
    COMPUTER VISION - ACCV 2022, PT VII, 2023, 13847 : 378 - 397
  • [4] Dual-Attention Deep Fusion Network for Multi-modal Medical Image Segmentation
    Zheng, Shenhai
    Ye, Xin
    Tan, Jiaxin
    Yang, Yifei
    Li, Laquan
    FOURTEENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING, ICGIP 2022, 2022, 12705
  • [5] TAG-fusion: Two-stage attention guided multi-modal fusion network for semantic segmentation
    Zhang, Zhizhou
    Wang, Wenwu
    Zhu, Lei
    Tang, Zhibin
    DIGITAL SIGNAL PROCESSING, 2025, 156
  • [6] Pseudo Multi-Modal Approach to LiDAR Semantic Segmentation
    Kim, Kyungmin
    SENSORS, 2024, 24 (23)
  • [7] EISNet: A Multi-Modal Fusion Network for Semantic Segmentation With Events and Images
    Xie, Bochen
    Deng, Yongjian
    Shao, Zhanpeng
    Li, Youfu
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 8639 - 8650
  • [8] A Tri-Attention fusion guided multi-modal segmentation network
    Zhou, Tongxue
    Ruan, Su
    Vera, Pierre
    Canu, Stephane
    PATTERN RECOGNITION, 2022, 124
  • [9] Imbalance knowledge-driven multi-modal network for land-cover semantic segmentation using aerial images and LiDAR point clouds
    Wang, Yameng
    Wan, Yi
    Zhang, Yongjun
    Zhang, Bin
    Gao, Zhi
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2023, 202 : 385 - 404
  • [10] Local Fusion Attention Network for Semantic Segmentation of Building Facade Point Clouds
    Su, Yanfei
    Liu, Weiquan
    Cheng, Ming
    Yuan, Zhimin
    Wang, Cheng
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19