MSGFusion: Muti-scale Semantic Guided LiDAR-Camera Fusion for 3D Object Detection

被引:0
|
作者
Zhu, Huming [1 ]
Xue, Yiyu [1 ]
Cheng, Xinyue [1 ]
Hou, Biao [1 ]
机构
[1] Xidian Univ, Sch Artificial Intelligence, Xian, Peoples R China
来源
2024 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN 2024 | 2024年
关键词
3D object detection; multi-modal fusion; multiscale; semantic segmentation;
D O I
10.1109/IJCNN60899.2024.10651407
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
3D object detection is a key technology in automatic driving perception, which can provide the basis for safe and reliable autonomous driving. Aiming at the problem of false positive of low resolution object in point clouds, we present Multi-scale Semantic Guided LiDAR-Camera Fusion for 3D Object Detection(MSGFusion), which deeply fuses the features of image and LiDAR points. Specifically, we design multi-scale DenseFusion, which serially aggregate images features, pointwise features and voxel-wise feature volumes at different scales. At the same time, we design a new Image-based Predicted Keypoint Weighting(I-PKW). It predicts the object points based on the predicted foreground score map. Given the 3D proposals generated by the voxel CNN, we propose RoI-Pillar pooling. It abstracts the feature by aggregating the keypoints in the RoI by pillars. Compared with RoI-grid pooling, pillar-based feature encoding is more consistent with the distribution of fused feature keypoints to accurately regress the classification confidence and bounding box. Extensive experiments on the KITTI dataset show the superiority of MSGFusion.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] LiDAR-camera fusion: Dual transformer enhancement for 3D object detection
    Chen, Mu
    Liu, Pengfei
    Zhao, Huaici
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 120
  • [2] Rethinking the Late Fusion of LiDAR-Camera Based 3D Object Detection
    Yu, Lehang
    Zhang, Jing
    Liu, Zhong
    Yue, Haosong
    Chen, Weihai
    2024 IEEE 19TH CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, ICIEA 2024, 2024,
  • [3] URFormer: Unified Representation LiDAR-Camera 3D Object Detection with Transformer
    Zhang, Guoxin
    Xie, Jun
    Liu, Lin
    Wang, Zhepeng
    Yang, Kuihe
    Song, Ziying
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT III, 2024, 14427 : 401 - 413
  • [4] Robust LiDAR-Camera 3-D Object Detection With Object-Level Feature Fusion
    Chen, Yongxiang
    Yan, Fuwu
    Yin, Zhishuai
    Nie, Linzhen
    Tao, Bo
    Miao, Mingze
    Zheng, Ningyu
    Zhang, Pei
    Zeng, Junyuan
    IEEE SENSORS JOURNAL, 2024, 24 (18) : 29108 - 29120
  • [5] CLFusion:3D Semantic Segmentation Based on Camera and Lidar Fusion
    Wang, Tianyue
    Song, Rujun
    Xiao, Zhuoling
    Yan, Bo
    Qin, Haojie
    He, Di
    2024 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS 2024, 2024,
  • [6] Building and optimization of 3D semantic map based on Lidar and camera fusion
    Li, Jing
    Zhang, Xin
    Li, Jiehao
    Liu, Yanyu
    Wang, Junzheng
    NEUROCOMPUTING, 2020, 409 : 394 - 407
  • [7] LXL: LiDAR Excluded Lean 3D Object Detection With 4D Imaging Radar and Camera Fusion
    Xiong, Weiyi
    Liu, Jianan
    Huang, Tao
    Han, Qing-Long
    Xia, Yuxuan
    Zhu, Bing
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (01): : 79 - 92
  • [8] Radar-camera fusion for 3D object detection with aggregation transformer
    Li, Jun
    Zhang, Han
    Wu, Zizhang
    Xu, Tianhao
    APPLIED INTELLIGENCE, 2024, 54 (21) : 10627 - 10639
  • [9] LXLv2: Enhanced LiDAR Excluded Lean 3D Object Detection with Fusion of 4D Radar and Camera
    Xiong, Weiyi
    Zou, Zean
    Zhao, Qiuchi
    He, Fengchun
    Zhu, Bing
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (03): : 2862 - 2869
  • [10] A Multi-Phase Camera-LiDAR Fusion Network for 3D Semantic Segmentation With Weak Supervision
    Chang, Xuepeng
    Pan, Huihui
    Sun, Weichao
    Gao, Huijun
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (08) : 3737 - 3746