Filter Fusion: Camera-LiDAR Filter Fusion for 3-D Object Detection With a Robust Fused Head

被引:0
作者
Xu, Yaming [1 ]
Li, Boliang [1 ]
Wang, Yan [1 ]
Cui, Yihan [2 ]
机构
[1] Harbin Inst Technol, Sch Astronaut, Harbin 150006, Heilongjiang, Peoples R China
[2] Army Acad Armored Forces, Sergeant Sch, Changchun 130000, Peoples R China
关键词
Three-dimensional displays; Feature extraction; Object detection; Laser radar; Point cloud compression; Detectors; Cameras; Difference function; feature secondary filtering; filter fusion; robust fused head; visual fusion rotating platform;
D O I
10.1109/TIM.2024.3449944
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The different representations of images and point clouds make fusion difficult, resulting in the suboptimal performance of 3-D object detection methods. We propose a camera-light detection and ranging (LiDAR) filter fusion framework for 3-D object detection based on feature secondary filtering. This framework uses two uncoupled object detection structures to extract images and point features and a robust camera-LiDAR fused head to fuse features from multisource heterogeneous sensors. Unlike previous work, we propose a novel four-stage fusion strategy to fully use unique features extracted from two uncoupled 3-D object detectors. Our network fully extracts heterostructural features through dedicated detectors, which makes the extracted information more sufficient, especially for smaller objects. In addition, we propose a difference function for more efficient fusion of independent features from uncoupled object extractors. We mathematically prove the validity of the robust fused head and verify the effectiveness of our filter fusion framework in a test scene and on the KITTI dataset, particularly in KITTI pedestrian detection. The code is available at: https://github.com/xuminglei-hit/FilterFusion
引用
收藏
页数:12
相关论文
共 49 条
  • [11] PointPillars: Fast Encoders for Object Detection from Point Clouds
    Lang, Alex H.
    Vora, Sourabh
    Caesar, Holger
    Zhou, Lubing
    Yang, Jiong
    Beijbom, Oscar
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 12689 - 12697
  • [12] Li P., 2020, ECCV, P644, DOI DOI 10.1007/978-3-030-58580-838
  • [13] Contextual Transformer Networks for Visual Recognition
    Li, Yehao
    Yao, Ting
    Pan, Yingwei
    Mei, Tao
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (02) : 1489 - 1500
  • [14] DeepFusion: Lidar-Camera Deep Fusion for Multi-Modal 3D Object Detection
    Li, Yingwei
    Yu, Adams Wei
    Meng, Tianjian
    Caine, Ben
    Ngiam, Jiquan
    Peng, Daiyi
    Shen, Junyang
    Lu, Yifeng
    Zhou, Denny
    Le, Quoc, V
    Yuille, Alan
    Tan, Mingxing
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 17161 - 17170
  • [15] SMOKE: Single-Stage Monocular 3D Object Detection via Keypoint Estimation
    Liu, Zechen
    Wu, Zizhang
    Toth, Roland
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 4289 - 4298
  • [16] Delving into Localization Errors for Monocular 3D Object Detection
    Ma, Xinzhu
    Zhang, Yinmin
    Xu, Dan
    Zhou, Dongzhan
    Yi, Shuai
    Li, Haojie
    Ouyang, Wanli
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 4719 - 4728
  • [17] Ouyang J., 2022, IEEE Trans. Instrum. Meas., V71, P1
  • [18] Fast-CLOCs: Fast Camera-LiDAR Object Candidates Fusion for 3D Object Detection
    Pang, Su
    Morris, Daniel
    Radha, Hayder
    [J]. 2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 3747 - 3756
  • [19] CLOCs: Camera-LiDAR Object Candidates Fusion for 3D Object Detection
    Pang, Su
    Morris, Daniel
    Radha, Hayder
    [J]. 2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 10386 - 10393
  • [20] Qi C. R., 2017, PROC CVPR IEEE, P652