Filter Fusion: Camera-LiDAR Filter Fusion for 3-D Object Detection With a Robust Fused Head

被引：0

作者：

Xu, Yaming ^{[1
]}

Li, Boliang ^{[1
]}

Wang, Yan ^{[1
]}

Cui, Yihan ^{[2
]}

机构：

[1] Harbin Inst Technol, Sch Astronaut, Harbin 150006, Heilongjiang, Peoples R China

[2] Army Acad Armored Forces, Sergeant Sch, Changchun 130000, Peoples R China

来源：

IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT | 2024年 / 73卷

关键词：

Three-dimensional displays; Feature extraction; Object detection; Laser radar; Point cloud compression; Detectors; Cameras; Difference function; feature secondary filtering; filter fusion; robust fused head; visual fusion rotating platform;

D O I：

10.1109/TIM.2024.3449944

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

The different representations of images and point clouds make fusion difficult, resulting in the suboptimal performance of 3-D object detection methods. We propose a camera-light detection and ranging (LiDAR) filter fusion framework for 3-D object detection based on feature secondary filtering. This framework uses two uncoupled object detection structures to extract images and point features and a robust camera-LiDAR fused head to fuse features from multisource heterogeneous sensors. Unlike previous work, we propose a novel four-stage fusion strategy to fully use unique features extracted from two uncoupled 3-D object detectors. Our network fully extracts heterostructural features through dedicated detectors, which makes the extracted information more sufficient, especially for smaller objects. In addition, we propose a difference function for more efficient fusion of independent features from uncoupled object extractors. We mathematically prove the validity of the robust fused head and verify the effectiveness of our filter fusion framework in a test scene and on the KITTI dataset, particularly in KITTI pedestrian detection. The code is available at: https://github.com/xuminglei-hit/FilterFusion

引用

页数：12

共 49 条

[11] PointPillars: Fast Encoders for Object Detection from Point Clouds
Lang, Alex H.
Vora, Sourabh
Caesar, Holger
Zhou, Lubing
Yang, Jiong
Beijbom, Oscar
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 12689 - 12697
[12] Li P., 2020, ECCV, P644, DOI DOI 10.1007/978-3-030-58580-838
[13] Contextual Transformer Networks for Visual Recognition
Li, Yehao
Yao, Ting
Pan, Yingwei
Mei, Tao
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (02) : 1489 - 1500
[14] DeepFusion: Lidar-Camera Deep Fusion for Multi-Modal 3D Object Detection
Li, Yingwei
Yu, Adams Wei
Meng, Tianjian
Caine, Ben
Ngiam, Jiquan
Peng, Daiyi
Shen, Junyang
Lu, Yifeng
Zhou, Denny
Le, Quoc, V
Yuille, Alan
Tan, Mingxing
[J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 17161 - 17170
[15] SMOKE: Single-Stage Monocular 3D Object Detection via Keypoint Estimation
Liu, Zechen
Wu, Zizhang
Toth, Roland
[J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 4289 - 4298
[16] Delving into Localization Errors for Monocular 3D Object Detection
Ma, Xinzhu
Zhang, Yinmin
Xu, Dan
Zhou, Dongzhan
Yi, Shuai
Li, Haojie
Ouyang, Wanli
[J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 4719 - 4728
[17] Ouyang J., 2022, IEEE Trans. Instrum. Meas., V71, P1
[18] Fast-CLOCs: Fast Camera-LiDAR Object Candidates Fusion for 3D Object Detection
Pang, Su
Morris, Daniel
Radha, Hayder
[J]. 2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 3747 - 3756
[19] CLOCs: Camera-LiDAR Object Candidates Fusion for 3D Object Detection
Pang, Su
Morris, Daniel
Radha, Hayder
[J]. 2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 10386 - 10393
[20] Qi C. R., 2017, PROC CVPR IEEE, P652

← 1 2 3 4 5 →