DeepFusion: A Robust and Modular 3D Object Detector for Lidars, Cameras and Radars

Cited by: 15

Authors:

Drews, Florian [1]

Feng, Di [1]

Faion, Florian [1]

Rosenbaum, Lars [1]

Ulrich, Michael [1]

Glaser, Claudius [1]

Affiliations:

[1] Robert Bosch GmbH, Corp Res, Stuttgart, Germany

Source:

2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2022

Keywords:

DOI:

10.1109/IROS47612.2022.9981778

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

We propose DeepFusion, a modular multi-modal architecture to fuse lidars, cameras and radars in different combinations for 3D object detection. Specialized feature extractors take advantage of each modality and can be exchanged easily, making the approach simple and flexible. Extracted features are transformed into bird's-eye-view as a common representation for fusion. Spatial and semantic alignment is performed prior to fusing modalities in the feature space. Finally, a detection head exploits rich multi-modal features for improved 3D detection performance. Experimental results for lidar-camera, lidar-camera-radar and camera-radar fusion show the flexibility and effectiveness of our fusion approach. In the process, we study the largely unexplored task of faraway car detection up to 225 meters, showing the benefits of our lidar-camera fusion. Furthermore, we investigate the required density of lidar points for 3D object detection and illustrate the implications using the example of robustness against adverse weather conditions. Moreover, ablation studies on our camera-radar fusion highlight the importance of accurate depth estimation.


Pages: 560-567

Page count: 8

53 references in total

[1] Seeing Through Fog Without Seeing Fog: Deep Multimodal Sensor Fusion in Unseen Adverse Weather [J].

Bijelic, Mario; Gruber, Tobias; Mannan, Fahim; Kraus, Florian; Ritter, Werner; Dietmayer, Klaus; Heide, Felix.

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020: 11679-11689

[2] nuScenes: A multimodal dataset for autonomous driving [J].

Caesar, Holger; Bankiti, Varun; Lang, Alex H.; Vora, Sourabh; Liong, Venice Erin; Xu, Qiang; Krishnan, Anush; Pan, Yu; Baldan, Giancarlo; Beijbom, Oscar.

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020: 11618-11628

[3] Chadwick S, 2019, IEEE INT CONF ROBOT, P8311, DOI 10.1109/ICRA.2019.8794312

[4] Multi-View 3D Object Detection Network for Autonomous Driving [J].

Chen, Xiaozhi; Ma, Huimin; Wan, Ji; Li, Bo; Xia, Tian.

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017: 6526-6534

[5] Feng D, 2020, IEEE INT VEH SYM, P871, DOI 10.1109/IV47402.2020.9304551

[6] Deep Multi-Modal Object Detection and Semantic Segmentation for Autonomous Driving: Datasets, Methods, and Challenges [J].

Feng, Di; Haase-Schutz, Christian; Rosenbaum, Lars; Hertlein, Heinz; Glaser, Claudius; Timm, Fabian; Wiesbeck, Werner; Dietmayer, Klaus.

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22(3): 1341-1360

[7] Geiger A, 2012, PROC CVPR IEEE, P3354, DOI 10.1109/CVPR.2012.6248074

[8] He KM, 2017, IEEE I CONF COMP VIS, P2980, DOI [10.1109/ICCV.2017.322, 10.1109/TPAMI.2018.2844175]

[9] Hendy N., 2020, IEEE C COMPUTER VISI

[10] Huang JJ, 2022, Arxiv, DOI arXiv:2112.11790
