Feature Map Transformation for Multi-sensor Fusion in Object Detection Networks for Autonomous Driving

被引:0
作者
Schroder, Enrico [1 ]
Braun, Sascha [1 ]
Mahlisch, Mirko [1 ]
Vitay, Julien [2 ]
Hamker, Fred [2 ]
机构
[1] AUDI AG, Dept Dev Sensor Data Fus & Localisat, D-85045 Ingolstadt, Germany
[2] Tech Univ Chemnitz, Dept Comp Sci, Str Nationen, Chemnitz, Germany
来源
ADVANCES IN COMPUTER VISION, VOL 2 | 2020年 / 944卷
关键词
Autonomous driving; Perception; Sensor fusion; Object detection; Lidar;
D O I
10.1007/978-3-030-17798-0_12
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a general framework for fusing pre-trained object detection networks for multiple sensor modalities in autonomous cars at an intermediate stage. The key innovation is an autoencoder-inspired Transformer module which transforms perspective as well as feature activation characteristics from one sensor modality to another. Transformed feature maps can be combined with those of a modality-native feature extractor to enhance performance and reliability through a simple fusion scheme. Our approach is not limited to specific object detection network types. Compared to other methods, our framework allows fusion of pre-trained object detection networks and fuses sensor modalities at a single stage, resulting in a modular and traceable architecture. We show effectiveness of the proposed scheme by fusing camera and Lidar information to detect objects using our own as well as the KITTI dataset.
引用
收藏
页码:118 / 131
页数:14
相关论文
共 13 条
[1]  
[Anonymous], 2015, P ADV NEUR INF PROC
[2]  
[Anonymous], 2014, Comput. Sci.
[3]  
[Anonymous], ARXIV161110012CS
[4]  
[Anonymous], ARXIV171202294CS
[5]  
[Anonymous], ARXIV180306199CS
[6]  
[Anonymous], 2010, International journal of computer vision, DOI DOI 10.1007/s11263-009-0275-4
[7]   Multi-View 3D Object Detection Network for Autonomous Driving [J].
Chen, Xiaozhi ;
Ma, Huimin ;
Wan, Ji ;
Li, Bo ;
Xia, Tian .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6526-6534
[8]  
Geiger A, 2012, PROC CVPR IEEE, P3354, DOI 10.1109/CVPR.2012.6248074
[9]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448
[10]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778