A Novel Multi-Sensor Fusion Based Object Detection and Recognition Algorithm for Intelligent Assisted Driving

被引:22
作者
Liu, Tianbi [1 ,5 ]
Du, Shanshan [2 ]
Liang, Chenchen [3 ]
Zhang, Bo [4 ]
Feng, Rui [1 ,5 ]
机构
[1] Fudan Univ, Sch Comp Sci, Shanghai 200433, Peoples R China
[2] Fudan Univ, Sch Informat Sci & Technol, Shanghai 200433, Peoples R China
[3] Tongji Univ, Sch Elect & Informat Engn, Shanghai 200082, Peoples R China
[4] Fudan Univ, Acad Engn & Technol, Shanghai 200433, Peoples R China
[5] Fudan Univ, Shanghai Key Lab Intelligent Informat Proc, Shanghai 200433, Peoples R China
关键词
Radar; Radar imaging; Radar detection; Object detection; Millimeter wave radar; Cameras; Data integration; Intelligent assisted driving; object detection and recognition; multi-sensor fusion; millimeter-wave radar; high-definition video;
D O I
10.1109/ACCESS.2021.3083503
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The object detection and recognition algorithm based on the fusion of millimeter-wave radar and high-definition video data can improve the safety of intelligent-driving vehicles effectively. However, due to the different data modalities of millimeter-wave radar and video, how to fuse the two effectively is the key point. The difficulty lies in the data fusion methods such as insufficient adaptability of image distortion in data alignment and coordinate transformation and also the mismatching of information levels of the data to be fused. To solve the problem of data fusion of millimeter wave radar and video, this paper proposes a decision-level fusion method of millimeter-wave radar and high-definition video data based on angular alignment. Specifically, through the joint calibration and approximate interpolation, projected to polar coordinate system, the radar and the camera are angularly aligned in the horizontal direction. Then objects are detected by a deep neural network model from video data, and combined with those detected by radar to make the joint decision. Finally, object detection and recognition task based on the fusion of the two kinds of data is completed. Theoretical analysis and experimental results indicate that the accuracy of the algorithm based on the two data fusion is superior to that of the single detection and recognition algorithm on the basis of millimeter-wave radar or video data.
引用
收藏
页码:81564 / 81574
页数:11
相关论文
共 39 条
[1]  
Ali W., 2018, P EUR C COMP VIS ECC, P716
[2]  
[Anonymous], 2017, ARXIV170602413
[3]  
[Anonymous], 2016, ARXIV160807916
[4]  
Aziz K., 2020, P IEEE RAD C RADARCO, P1
[5]  
Beltrán J, 2018, IEEE INT C INTELL TR, P3517, DOI 10.1109/ITSC.2018.8569311
[6]  
Bochkovskiy A., 2020, ARXIV200410934
[7]  
Chen Xinlei, 2015, CORR
[8]  
Chen YL, 2019, IEEE I CONF COMP VIS, P9774, DOI [10.1109/iccv.2019.00987, 10.1109/ICCV.2019.00987]
[9]   Shape Completion using 3D-Encoder-Predictor CNNs and Shape Synthesis [J].
Dai, Angela ;
Qi, Charles Ruizhongtai ;
Niessner, Matthias .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6545-6554
[10]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893