A fast and effective video vehicle detection method leveraging feature fusion and proposal temporal link

Cited by: 13
Authors
Yang, Yanni [1]
Song, Huansheng [1]
Sun, Shijie [1]
Zhang, Wentao [1]
Chen, Yan [1]
Rakal, Lionel [1]
Fang, Yong [1]
Affiliations
[1] Chang'an University, School of Information Engineering, Xi'an 710064, People's Republic of China
Funding
National Natural Science Foundation of China;
Keywords
Object detection; Feature fusion; SSD; Video vehicle detection; Detections optimizing;
DOI
10.1007/s11554-021-01121-y
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Vehicle detection in videos is a valuable but challenging technology in traffic monitoring. Because it runs in real time, the Single Shot MultiBox Detector (SSD) is often used to detect vehicles in images. However, the reduced accuracy of SSD is a significant problem in video vehicle detection. To address this problem in real time, this paper enhances detection performance by improving the SSD and exploiting the relationship between detections across frames. We propose a feature-fused SSD detector and a Tracking-guided Detections Optimizing (TDO) strategy for fast and effective video vehicle detection. We introduce a lightweight feature fusion sub-network into the standard SSD network, which aggregates the deeper-layer features into the shallower-layer features to enhance the semantic information of the shallower layers. At the post-processing stage of the feature-fused SSD, non-maximum suppression (NMS) is replaced by the TDO strategy, which links vehicles across frames using a fast tracking algorithm. Missed detections can thus be compensated by the propagated results, and the confidence of the final results can be optimized in the temporal domain. Our approach significantly improves the temporal consistency of the detection results at low computational cost. We evaluate the proposed method on two datasets. Experiments on our labeled highway dataset show that the mean average precision (mAP) of our method is 8.2% higher than that of the base detector. The feature-fused SSD runs at 27.1 frames per second (fps), which is suitable for real-time detection. Experiments on the ImageNet VID dataset show that the proposed method is also comparable with state-of-the-art detectors.
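The idea of linking detections across frames, so that missed detections are compensated and confidences are refined over time, can be illustrated with a minimal sketch. This is not the paper's TDO strategy: the abstract does not specify the tracking algorithm, so the sketch substitutes greedy IoU matching between two consecutive frames. The function names (`iou`, `link_and_refine`), the IoU threshold, the score averaging, and the decay factor are all assumptions made for illustration.

```python
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2)
Det = Tuple[Box, float]                  # (box, confidence)


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def link_and_refine(prev: List[Det], curr: List[Det],
                    iou_thr: float = 0.5, decay: float = 0.9) -> List[Det]:
    """Link current detections to previous ones (hypothetical stand-in for TDO).

    Matched detections get the average of the two confidences; previous
    detections with no match are propagated with a decayed confidence,
    which compensates for a miss in the current frame.
    """
    refined: List[Det] = []
    matched_prev = set()
    for box, score in curr:
        best_j, best_iou = -1, iou_thr
        for j, (pbox, _) in enumerate(prev):
            if j in matched_prev:
                continue
            overlap = iou(box, pbox)
            if overlap >= best_iou:
                best_j, best_iou = j, overlap
        if best_j >= 0:
            matched_prev.add(best_j)
            score = 0.5 * (score + prev[best_j][1])  # temporal smoothing
        refined.append((box, score))
    # Propagate unmatched previous detections (assumed static position).
    for j, (pbox, pscore) in enumerate(prev):
        if j not in matched_prev:
            refined.append((pbox, pscore * decay))
    return refined


prev_frame = [((0, 0, 10, 10), 0.9), ((20, 20, 30, 30), 0.8)]
curr_frame = [((1, 0, 11, 10), 0.5)]  # the second vehicle was missed
result = link_and_refine(prev_frame, curr_frame)
```

In this toy example the first vehicle is matched and its confidence is smoothed, while the second vehicle, missed in the current frame, is carried over from the previous frame with a reduced score. A real tracker would also predict the propagated box's motion rather than reusing its old position.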
Pages: 1261-1274
Number of pages: 14