Boosting Multi-Vehicle Tracking with a Joint Object Detection and Viewpoint Estimation Sensor

被引:157
作者
Lopez-Sastre, Roberto J. [1 ]
Herranz-Perdiguero, Carlos [1 ]
Guerrero-Gomez-Olmedo, Ricardo [2 ]
Onoro-Rubio, Daniel [3 ]
Maldonado-Bascon, Saturnino [1 ]
机构
[1] Univ Alcala, Dept Signal Theory & Commun, GRAM, Alcala De Henares 28805, Spain
[2] BBVA Next Technol, Madrid 28050, Spain
[3] NEC Labs Europe, Kurfursten Anlage 36, D-69115 Heidelberg, Germany
关键词
traffic monitoring sensor; vehicle tracking; vehicle detection; tracking by detection; viewpoint estimation; smart city; CATEGORIZATION; ENVIRONMENTS; ALGORITHM; MODEL;
D O I
10.3390/s19194062
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
In this work, we address the problem of multi-vehicle detection and tracking for traffic monitoring applications. We preset a novel intelligent visual sensor for tracking-by-detection with simultaneous pose estimation. Essentially, we adapt an Extended Kalman Filter (EKF) to work not only with the detections of the vehicles but also with their estimated coarse viewpoints, directly obtained with the vision sensor. We show that enhancing the tracking with observations of the vehicle pose, results in a better estimation of the vehicles trajectories. For the simultaneous object detection and viewpoint estimation task, we present and evaluate two independent solutions. One is based on a fast GPU implementation of a Histogram of Oriented Gradients (HOG) detector with Support Vector Machines (SVMs). For the second, we adequately modify and train the Faster R-CNN deep learning model, in order to recover from it not only the object localization but also an estimation of its pose. Finally, we publicly release a challenging dataset, the GRAM Road Traffic Monitoring (GRAM-RTM), which has been especially designed for evaluating multi-vehicle tracking approaches within the context of traffic monitoring applications. It comprises more than 700 unique vehicles annotated across more than 40.300 frames of three videos. We expect the GRAM-RTM becomes a benchmark in vehicle detection and tracking, providing the computer vision and intelligent transportation systems communities with a standard set of images, annotations and evaluation procedures for multi-vehicle tracking. We present a thorough experimental evaluation of our approaches with the GRAM-RTM, which will be useful for establishing further comparisons. The results obtained confirm that the simultaneous integration of vehicle localizations and pose estimations as observations in an EKF, improves the tracking results.
引用
收藏
页数:24
相关论文
共 58 条
[1]  
[Anonymous], P IWINAC MALL SPAIN
[2]  
[Anonymous], P IEEE INT C ADV VID
[3]  
[Anonymous], P ECCV FLOR IT 7 13
[4]  
[Anonymous], 2006, 95041 TR U N CAR
[5]  
[Anonymous], 2015, ARXIV, DOI DOI 10.48550/ARXIV.1504.08083
[6]  
[Anonymous], P BMVC 2014 NOTT UK
[7]  
[Anonymous], 2014, ARXIV13112524
[8]  
[Anonymous], P INT TRANSP SYST PI
[9]  
[Anonymous], ADV NEURAL INFORM PR, DOI DOI 10.1109/TPAMI.2016.2577031
[10]  
[Anonymous], 2014, P IEEE WINT C APPL C