Understanding Traffic Density from Large-Scale Web Camera Data

被引:100
作者
Zhang, Shanghang [1 ,2 ]
Wu, Guanhang [1 ]
Costeira, Joao P. [2 ]
Moura, Jose M. F. [1 ]
机构
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[2] Univ Lisbon, ISR IST, Lisbon, Portugal
来源
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) | 2017年
基金
美国安德鲁·梅隆基金会;
关键词
D O I
10.1109/CVPR.2017.454
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Understanding traffic density from large-scale web camera (webcam) videos is a challenging problem because such videos have low spatial and temporal resolution, high occlusion and large perspective. To deeply understand traffic density, we explore both optimization based and deep learning based methods. To avoid individual vehicle detection or tracking, both methods map the dense image feature into vehicle density, one based on rank constrained regression and the other based on fully convolutional networks (FCN). The regression based method learns different weights for different blocks of the image to embed road geometry and significantly reduce the error induced by camera perspective. The FCN based method jointly estimates vehicle density and vehicle count with a residual learning framework to perform end-to-end dense prediction, allowing arbitrary image resolution, and adapting to different vehicle scales and perspectives. We analyze and compare both methods, and get insights from optimization based method to improve deep model. Since existing datasets do not cover all the challenges in our work, we collected and labelled a large-scale traffic video dataset, containing 60 million frames from 212 webcams. Both methods are extensively evaluated and compared on different counting tasks and datasets. FCN based method significantly reduces the mean absolute error (MAE) from 10.99 to 5.31 on the public dataset TRANCOS compared with the state-of-the-art baseline.
引用
收藏
页码:4264 / 4273
页数:10
相关论文
共 38 条
[1]  
An S., 2007, P IEEE C COMP VIS PA, DOI 10.1109/CVPR.2007.383105.
[2]  
[Anonymous], IEEE C COMP VIS PATT
[3]  
[Anonymous], ACM SIGGRAPH
[4]  
[Anonymous], 2012, PROC IEEE C COMPUTER
[5]  
[Anonymous], P NEURAL INFORM PROC
[6]  
[Anonymous], 15 INT IEEE C INT TR
[7]  
[Anonymous], 15 INT IEEE C INT TR
[8]  
[Anonymous], IEEE 70 VEH TECHN C
[9]  
[Anonymous], 2016, ARXIV160600915
[10]  
[Anonymous], ADV NEURAL INFORM PR