Crowd Counting via Weighted VLAD on a Dense Attribute Feature Map

被引:69
作者
Sheng, Biyun [1 ,2 ]
Shen, Chunhua [3 ]
Lin, Guosheng [3 ]
Li, Jun [1 ,2 ]
Yang, Wankou [1 ,2 ]
Sun, Changyin [1 ,2 ]
机构
[1] Southeast Univ, Sch Automat, Nanjing 210096, Jiangsu, Peoples R China
[2] Southeast Univ, Minist Educ, Key Lab Measurement & Control Complex Syst Engn, Nanjing 210096, Jiangsu, Peoples R China
[3] Univ Adelaide, Sch Comp Sci, Adelaide, SA 5005, Australia
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Crowd counting; locality-aware feature (LAF); semantic attributes; weighted vector of locally aggregated descriptor (W-VLAD) encoder; AGE;
D O I
10.1109/TCSVT.2016.2637379
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Crowd counting is an important task in computer vision, which has many applications in video surveillance. Although the regression-based framework has achieved great improvements for crowd counting, how to improve the discriminative power of image representation is still an open problem. Conventional holistic features used in crowd counting often fail to capture semantic attributes and spatial cues of the image. In this paper, we propose integrating semantic information into learning locality-aware feature (LAF) sets for accurate crowd counting. First, with the help of a convolutional neural network, the original pixel space is mapped onto a dense attribute feature map, where each dimension of the pixelwise feature indicates the probabilistic strength of a certain semantic class. Then, LAF built on the idea of spatial pyramids on neighboring patches is proposed to explore more spatial context and local information. Finally, the traditional vector of locally aggregated descriptor (VLAD) encoding method is extended to a more generalized form weighted-VLAD (W-VLAD) in which diverse coefficient weights are taken into consideration. Experimental results validate the effectiveness of our presented method.
引用
收藏
页码:1788 / 1797
页数:10
相关论文
共 33 条
[1]  
[Anonymous], PAMI
[2]  
[Anonymous], 2016, PROC CVPR IEEE, DOI DOI 10.1109/CVPR.2016.348
[3]  
[Anonymous], 2016, PROC IEEE C COMPUTER
[4]  
[Anonymous], 2015, PROC CVPR IEEE
[5]  
[Anonymous], P CVPR
[6]  
[Anonymous], CVPR
[7]   Statistical physics of social dynamics [J].
Castellano, Claudio ;
Fortunato, Santo ;
Loreto, Vittorio .
REVIEWS OF MODERN PHYSICS, 2009, 81 (02) :591-646
[8]  
Chan A. B., 2008, P CVPR, P1
[9]   Counting People With Low-Level Features and Bayesian Regression [J].
Chan, Antoni B. ;
Vasconcelos, Nuno .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2012, 21 (04) :2160-2177
[10]   Feature Mining for Localised Crowd Counting [J].
Chen, Ke ;
Loy, Chen Change ;
Gong, Shaogang ;
Xiang, Tao .
PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2012, 2012,