CrowdNet: A Deep Convolutional Network for Dense Crowd Counting

被引:409
作者
Boominathan, Lokesh [1 ]
Kruthiventi, Srinivas S. S. [1 ]
Babu, R. Venkatesh [1 ]
机构
[1] Indian Inst Sci, Video Analyt Lab, Bangalore 560012, Karnataka, India
来源
MM'16: PROCEEDINGS OF THE 2016 ACM MULTIMEDIA CONFERENCE | 2016年
关键词
Crowd Density; Convolutional Neural Networks;
D O I
10.1145/2964284.2967300
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Our work proposes a novel deep learning framework for estimating crowd density from static images of highly dense crowds. We use a combination of deep and shallow, fully convolutional networks to predict the density map for a given crowd image. Such a combination is used for effectively capturing both the high-level semantic information (face/body detectors) and the low-level features (blob detectors), that are necessary for crowd counting under large scale variations. As most crowd datasets have limited training samples (<100 images) and deep learning based approaches require large amounts of training data, we perform multi scale data augmentation. Augmenting the training samples in such a manner helps in guiding the CNN to learn scale invariant representations. Our method is tested on the challenging UCF_CC_50 dataset, and shown to outperform the state of the art methods.
引用
收藏
页码:640 / 644
页数:5
相关论文
共 19 条
[1]  
[Anonymous], 2006, IEEE COMP SOC C COMP
[2]  
[Anonymous], 2015, ARXIV151002927
[3]  
Brostow G.J., 2006, CVPR, P594, DOI DOI 10.1109/CVPR.2006.320
[4]   Privacy preserving crowd monitoring: Counting people without people models or tracking [J].
Chan, Antoni B. ;
Liang, Zhang-Sheng John ;
Vasconcelos, Nuno .
2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12, 2008, :1766-1772
[5]   Feature Mining for Localised Crowd Counting [J].
Chen, Ke ;
Loy, Chen Change ;
Gong, Shaogang ;
Xiang, Tao .
PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2012, 2012,
[6]  
Chen L.-C., 2014, ARXIV
[7]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[8]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[9]  
Ferryman J., 2010, IEEE INT C ADV VID S, P2
[10]   Multi-Source Multi-Scale Counting in Extremely Dense Crowd Images [J].
Idrees, Haroon ;
Saleemi, Imran ;
Seibert, Cody ;
Shah, Mubarak .
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, :2547-2554