End to End Multi-Scale Convolutional Neural Network for Crowd Counting

被引:0
作者
Ji, Deyi [1 ]
Lu, Hongtao [1 ]
Zhang, Tongzhen [1 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Key Lab Shanghai Educ Commiss Intelligent Interac, Shanghai 200240, Peoples R China
来源
ELEVENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2018) | 2019年 / 11041卷
关键词
Crowd counting; Deep convolutional neural network; Multi-scale features; End to end;
D O I
10.1117/12.2522940
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Crowd counting is a challenging task in computer vison field and haven't been well addressed until now. In this paper, we intend to develop an end to end multi-scale deep convolutional neural network(CNN) model that can accurately estimate the crowd count from an individual image with arbitrary crowd density and perspective. The proposed model extract multi-scale deep CNN features from the input image and regress the crwod count directly, without any post-processing. Hence our model could handle muti-scale targets well in various crowd scene. We evaluate our model on several benchmark datasets and the performance outperforms some state-of-the-art methods. What's more, due to the end-to-end characteristics, our model demonstrates good practical application performance.
引用
收藏
页数:6
相关论文
共 21 条
[1]  
[Anonymous], 2014, FULLY CONVOLUTIONAL
[2]  
[Anonymous], 2016, LECT NOTES COMPUT SC, DOI DOI 10.1007/978-3-319-46484-8_43
[3]  
[Anonymous], ADV NEURAL INFORM PR
[4]  
[Anonymous], EUR C COMP VIS
[5]  
[Anonymous], CVPR WORKSH CVPRW AI
[6]  
[Anonymous], 2017, PYTORCH
[7]   Bayesian Poisson Regression for Crowd Counting [J].
Chan, Antoni B. ;
Vasconcelos, Nuno .
2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009, :545-551
[8]   Feature Mining for Localised Crowd Counting [J].
Chen, Ke ;
Loy, Chen Change ;
Gong, Shaogang ;
Xiang, Tao .
PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2012, 2012,
[9]   Dense crowd counting from still images with convolutional neural networks [J].
Hu, Yaocong ;
Chang, Huan ;
Nian, Fudong ;
Wang, Yan ;
Li, Teng .
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2016, 38 :530-539
[10]  
Kingma DP, 2014, ADV NEUR IN, V27