Single-column CNN for crowd counting with pixel-wise attention mechanism

被引:11
作者
Wang, Bisheng [1 ]
Cao, Guo [1 ]
Shang, Yanfeng [2 ]
Zhou, Licun [2 ]
Zhang, Youqiang [1 ]
Li, Xuesong [1 ]
机构
[1] Nanjing Univ Sci & Technol, 200 Xiao Lingwei St, Nanjing, Jiangsu, Peoples R China
[2] Minist Publ Secur, Res Inst 3, Shanghai, Peoples R China
关键词
Crowd counting; CNN; Pixel-wise attention mechanism; FCN; ANOMALY DETECTION;
D O I
10.1007/s00521-018-3810-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a novel method for accurate people counting in highly dense crowd images. The proposed method consists of three modules: extracting foreground regions (EF), pixel-wise attention mechanism (PAM) and single-column density map estimator (S-DME). EF can suppress the disturbance of complex background efficiently with a fully convolutional network, PAM performs pixel-wise classification of crowd images to generate high-quality local crowd density maps, and S-DME is a carefully designed single-column network that can learn more representative features with much fewer parameters. In addition, two new evaluation metrics are introduced to get a comprehensive understanding of the performance of different modules in our algorithm. Experiments demonstrate that our approach can get the state-of-the-art results on several challenging datasets including our dataset with highly cluttered environments and various camera perspectives.
引用
收藏
页码:2897 / 2908
页数:12
相关论文
共 43 条
[1]  
[Anonymous], 2015, CVPR
[2]  
[Anonymous], 2015, CVPR
[3]  
[Anonymous], BRIT MACHINE VISION
[4]  
[Anonymous], 2014, ARXIV14120774
[5]  
[Anonymous], 2014, Comput. Sci.
[6]  
[Anonymous], COMPUTER VISION PATT
[7]  
[Anonymous], 2015, COMPUTER VISION PATT
[8]  
[Anonymous], 2015, ACM INT C MULT
[9]  
[Anonymous], 2012, EUR C COMP VIS
[10]  
[Anonymous], 2016, CVPR