Multi-Scale and spatial position-based channel attention network for crowd counting

被引:6
作者
Wang, Lin [1 ]
Li, Jie [1 ]
Zhang, Siqi [2 ]
Qi, Chun [1 ]
Wang, Pan [1 ]
Wang, Fengping [1 ]
机构
[1] Xi An Jiao Tong Univ, Fac Elect & Informat Engn, Sch Informat & Commun Engn, Xian 710049, Peoples R China
[2] Xian Modern Control Technol Res Inst, Xian 710065, Peoples R China
基金
中国国家自然科学基金;
关键词
Crowd counting; Spatial position -based channel attention model; Multi -scale structure; Adaptive loss;
D O I
10.1016/j.jvcir.2022.103718
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Crowd counting algorithms have recently incorporated attention mechanisms into convolutional neural networks (CNNs) to achieve significant progress. The channel attention model (CAM), as a popular attention mechanism, calculates a set of probability weights to select important channel-wise feature responses. However, most CAMs roughly assign a weight to the entire channel-wise map, which makes useful and useless information being treat indiscriminately, thereby limiting the representational capacity of networks. In this paper, we propose a multi -scale and spatial position-based channel attention network (MS-SPCANet), which integrates spatial position -based channel attention models (SPCAMs) with multiple scales into a CNN. SPCAM assigns different channel attention weights to different positions of channel-wise maps to capture more informative features. Furthermore, an adaptive loss, which uses adaptive coefficients to combine density map loss and headcount loss, is constructed to improve network performance in sparse crowd scenes. Experimental results on four public datasets verify the superiority of the scheme.
引用
收藏
页数:12
相关论文
共 55 条
[31]   A Diffusion and Clustering-Based Approach for Finding Coherent Motions and Understanding Crowd Scenes [J].
Lin, Weiyao ;
Mi, Yang ;
Wang, Weiyue ;
Wu, Jianxin ;
Wang, Jingdong ;
Mei, Tao .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (04) :1674-1687
[32]   Recurrent Attentive Zooming for Joint Crowd Counting and Precise Localization [J].
Liu, Chenchen ;
Weng, Xinyu ;
Mu, Yadong .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :1217-1226
[33]   DecideNet: Counting Varying Density Crowds Through Attention Guided Detection and Density Estimation [J].
Liu, Jiang ;
Gao, Chenqiang ;
Meng, Deyu ;
Hauptmann, Alexander G. .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :5197-5206
[34]   ADCrowdNet: An Attention-Injective Deformable Convolutional Network for Crowd Understanding [J].
Liu, Ning ;
Long, Yongchao ;
Zou, Changqing ;
Niu, Qun ;
Pan, Li ;
Wu, Hefeng .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :3220-3229
[35]  
Liu XB, 2016, AAAI CONF ARTIF INTE, P3553
[36]   Towards Perspective-Free Object Counting with Deep Learning [J].
Onoro-Rubio, Daniel ;
Lopez-Sastre, Roberto J. .
COMPUTER VISION - ECCV 2016, PT VII, 2016, 9911 :615-629
[37]   Locate, Size, and Count: Accurately Resolving People in Dense Crowds via Detection [J].
Sam, Deepak Babu ;
Peri, Skand Vishwanath ;
Sundararaman, Mukuntha Narayanan ;
Kamath, Amogh ;
Babu, R. Venkatesh .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (08) :2739-2751
[38]   Switching Convolutional Neural Network for Crowd Counting [J].
Sam, Deepak Babu ;
Surya, Shiv ;
Babu, R. Venkatesh .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :4031-4039
[39]   Pushing the Frontiers of Unconstrained Crowd Counting: New Dataset and Benchmark Method [J].
Sindagi, Vishwanath A. ;
Yasarla, Rajeev ;
Patel, Vishal M. .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :1221-1231
[40]   HA-CCN: Hierarchical Attention-Based Crowd Counting Network [J].
Sindagi, Vishwanath A. ;
Patel, Vishal M. .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 :323-335