Multi-Scale and spatial position-based channel attention network for crowd counting

Cited by: 6
Authors
Wang, Lin [1]
Li, Jie [1]
Zhang, Siqi [2]
Qi, Chun [1]
Wang, Pan [1]
Wang, Fengping [1]
Affiliations
[1] Xi An Jiao Tong Univ, Fac Elect & Informat Engn, Sch Informat & Commun Engn, Xian 710049, Peoples R China
[2] Xian Modern Control Technol Res Inst, Xian 710065, Peoples R China
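Funding
National Natural Science Foundation of China
Keywords
Crowd counting; Spatial position-based channel attention model; Multi-scale structure; Adaptive loss
DOI
10.1016/j.jvcir.2022.103718
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract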
Crowd counting algorithms have recently made significant progress by incorporating attention mechanisms into convolutional neural networks (CNNs). The channel attention model (CAM), a popular attention mechanism, computes a set of probability weights to select important channel-wise feature responses. However, most CAMs coarsely assign a single weight to an entire channel-wise map, so useful and useless information within that map is treated indiscriminately, which limits the representational capacity of the network. In this paper, we propose a multi-scale and spatial position-based channel attention network (MS-SPCANet), which integrates spatial position-based channel attention models (SPCAMs) at multiple scales into a CNN. SPCAM assigns different channel attention weights to different positions of the channel-wise maps to capture more informative features. Furthermore, an adaptive loss, which uses adaptive coefficients to combine a density map loss and a headcount loss, is constructed to improve network performance in sparse crowd scenes. Experimental results on four public datasets verify the superiority of the proposed scheme.
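The contrast drawn in the abstract, one weight per channel versus channel weights that vary by spatial position, can be illustrated with a minimal NumPy sketch. This is not the authors' SPCAM or their adaptive loss: the function names, the single projection matrix `w`, the sigmoid gating, and the squared-error and absolute-count loss terms are all assumptions made for illustration. The abstract does not say how the adaptive coefficients are computed, so the sketch takes them as plain inputs.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def global_channel_attention(feat, w):
    """Conventional channel attention (assumed SE-style gating).

    feat: feature maps of shape (C, H, W).
    w:    hypothetical (C, C) projection matrix.
    Every position in a channel is scaled by the same weight.
    """
    pooled = feat.mean(axis=(1, 2))          # (C,) global average pooling
    weights = sigmoid(w @ pooled)            # (C,) one weight per channel
    return feat * weights[:, None, None]


def position_wise_channel_attention(feat, w):
    """Illustrative position-dependent channel attention.

    The same hypothetical projection is applied at every spatial
    position (equivalent to a 1x1 convolution), so each position
    receives its own vector of channel weights.
    """
    logits = np.einsum("dc,chw->dhw", w, feat)   # (C, H, W)
    weights = sigmoid(logits)                    # weights differ per position
    return feat * weights


def combined_loss(pred_density, gt_density, coef_density, coef_count):
    """Weighted sum of a density-map loss and a headcount loss.

    The coefficients are passed in as-is; how the paper adapts them
    is not stated in the abstract.
    """
    density_loss = np.mean((pred_density - gt_density) ** 2)
    count_loss = abs(pred_density.sum() - gt_density.sum())
    return coef_density * density_loss + coef_count * count_loss


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    feat = rng.uniform(0.5, 1.5, size=(4, 3, 3))
    w = rng.normal(size=(4, 4))
    print(global_channel_attention(feat, w).shape)
    print(position_wise_channel_attention(feat, w).shape)
```

In the first function the ratio of output to input is constant within each channel, whereas in the second it changes from position to position, which is the distinction the abstract makes between conventional CAMs and SPCAM.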
Pages: 12