HAGN: Hierarchical Attention Guided Network for Crowd Counting

被引:5
作者
Duan, Zuodong [1 ]
Xie, Yujun [2 ]
Deng, Jiahao [1 ]
机构
[1] Beijing Inst Technol, Sch Mechatron Engn, Beijing 100081, Peoples R China
[2] Beijing Inst Technol, Sch Comp Sci & Technol, Beijing 100081, Peoples R China
关键词
Crowd counting; crowd density estimation; crowd localization; hierarchical attention mechanism;
D O I
10.1109/ACCESS.2020.2975268
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, deep learning based crowd counting networks have achieved significant progress. However, most of them generate rough crowd density maps due to low-resolution features used for estimating crowd distribution, which affects the performance of crowd counting. To solve this problem, in this paper, we propose a Hierarchical Attention Guided Network (HAGN) for crowd counting. We apply the first 13 layers of VGG-16 to extract base features. Then, the extracted features are processed by the Hierarchical Attention Mechanism (HAM), which guided the extracted features to enlarge step by step via our proposed attention guided branch. Finally, the outputs of HAM are fed to 1 x 1 convolutional layer for final crowd density estimation. Experiments are performed on ShanghaiTech and UCF-QNRF datasets, and our HAGN achieves promising performance compared with the other state-of-the-art methods on crowd counting and crowd localization, respectively.
引用
收藏
页码:36376 / 36385
页数:10
相关论文
共 64 条
[1]  
[Anonymous], 2018, ARXIV180700601
[2]  
[Anonymous], 2019, 33 AAAI C ART INT
[3]  
[Anonymous], 2018, ARXIV180801050
[4]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[5]   Efficient and Switchable CNN for Crowd Counting Based on Embedded Terminal [J].
Chen, Jingyu ;
Zhang, Qiong ;
Zheng, Wei-Shi ;
Xie, Xiaohua .
IEEE ACCESS, 2019, 7 :51533-51541
[6]   SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning [J].
Chen, Long ;
Zhang, Hanwang ;
Xiao, Jun ;
Nie, Liqiang ;
Shao, Jian ;
Liu, Wei ;
Chua, Tat-Seng .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6298-6306
[7]   Learning Spatial Awareness to Improve Crowd Counting [J].
Cheng, Zhi-Qi ;
Li, Jun-Xiu ;
Dai, Qi ;
Wu, Xiao ;
Hauptmann, Alexander G. .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :6151-6160
[8]   Improving the Learning of Multi-column Convolutional Neural Network for Crowd Counting [J].
Cheng, Zhi-Qi ;
Li, Jun-Xiu ;
Dai, Qi ;
Wu, Xiao ;
He, Jun-Yan ;
Hauptmann, Alexander G. .
PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, :1897-1906
[9]   SCAR: Spatial-/channel-wise attention regression networks for crowd counting [J].
Gao, Junyu ;
Wang, Qi ;
Yuan, Yuan .
NEUROCOMPUTING, 2019, 363 :1-8
[10]  
Gao Junyu, 2019, arXiv:1907.02724