PDANet: Pyramid density-aware attention based network for accurate crowd counting

被引:20
作者
Amirgholipour, Saeed [1 ,2 ]
Jia, Wenjing [1 ]
Liu, Lei [3 ]
Fan, Xiaochen [1 ]
Wang, Dadong [2 ]
He, Xiangjian [1 ]
机构
[1] Univ Technol Sydney, Sch Elect & Data Engn, Sydney, NSW, Australia
[2] CSIRO, Quantitat Imaging Res Team, Data61, Sydney, NSW, Australia
[3] Beihang Univ, Sch Instrumentat Sci & Optoelect Engn, Beijing, Peoples R China
关键词
Crowd counting; Pyramid module; Density ware; Attention module; Classification module; Convolutional neural networks;
D O I
10.1016/j.neucom.2021.04.037
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Crowd counting, i.e., estimating the number of people in crowded areas, has attracted much interest in the research community. Although many attempts have been reported, crowd counting remains an open real-world problem due to the vast density variations and severe occlusion within the interested crowd area. In this paper, we propose a novel Pyramid Density-Aware Attention based network, abbreviated as PDANet, which leverages the attention, pyramid scale feature, and two branch decoder modules for density-aware crowd counting. The PDANet utilizes these modules to extract features of different scales while focusing on the relevant information and suppressing the misleading information. We also address the variation of crowdedness levels among different images with a Density-Aware Decoder (DAD) modules. For this purpose, a classifier is constructed to evaluate the density level of the input features and then passes them to the corresponding high and low density DAD modules. Finally, we generate an overall density map by considering the summation of low and high crowdedness density maps. Meanwhile, we employ different losses aiming to achieve a precise density map for the input scene. Extensive evaluations conducted on the challenging benchmark datasets well demonstrate the superior performance of the proposed PDANet in terms of the accuracy of counting and generated density maps over the wellknown state-of-the-art approaches. (c) 2021 Published by Elsevier B.V.
引用
收藏
页码:215 / 230
页数:16
相关论文
共 48 条
[1]  
Amirgholipour S, 2018, IEEE IMAGE PROC, P948, DOI 10.1109/ICIP.2018.8451399
[2]  
[Anonymous], IEEE T MULTIMEDIA
[3]  
[Anonymous], 2019, 33 AAAI C ART INT
[4]  
[Anonymous], 2010, NIPS
[5]   Scale Aggregation Network for Accurate and Efficient Crowd Counting [J].
Cao, Xinkun ;
Wang, Zhipeng ;
Zhao, Yanyun ;
Su, Fei .
COMPUTER VISION - ECCV 2018, PT V, 2018, 11209 :757-773
[6]  
Chan A. B., 2008, PROC CVPR IEEE, P1
[7]   Bayesian Poisson Regression for Crowd Counting [J].
Chan, Antoni B. ;
Vasconcelos, Nuno .
2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009, :545-551
[8]   Scale Pyramid Network for Crowd Counting [J].
Chen, Xinya ;
Bin, Yanrui ;
Sang, Nong ;
Gao, Changxin .
2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, :1941-1950
[9]   An Aggregated Multicolumn Dilated Convolution Network for Perspective-Free Counting [J].
Deb, Diptodip ;
Ventura, Jonathan .
PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, :308-317
[10]  
Fiaschi L, 2012, INT C PATT RECOG, P2685