Crowd Counting on Images with Scale Variation and Isolated Clusters

Cited by: 11
Authors
Bai, Haoyue [1]
Wen, Song [1]
Chan, S.-H. Gary [1]
Affiliations
[1] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Hong Kong, Peoples R China
Source
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW) | 2019
DOI
10.1109/ICCVW.2019.00009
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Crowd counting aims to estimate the number of objects (e.g., people or vehicles) in an image of an unconstrained, congested scene. Designing a general crowd counting algorithm applicable to a wide range of crowd images is challenging, mainly because of the potentially large variation in object scales and the presence of many isolated small clusters. Previous approaches based on convolution operations with multi-branch architectures are effective for only some narrow bands of scales, and do not capture the long-range contextual relationships that arise from isolated clustering. To address this, we propose SACANet, a novel scale-adaptive long-range context-aware network for crowd counting. SACANet consists of three major modules: a pyramid contextual module, which extracts long-range contextual information and enlarges the receptive field; a scale-adaptive self-attention multi-branch module, which attains high scale sensitivity and detection accuracy for isolated clusters; and a hierarchical fusion module, which fuses multi-level self-attention features. With group normalization, SACANet achieves better optimality in the training process. We have conducted extensive experiments using the VisDrone2019 People dataset, the VisDrone2019 Vehicle dataset, and some other challenging benchmarks. Compared with state-of-the-art methods, SACANet is shown to be effective, especially for extremely crowded conditions with diverse scales and scattered clusters, and achieves a much lower mean absolute error (MAE) than the baselines.
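The MAE figure mentioned in the abstract can be illustrated with a minimal sketch. It assumes the common crowd-counting convention that a model outputs a density map whose values sum to the estimated count; the abstract itself does not state this, and the function names and toy numbers below are hypothetical, not taken from the paper.

```python
from typing import Sequence


def count_from_density(density_map: Sequence[Sequence[float]]) -> float:
    """Estimated object count, assumed here to be the sum of all density values."""
    return sum(sum(row) for row in density_map)


def mean_absolute_error(predicted: Sequence[float], actual: Sequence[float]) -> float:
    """Average absolute difference between predicted and ground-truth counts."""
    if len(predicted) != len(actual) or len(predicted) == 0:
        raise ValueError("predicted and actual must be non-empty and of equal length")
    return sum(abs(p - a) for p, a in zip(predicted, actual)) / len(predicted)


# Toy data (hypothetical): two tiny 2x2 "density maps" and their true counts.
pred_maps = [
    [[0.5, 0.5], [1.0, 0.0]],      # sums to 2.0
    [[0.25, 0.25], [0.25, 0.25]],  # sums to 1.0
]
true_counts = [3.0, 1.0]

pred_counts = [count_from_density(m) for m in pred_maps]
mae = mean_absolute_error(pred_counts, true_counts)
print(mae)  # 0.5
```

A lower MAE means the predicted counts are, on average, closer to the ground-truth counts across the test images.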
Pages: 18-27
Page count: 10