Scale Aggregation Network for Accurate and Efficient Crowd Counting

被引:484
作者
Cao, Xinkun [1 ]
Wang, Zhipeng [1 ]
Zhao, Yanyun [1 ,2 ]
Su, Fei [1 ,2 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Informat & Commun Engn, Beijing, Peoples R China
[2] Beijing Univ Posts & Telecommun, Beijing Key Lab Network Syst & Network Culture, Beijing, Peoples R China
来源
COMPUTER VISION - ECCV 2018, PT V | 2018年 / 11209卷
关键词
Crowd counting; Crowd density estimation; Scale Aggregation Network; Local pattern consistency;
D O I
10.1007/978-3-030-01228-1_45
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a novel encoder-decoder network, called Scale Aggregation Network (SANet), for accurate and efficient crowd counting. The encoder extracts multi-scale features with scale aggregation modules and the decoder generates high-resolution density maps by using a set of transposed convolutions. Moreover, we find that most existing works use only Euclidean loss which assumes independence among each pixel but ignores the local correlation in density maps. Therefore, we propose a novel training loss, combining of Euclidean loss and local pattern consistency loss, which improves the performance of the model in our experiments. In addition, we use normalization layers to ease the training process and apply a patch-based test scheme to reduce the impact of statistic shift problem. To demonstrate the effectiveness of the proposed method, we conduct extensive experiments on four major crowd counting datasets and our method achieves superior performance to state-of-the-art methods while with much less parameters.
引用
收藏
页码:757 / 773
页数:17
相关论文
共 39 条
  • [1] [Anonymous], 2017, PYTORCH TENSORS DYNA
  • [2] [Anonymous], 2017, CORR
  • [3] Babu Sam D., 2017, CVPR
  • [4] CrowdNet: A Deep Convolutional Network for Dense Crowd Counting
    Boominathan, Lokesh
    Kruthiventi, Srinivas S. S.
    Babu, R. Venkatesh
    [J]. MM'16: PROCEEDINGS OF THE 2016 ACM MULTIMEDIA CONFERENCE, 2016, : 640 - 644
  • [5] Privacy preserving crowd monitoring: Counting people without people models or tracking
    Chan, Antoni B.
    Liang, Zhang-Sheng John
    Vasconcelos, Nuno
    [J]. 2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12, 2008, : 1766 - 1772
  • [6] Bayesian Poisson Regression for Crowd Counting
    Chan, Antoni B.
    Vasconcelos, Nuno
    [J]. 2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009, : 545 - 551
  • [7] Feature Mining for Localised Crowd Counting
    Chen, Ke
    Loy, Chen Change
    Gong, Shaogang
    Xiang, Tao
    [J]. PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2012, 2012,
  • [8] Histograms of oriented gradients for human detection
    Dalal, N
    Triggs, B
    [J]. 2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, : 886 - 893
  • [9] Pedestrian Detection: An Evaluation of the State of the Art
    Dollar, Piotr
    Wojek, Christian
    Schiele, Bernt
    Perona, Pietro
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (04) : 743 - 761
  • [10] Ge Weina, 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), P2913, DOI 10.1109/CVPRW.2009.5206621