Crowd counting via Multi-Scale Adversarial Convolutional Neural Networks

被引:3
作者
Zhu, Liping [1 ]
Zhang, Hong [1 ]
Ali, Sikandar [1 ]
Yang, Baoli [1 ]
Li, Chengyang [1 ]
机构
[1] China Univ Petr, Beijing Key Lab Petr Data Min, Beijing 10224, Peoples R China
关键词
Crowd counting; Multi-Scale; Crowd density estimation; Density map;
D O I
10.1515/jisys-2019-0157
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The purpose of crowd counting is to estimate the number of pedestrians in crowd images. Crowd counting or density estimation is an extremely challenging task in computer vision, due to large scale variations and dense scene. Current methods solve these issues by compounding multi-scale Convolutional Neural Network with different receptive fields. In this paper, a novel end-to-end architecture based on Multi-Scale Adversarial Convolutional Neural Network (MSA-CNN) is proposed to generate crowd density and estimate the amount of crowd. Firstly, a multi-scale network is used to extract the globally relevant features in the crowd image, and then fractionally-strided convolutional layers are designed for up-sampling the output to recover the loss of crucial details caused by the earlier max pooling layers. An adversarial loss is directly employed to shrink the estimated value into the realistic subspace to reduce the blurring effect of density estimation. Joint training is performed in an end-to-end fashion using a combination of Adversarial loss and Euclidean loss. The two losses are integrated via a joint training scheme to improve density estimation performance. We conduct some extensive experiments on available datasets to show the significant improvements and supremacy of the proposed approach over the available state-of-the-art approaches.
引用
收藏
页码:180 / 191
页数:12
相关论文
共 35 条
  • [1] [Anonymous], 2009, P INT C PATT REC
  • [2] Bansal A., 2015, COMPUTER SCI
  • [3] Boesen-Lindbo-Larsen A., 2015, INT C MACH LEARN
  • [4] CrowdNet: A Deep Convolutional Network for Dense Crowd Counting
    Boominathan, Lokesh
    Kruthiventi, Srinivas S. S.
    Babu, R. Venkatesh
    [J]. MM'16: PROCEEDINGS OF THE 2016 ACM MULTIMEDIA CONFERENCE, 2016, : 640 - 644
  • [5] Feature Mining for Localised Crowd Counting
    Chen, Ke
    Loy, Chen Change
    Gong, Shaogang
    Xiang, Tao
    [J]. PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2012, 2012,
  • [6] Ciresan D, 2012, PROC CVPR IEEE, P3642, DOI 10.1109/CVPR.2012.6248110
  • [7] Dan K., 2005, BRIT MACH VIS C
  • [8] Generative Adversarial Networks
    Goodfellow, Ian
    Pouget-Abadie, Jean
    Mirza, Mehdi
    Xu, Bing
    Warde-Farley, David
    Ozair, Sherjil
    Courville, Aaron
    Bengio, Yoshua
    [J]. COMMUNICATIONS OF THE ACM, 2020, 63 (11) : 139 - 144
  • [9] Drone-based Object Counting by Spatially Regularized Regional Proposal Network
    Hsieh, Meng-Ru
    Lin, Yen-Liang
    Hsu, Winston H.
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 4165 - 4173
  • [10] Multi-Source Multi-Scale Counting in Extremely Dense Crowd Images
    Idrees, Haroon
    Saleemi, Imran
    Seibert, Cody
    Shah, Mubarak
    [J]. 2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 2547 - 2554