DSPNet: Deep scale purifier network for dense crowd counting

被引:49
作者
Zeng, Xin [1 ]
Wu, Yunpeng [1 ]
Hu, Shizhe [1 ]
Wang, Ruobin [1 ]
Ye, Yangdong [1 ]
机构
[1] Zhengzhou Univ, Sch Informat Engn, Zhengzhou 450001, Henan, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Crowd counting; Density map estimation; Convolutional neural network; Deep learning;
D O I
10.1016/j.eswa.2019.112977
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Crowd counting has produced considerable concern in recent years. However, crowd counting in highly congested scenes is a challenging problem owing to scale variation. To remedy this issue, we propose a novel deep scale purifier network (DSPNet) that can encode multiscale features and reduce the loss of contextual information for dense crowd counting. Our proposed method has two strong points. First, the DSPNet model consists of a frontend and a backend. The frontend is a conventional deep convolutional neural network, while the unified deep neural network backend adopts a "maximal ratio combining" strategy to learn complementary scale information at different levels. The scale purifier module, which improves scale representations, can effectively fuse multiscale features. Second, DSPNet performs the whole RGB image-based inference to facilitate model learning and decrease contextual information loss. Our customized network is end-to-end and has a fully convolutional architecture. We demonstrate the generalization ability of our approach by cross-scene evaluation. Extensive experiments on three publicly available crowd counting benchmarks (i.e., UCF-QNRF, ShanghaiTech, and UCF_CC_50 datasets) show that our DSPNet delivers superior performance against state-of-the-art methods. (C) 2019 Elsevier Ltd. All rights reserved.
引用
收藏
页数:10
相关论文
共 47 条
  • [1] [Anonymous], IEEE T PATTERN ANAL
  • [2] [Anonymous], P IEEE C COMP VIS PA
  • [3] [Anonymous], TECHNICAL REPORT
  • [4] [Anonymous], ADV VIDEO SIGNAL BAS
  • [5] [Anonymous], 2015, PROC CVPR IEEE
  • [6] [Anonymous], 2017, ADV NEURAL INFORM PR
  • [7] [Anonymous], ADV NEURAL INFORM PR
  • [8] [Anonymous], 2018, P IEEE C COMP VIS PA
  • [9] [Anonymous], 2010, P ICML 2010 P 27 INT
  • [10] A novel video based system for detecting and counting vehicles at user-defined virtual loops
    Barcellos, Pablo
    Bouvie, Christiano
    Escouto, Fabiano Lopes
    Scharcanski, Jacob
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (04) : 1845 - 1856