SPCANet: congested crowd counting via strip pooling combined attention network

Citations: 0
Authors
Yuan, Zhongyuan [1]
Affiliations
[1] Hunan Agr Univ, Coll Informat & Intelligence, Changsha, Hunan, Peoples R China
Keywords
Crowd counting; Convolutional neural network; Spatial pooling; Channel attention
DOI
10.7717/peerj-cs.2273
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Crowd counting aims to estimate the number and distribution of people in crowded places and is an important research direction in object counting. It is widely used in public place management, crowd behavior analysis, and other scenarios, which demonstrates its practical value. In recent years, crowd-counting technology has developed rapidly. However, in highly crowded and noisy scenes, the counting accuracy of most models is still seriously affected by perspective distortion, dense occlusion, and inconsistent crowd distribution. Perspective distortion causes crowds to appear in different sizes and shapes in the image, while dense occlusion and inconsistent crowd distribution prevent parts of the crowd from being captured completely. As a result, the model captures spatial information imperfectly. To solve these problems, we propose a strip pooling combined attention network (SPCANet) based on normed-deformable convolution (NDConv). By introducing strip pooling, we model long-distance dependencies more efficiently. In contrast to traditional square-kernel pooling, strip pooling uses long, narrow kernels (1xN or Nx1) to deal with dense crowds, mutual occlusion, and overlap. SPCANet also introduces efficient channel attention (ECA), a mechanism that learns channel attention through a local cross-channel interaction strategy. This module generates channel attention with a fast 1D convolution, reducing model complexity while improving performance as much as possible. Extensive experiments were conducted on four mainstream datasets, Shanghai Tech Part A, Shanghai Tech Part B, UCF-QNRF, and UCF CC 50. The mean absolute error (MAE) improves on the baseline, reaching 60.9, 7.3, 90.8, and 161.1, respectively, which validates the effectiveness of SPCANet. Meanwhile, the mean squared error (MSE) decreases by 5.7% on average over the four datasets, and robustness is greatly improved.
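The two mechanisms named in the abstract can be illustrated with a minimal, dependency-free Python sketch. It is not the authors' implementation: the function names `strip_pool_context` and `eca` are hypothetical, the learned convolutions that normally follow each strip pooling branch are omitted, and the 1D kernel weights are fixed illustrative values rather than learned parameters.

```python
import math


def strip_pool_context(feature):
    """Pool a 2D map with a 1xN strip (per row) and an Nx1 strip (per column).

    Each output cell is the sum of its row mean and its column mean, so it
    sees every cell in the same row and column (long-range context).
    Assumption: the learned convolutions of a real strip pooling module
    are left out of this sketch.
    """
    h, w = len(feature), len(feature[0])
    row_means = [sum(row) / w for row in feature]
    col_means = [sum(feature[i][j] for i in range(h)) / h for j in range(w)]
    return [[row_means[i] + col_means[j] for j in range(w)] for i in range(h)]


def eca(channels, kernel):
    """Channel attention from a 1D convolution over pooled channel values.

    channels: list of C two-dimensional maps (lists of rows).
    kernel:   odd-length list of 1D weights (hypothetical fixed values;
              a trained model would learn them).
    Returns (rescaled_channels, attention_weights).
    """
    c = len(channels)
    # Global average pooling: one scalar per channel.
    pooled = [sum(map(sum, ch)) / (len(ch) * len(ch[0])) for ch in channels]
    pad = len(kernel) // 2
    weights = []
    for i in range(c):
        acc = 0.0
        for k, wk in enumerate(kernel):
            j = i + k - pad
            if 0 <= j < c:  # zero padding at the channel borders
                acc += wk * pooled[j]
        weights.append(1.0 / (1.0 + math.exp(-acc)))  # sigmoid gate
    scaled = [[[v * weights[i] for v in row] for row in ch]
              for i, ch in enumerate(channels)]
    return scaled, weights


if __name__ == "__main__":
    print(strip_pool_context([[1.0, 2.0], [3.0, 4.0]]))
    print(eca([[[2.0]], [[4.0]]], [0.0, 1.0, 0.0])[1])
```

Because each channel weight depends only on its `len(kernel)` nearest neighbouring channels, the attention step costs a handful of multiplications per channel, which is the sense in which a 1D convolution keeps the module lightweight.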
Pages: 21
Related papers
47 in total
  • [1] Modeling framework for optimal evacuation of large-scale crowded pedestrian facilities
    Abdelghany, Ahmed
    Abdelghany, Khaled
    Mahmassani, Hani
    Alhalabi, Wael
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2014, 237 (03) : 1105 - 1118
  • [2] Almeida JE, 2013, arXiv, DOI 10.48550/arXiv.1303.4692
  • [3] Scale Aggregation Network for Accurate and Efficient Crowd Counting
    Cao, Xinkun
    Wang, Zhipeng
    Zhao, Yanyun
    Su, Fei
    [J]. COMPUTER VISION - ECCV 2018, PT V, 2018, 11209 : 757 - 773
  • [4] Privacy preserving crowd monitoring: Counting people without people models or tracking
    Chan, Antoni B.
    Liang, Zhang-Sheng John
    Vasconcelos, Nuno
    [J]. 2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12, 2008 : 1766 - 1772
  • [5] Counting People With Low-Level Features and Bayesian Regression
    Chan, Antoni B.
    Vasconcelos, Nuno
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2012, 21 (04) : 2160 - 2177
  • [6] Bayesian Poisson Regression for Crowd Counting
    Chan, Antoni B.
    Vasconcelos, Nuno
    [J]. 2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009 : 545 - 551
  • [7] Feature Mining for Localised Crowd Counting
    Chen, Ke
    Loy, Chen Change
    Gong, Shaogang
    Xiang, Tao
    [J]. PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2012, 2012
  • [8] Cumulative Attribute Space for Age and Crowd Density Estimation
    Chen, Ke
    Gong, Shaogang
    Xiang, Tao
    Loy, Chen Change
    [J]. 2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013 : 2467 - 2474
  • [9] Chen S, 2015, PROC CVPR IEEE, P1364, DOI 10.1109/CVPR.2015.7298742
  • [10] Rethinking Spatial Invariance of Convolutional Networks for Object Counting
    Cheng, Zhi-Qi
    Dai, Qi
    Li, Hong
    Song, Jingkuan
    Wu, Xiao
    Hauptmann, Alexander G.
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022 : 19606 - 19616