MSFFA: a multi-scale feature fusion and attention mechanism network for crowd counting

Cited by: 0
Authors
Zhaoxin Li
Shuhua Lu
Yishan Dong
Jingyuan Guo
Affiliation
[1] People’s Public Security University of China, College of Information and Cyber Security
Source
The Visual Computer | 2023 / Vol. 39
Keywords
Crowd counting; Multi-scale; Attention mechanism; Mixed loss function
DOI
Not available
Abstract
Crowd counting has become an increasingly active topic in the computer vision community in recent years owing to its extensive applications in public safety and commercial planning. However, it remains a challenging task in realistic scenes because of large scale variations and complex background interference. In this paper, we propose an efficient end-to-end Multi-Scale Feature Fusion and Attention mechanism CNN, named MSFFA. The network consists of three parts: a front-end low-level feature extractor, a mid-end multi-scale feature fusion operator, and a back-end density map generator. Most significantly, in the mid-end we stack three MSFF blocks with a residual connection, which on the one hand lets the network capture large-scale continuous variations and on the other hand enhances information transmission. Meanwhile, a global attention mechanism module is employed to extract effective features in scenes with complex backgrounds. Our method is evaluated on three public datasets: ShanghaiTech, UCF-QNRF, and UCF_CC_50. Experimental results show that our method outperforms some existing advanced approaches, indicating its accuracy and stability.
Pages: 1045-1056
Number of pages: 11