JMFEEL-Net: a joint multi-scale feature enhancement and lightweight transformer network for crowd counting

Cited by: 2
Authors
Wang, Mingtao [1]
Zhou, Xin [1]
Chen, Yuanyuan [1]
Affiliations
[1] Sichuan Univ, Coll Comp Sci, Chengdu 610065, Sichuan, Peoples R China
Keywords
Crowd counting; Count estimation; Multi-scale variations; Multi-density map supervision; PEOPLE; SCALE; MODEL;
DOI
10.1007/s10115-023-02056-5
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Crowd counting based on convolutional neural networks (CNNs) has made significant progress in recent years. However, the limited receptive field of CNNs makes it challenging to capture global features for comprehensive contextual modeling, resulting in insufficient accuracy in count estimation. In comparison, vision transformer (ViT)-based counting networks have demonstrated remarkable performance by exploiting their powerful global contextual modeling capabilities. However, ViT models are associated with higher computational costs and training difficulty. In this paper, we propose a novel network named JMFEEL-Net, which utilizes joint multi-scale feature enhancement and lightweight transformer to improve crowd counting accuracy. Specifically, we use a high-resolution CNN as the backbone network to generate high-resolution feature maps. In the backend network, we propose a multi-scale feature enhancement module to address the problem of low recognition accuracy caused by multi-scale variations, especially when counting small-scale objects in dense scenes. Furthermore, we introduce an improved lightweight ViT encoder to effectively model complex global contexts. We also adopt a multi-density map supervision strategy to learn crowd distribution features from feature maps of different resolutions, thereby improving the quality and training efficiency of the density maps. To validate the effectiveness of the proposed method, we conduct extensive experiments on four challenging datasets, namely ShanghaiTech Part A/B, UCF-QNRF, and JHU-Crowd++, achieving very competitive counting performance.
Pages: 3033-3053
Number of pages: 21
Related papers
50 in total
  • [21] Multi-Scale Guided Attention Network for Crowd Counting
    Li, Pengfei
    Zhang, Min
    Wan, Jian
    Jiang, Ming
    SCIENTIFIC PROGRAMMING, 2021, 2021
  • [22] Multi-scale Attention Recalibration Network for crowd counting
    Xie, Jinyang
    Pang, Chen
    Zheng, Yanjun
    Li, Liang
    Lyu, Chen
    Lyu, Lei
    Liu, Hong
    APPLIED SOFT COMPUTING, 2022, 117
  • [23] STOCHASTIC MULTI-SCALE AGGREGATION NETWORK FOR CROWD COUNTING
    Wang, Mingjie
    Cai, Hao
    Zhou, Jun
    Gong, Minglun
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 2008 - 2012
  • [24] Crowd Counting based on Multi-level Multi-scale Feature
    Wu, Di
    Fan, Zheyi
    Yi, Shuhan
    APPLIED INTELLIGENCE, 2023, 53 (19) : 21891 - 21901
  • [25] A fish counting model based on pyramid vision transformer with multi-scale feature enhancement
    Xin, Jiaming
    Wang, Yiying
    Li, Dashe
    Xiang, Zhongliang
    ECOLOGICAL INFORMATICS, 2025, 86
  • [26] MULTI-STEP QUANTIZATION OF A MULTI-SCALE NETWORK FOR CROWD COUNTING
    Shim, Kyujin
    Byun, Junyoung
    Kim, Changick
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 683 - 687
  • [27] Anti-Background Interference Crowd Counting Network Based on Multi-scale Feature Fusion
    Yu, Ying
    Li, Jianfei
    Qian, Jin
    Cai, Zhen
    Zhu, Zhiliang
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2022, 35 (10) : 915 - 927
  • [28] MSR-FAN: Multi-scale residual feature-aware network for crowd counting
    Zhao, Haoyu
    Min, Weidong
    Wei, Xin
    Wang, Qi
    Fu, Qiyan
    Wei, Zitai
    IET IMAGE PROCESSING, 2021, 15 (14) : 3512 - 3521
  • [29] A Multi-Scale Feature Fusion Network With Cascaded Supervision for Cross-Scene Crowd Counting
    Zhang, Xinfeng
    Han, Lina
    Shan, Wencong
    Wang, Xiaohu
    Chen, Shuhan
    Zhu, Congcong
    Li, Bin
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [30] Crowd Counting Method Based on Multi-Scale Enhanced Network
    Xu Tao
    Duan Yinong
    Du Jiahao
    Liu Caihua
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2021, 43 (06) : 1764 - 1771