Multi-scale dilated convolution of feature Fusion Network for Crowd counting

被引:0
|
作者
Liu, Donghua [1 ]
Wang, Guodong [1 ]
Zhai, Guangtao [2 ]
机构
[1] Qingdao Univ, Coll Comp Sci & Technol, Qingdao 266071, Shandong, Peoples R China
[2] Shanghai Jiao Tong Univ, Inst Image Commun & Network Engn, Shanghai 200240, Peoples R China
关键词
Crowd counting; Convolution neural network; Dilated convolution; Feature fusion; TRACKING;
D O I
10.1007/s11042-022-13130-5
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Crowd counting has long been a challenging task due to the perspective distortion and variability in head size. The previous methods ignore the multi-scale information in images or simply use convolutions with different kernel sizes to extract multi-scale features, resulting in incomplete multi-scale features extracted. In this paper, we propose a crowd counting model called Multi-scale Dilated Convolution of Feature Fusion Network (MsDFNet) based on a CNN (convolutional neural network). Our MsDFNet is based on the regression method of the density map. The density map is predicted by the parameters learned by CNN to obtain better prediction results. The proposed network mainly includes three components, a CNN to extract low-level features, a multi-scale dilated convolution module and multi-column feature fusion blocks, a density map regression module. Multi-scale dilated convolutions are employed to extract multi-scale high-level features, and the features extracted from different columns are fused. The combination of the multi-scale dilated convolution module and the multi-column feature fusion block can effectively extract more complete multi-scale features and boost the performance of counting small-sized targets. Experiments show that the problem of various head sizes in images can be effectively solved by fusing multi-scale context feature information. We prove the effectiveness of our method on two public datasets (The ShanghaiTech dataset and the UCF_CC_50 dataset). We compare our method with the previous state-of-the-art crowd counting algorithms in terms of MAE (Mean Absolute Error) and MSE (Mean Square Error) and significantly improves the performance, especially in case of various head sizes. On the UCF_CC_50 dataset, our method reduces the MAE index by 28.6 compared with the previous state-of-the-art method. (The lower the MAE, the better the performance).
引用
收藏
页码:37939 / 37952
页数:14
相关论文
共 50 条
  • [11] Deep feature network with multi-scale fusion for highly congested crowd counting
    Yan, Leilei
    Zhang, Li
    Zheng, Xiaohan
    Li, Fanzhang
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (03) : 819 - 835
  • [12] MLANet: multi-level attention network with multi-scale feature fusion for crowd counting
    Xiong, Liyan
    Zeng, Yijuan
    Huang, Xiaohui
    Li, Zhida
    Huang, Peng
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (05): : 6591 - 6608
  • [13] A multi-scale fusion and dual attention network for crowd counting
    De Zhang
    Yiting Wang
    Xiaoping Zhou
    Liangliang Su
    Multimedia Tools and Applications, 2025, 84 (13) : 11269 - 11294
  • [14] Anti-Background Interference Crowd Counting Network Based on Multi-scale Feature Fusion
    Yu, Ying
    Li, Jianfei
    Qian, Jin
    Cai, Zhen
    Zhu, Zhiliang
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2022, 35 (10): : 915 - 927
  • [15] A Multi-Scale Feature Fusion Network With Cascaded Supervision for Cross-Scene Crowd Counting
    Zhang, Xinfeng
    Han, Lina
    Shan, Wencong
    Wang, Xiaohu
    Chen, Shuhan
    Zhu, Congcong
    Li, Bin
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [16] Multi-scale Feature Aggregation for Crowd Counting
    Jiang, Xiaoheng
    Wu, Xinyi
    Cholakkal, Hisham
    Anwer, Rao Muhammad
    Cao, Jiale
    Xu, Mingliang
    Zhou, Bing
    Pang, Yanwei
    Khan, Fahad Shahbaz
    arXiv, 2022,
  • [17] A multi-scale and multi-level feature aggregation network for crowd counting
    Zhu, Fushun
    Yan, Hua
    Chen, Xinyue
    Li, Tong
    Zhang, Zhengyu
    NEUROCOMPUTING, 2021, 423 : 46 - 56
  • [18] MSNet: Multi-scale Network for Crowd Counting
    Shi, Ying
    Sang, Jun
    Alam, Mohammad S.
    Liu, Xinyue
    Tian, Shaoli
    PATTERN RECOGNITION AND TRACKING XXXII, 2021, 11735
  • [19] Multi-scale supervised network for crowd counting
    Wang, Yongjie
    Zhang, Wei
    Huang, Dongxiao
    Liu, Yanyan
    Zhu, Jianghua
    IET IMAGE PROCESSING, 2020, 14 (17) : 4701 - 4707
  • [20] MFP-Net: Multi-scale feature pyramid network for crowd counting
    Lei, Tao
    Zhang, Dong
    Wang, Risheng
    Li, Shuying
    Zhang, Weijiang
    Nandi, Asoke K.
    IET IMAGE PROCESSING, 2021, 15 (14) : 3522 - 3533