Crowd Counting based on Multi-level Multi-scale Feature

被引:0
作者
Di Wu
Zheyi Fan
Shuhan Yi
机构
[1] School of Information and Electronics,
[2] Beijing Institute of Technology,undefined
来源
Applied Intelligence | 2023年 / 53卷
关键词
Crowd counting; Multi-scale; Dilated convolution;
D O I
暂无
中图分类号
学科分类号
摘要
Crowd counting has drawn more and more attention for its significance in reality application. However, it’s still a challenging task because of scale variation in images. In this paper, we propose a model to extract and refine features with abundant scale-relevant information, which consists of Multi-layer Multi-scale Feature Extraction Network (MLMS) and Dependency-based Feature Fusion Network (DFF). MLMS plays a role as feature extractor. Three multi-scale feature extraction modules (MSFE) are designed with dilated convolution layers and inserted in different levels of MLMS, which improve the ability for multi-scale feature extraction. DFF plays a role as feature refiner. DFF explores the dependency between hierarchical features. It’s the first time in crowd counting to use Long-short term memory (LSTM) to filter information and fuse the features with the assistance of the dependency. Our model provides new ideas for solving scale-relevant problems from two angels: scale feature extraction and fusion. In this way, our model extracts scale-relevant features and refines the features further. Experiments on four challenging datasets ShanghaiTech Part A/B, UCF_QNRF and UCF_CC_50, getting Mean Absolute Error (MAE) 65.3/8.3/113.2/216.3, demonstrate the effectiveness of the proposed model.
引用
收藏
页码:21891 / 21901
页数:10
相关论文
共 44 条
[1]  
Wang Q(2021)Pixel-wise crowd understanding via synthetic data Int J Comput Vis 129 225-245
[2]  
Gao J(2020)Multi-level feature fusion based locality-constrained spatial transformer network for video crowd counting Neurocomputing 392 98-107
[3]  
Lin W(2021)Decoupled two-stage crowd counting and beyond IEEE Trans Image Process 30 2862-2875
[4]  
Yuan Y(2021)Interlayer and intralayer scale aggregation for scale-invariant crowd counting Neurocomputing 441 128-137
[5]  
Fang Y(2008)Pedestrian detection via classification on riemannian manifolds IEEE Trans Pattern Anal Mach Intell 30 1713-1727
[6]  
Gao S(2009)Object detection with discriminatively trained part-based models IEEE Trans Pattern Anal Mach Intell 32 1627-1645
[7]  
Li J(2001)Estimation of number of people in crowded scenes using perspective transformation IEEE Trans Syst Man Cybern-Part A Syst Hum 31 645-654
[8]  
Luo W(2002)Multiresolution gray-scale and rotation invariant texture classification with local binary patterns IEEE Trans Patt Anal Mach Intell 24 971-987
[9]  
He L(2018)Multiscale multitask deep netvlad for crowd counting IEEE Trans Ind Inform 14 4953-4962
[10]  
Hu B(2019)Ha-ccn: Hierarchical attention-based crowd counting network IEEE Trans Image Process 29 323-335