Crowd Counting based on Multi-level Multi-scale Feature

Cited by: 0

Authors:

Di Wu

Zheyi Fan

Shuhan Yi

Affiliations:

[1] School of Information and Electronics, Beijing Institute of Technology

Source:

Applied Intelligence | 2023 / Vol. 53

Keywords:

Crowd counting; Multi-scale; Dilated convolution

DOI:

Not available

Abstract:

Crowd counting has drawn increasing attention because of its significance in real-world applications. However, it remains a challenging task because of scale variation in images. In this paper, we propose a model that extracts and refines features rich in scale-relevant information. It consists of a Multi-layer Multi-scale Feature Extraction Network (MLMS) and a Dependency-based Feature Fusion Network (DFF). MLMS serves as the feature extractor: three multi-scale feature extraction (MSFE) modules, built from dilated convolution layers, are inserted at different levels of MLMS to improve its ability to extract multi-scale features. DFF serves as the feature refiner: it explores the dependency between hierarchical features. This is the first use in crowd counting of long short-term memory (LSTM) to filter information and fuse features with the assistance of that dependency. The model thus offers new ideas for solving scale-relevant problems from two angles, scale feature extraction and feature fusion: it first extracts scale-relevant features and then refines them further. Experiments on four challenging datasets, ShanghaiTech Part A, ShanghaiTech Part B, UCF_QNRF and UCF_CC_50, yield Mean Absolute Errors (MAE) of 65.3, 8.3, 113.2 and 216.3, respectively, demonstrating the effectiveness of the proposed model.
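The two technical ingredients named in the abstract, dilated convolution at several rates and the MAE metric, can be illustrated with a minimal sketch. This is not the authors' MSFE module: it is a 1-D, single-channel simplification in pure Python, and the function names, the dilation rates (1, 2, 3) and the zero padding are all assumptions made for illustration only.

```python
from typing import List, Sequence


def dilated_conv1d(signal: Sequence[float], kernel: Sequence[float],
                   dilation: int) -> List[float]:
    """Zero-padded, same-length 1-D dilated convolution (cross-correlation)."""
    k = len(kernel)
    # Distance covered by the kernel once its taps are spaced `dilation` apart.
    span = dilation * (k - 1)
    pad = span // 2
    padded = [0.0] * pad + list(signal) + [0.0] * (span - pad)
    return [
        sum(kernel[j] * padded[i + j * dilation] for j in range(k))
        for i in range(len(signal))
    ]


def multi_scale_features(signal: Sequence[float], kernel: Sequence[float],
                         dilations: Sequence[int] = (1, 2, 3)) -> List[List[float]]:
    """One output channel per dilation rate (assumed rates, not from the paper)."""
    return [dilated_conv1d(signal, kernel, d) for d in dilations]


def mean_absolute_error(predicted: Sequence[float],
                        actual: Sequence[float]) -> float:
    """MAE over per-image crowd counts: mean of |predicted - actual|."""
    return sum(abs(p - a) for p, a in zip(predicted, actual)) / len(actual)


# A single impulse shows how a larger dilation widens the receptive field
# without adding kernel weights.
impulse = [0, 0, 0, 1, 0, 0, 0]
features = multi_scale_features(impulse, kernel=[1, 1, 1])
```

With the same three-tap kernel, the response to the impulse spreads over 3, 5 and 7 positions as the dilation grows from 1 to 3, which is the property that makes stacked dilation rates useful when objects appear at different scales.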

Pages: 21891-21901

Page count: 10

