PCC Net: Perspective Crowd Counting via Spatial Convolutional Network

被引：198

作者：

Gao, Junyu ^{[1
,2
]}

Wang, Qi ^{[1
,2
]}

Li, Xuelong ^{[1
,2
]}

机构：

[1] Northwestern Polytech Univ, Sch Comp Sci, Xian 710072, Peoples R China

[2] Northwestern Polytech Univ, Ctr OPT IMagery Anal & Learning OPTIMAL, Xian 710072, Peoples R China

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2020年 / 30卷 / 10期

基金：

中国国家自然科学基金;

关键词：

Estimation; Feature extraction; Image segmentation; Training; Task analysis; Head; Semantics; Crowd counting; crowd analysis; spatial convolutional network; background segmentation; multi-task learning;

D O I：

10.1109/TCSVT.2019.2919139

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Crowd counting from a single image is a challenging task due to high appearance similarity, perspective changes, and severe congestion. Many methods only focus on the local appearance features and they cannot handle the aforementioned challenges. In order to tackle them, we propose a perspective crowd counting network (PCC Net), which consists of three parts: 1) density map estimation (DME) focuses on learning very local features of density map estimation; 2) random high-level density classification (R-HDC) extracts global features to predict the coarse density labels of random patches in images; and 3) fore-/background segmentation (FBS) encodes mid-level features to segments the foreground and background. Besides, the Down, Up, Left, and Right (DULR) module is embedded in PCC Net to encode the perspective changes on four directions (DULR). The proposed PCC Net is verified on five mainstream datasets, which achieves the state-of-the-art performance on the one and attains the competitive results on the other four datasets. The source code is available at https://github.com/gjy3035/PCC-Net.

引用

页码：3486 / 3498

页数：13

共 40 条

[1]

[Anonymous], 2015, P IEEE C COMP VIS PA

[2]

[Anonymous], ARXIV161200220

[3]

[Anonymous], 2018, ARXIV180801050

[4]

[Anonymous], ARXIV180303095

[5] Cumulative Attribute Space for Age and Crowd Density Estimation [J].

Chen, Ke ;

Gong, Shaogang ;

Xiang, Tao ;

Loy, Chen Change .

2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, :2467-2474

[6] Patch-based topic model for group detection [J].

Chen, Mulin ;

Wang, Qi ;

Li, Xuelong .

SCIENCE CHINA-INFORMATION SCIENCES, 2017, 60 (11)

[7] Toward Abnormal Trajectory and Event Detection in Video Surveillance [J].

Cosar, Serhan ;

Donatiello, Giuseppe ;

Bogorny, Vania ;

Garate, Carolina ;

Alvares, Luis Otavio ;

Bremond, Francois .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2017, 27 (03) :683-695

[8] Fast crowd density estimation with convolutional neural networks [J].

Fu, Min ;

Xu, Pei ;

Li, Xudong ;

Liu, Qihe ;

Ye, Mao ;

Zhu, Ce .

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2015, 43 :81-88

[9] Fast R-CNN [J].

Girshick, Ross .

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448

[10]

He KM, 2014, LECT NOTES COMPUT SC, V8691, P346, DOI [arXiv:1406.4729, 10.1007/978-3-319-10578-9_23]

← 1 2 3 4 →