MULTI-STEP QUANTIZATION OF A MULTI-SCALE NETWORK FOR CROWD COUNTING

被引：0

作者：

Shim, Kyujin ^{[1
]}

Byun, Junyoung ^{[1
]}

Kim, Changick ^{[1
]}

机构：

[1] Korea Adv Inst Sci & Technol, Sch Elect Engn, Daejeon, South Korea

来源：

2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) | 2020年

基金：

新加坡国家研究基金会;

关键词：

Crowd counting; Crowd density estimation; Quantization;

D O I：

10.1109/icip40778.2020.9190692

中图分类号：

TB8 [摄影技术];

学科分类号：

0804 ;

摘要：

Crowd counting is one of the most important tasks in visual surveillance applications since it provides useful information such as the number of crowds and their distribution. However, it is very challenging due to severe occlusions, large geometrical deformations, and high visual clutter. To tackle this problem, we propose a novel CNN-based crowd density estimation network consisting of a backbone, decoder, and mapper, and also a multi-step quantization scheme to train the network more effectively. As a backbone network, ResNet is adopted, then the decoder and mapper are added to deal with multi-scale problems of crowd counting and to generate high-resolution density maps. Finally, a multi-step quantization scheme discretizes the continuous space of both predictions and ground truth density maps, and it reduces the search scope of the network and raises their matching ratio. As a result, our method outperforms recent methods in four major datasets.

引用

页码：683 / 687

页数：5

共 24 条

[1]

[Anonymous], 2012, COURSERA

[2]

[Anonymous], 2010, P 27 INT C MACH LEAR, DOI 10.5555/3104322.3104425

[3]

[Anonymous], 2016, Residual networks behave like ensembles of relatively shallow networks

[4]

Bengio Y., 2013, CoRR abs/1308.3432

[5] Scale Aggregation Network for Accurate and Efficient Crowd Counting [J].

Cao, Xinkun ;

Wang, Zhipeng ;

Zhao, Yanyun ;

Su, Fei .

COMPUTER VISION - ECCV 2018, PT V, 2018, 11209 :757-773

[6] DADNet: Dilated-Attention-Deformable ConvNet for Crowd Counting [J].

Guo, Dan ;

Li, Kun ;

Zha, Zheng-Jun ;

Wang, Meng .

PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, :1823-1832

[7] Deep Residual Learning for Image Recognition [J].

He, Kaiming ;

Zhang, Xiangyu ;

Ren, Shaoqing ;

Sun, Jian .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778

[8]

Hubara I, 2016, ADV NEUR IN, V29

[9] Composition Loss for Counting, Density Map Estimation and Localization in Dense Crowds [J].

Idrees, Haroon ;

Tayyab, Muhmmad ;

Athrey, Kishan ;

Zhang, Dong ;

Al-Maadeed, Somaya ;

Rajpoot, Nasir ;

Shah, Mubarak .

COMPUTER VISION - ECCV 2018, PT II, 2018, 11206 :544-559

[10] Multi-Source Multi-Scale Counting in Extremely Dense Crowd Images [J].

Idrees, Haroon ;

Saleemi, Imran ;

Seibert, Cody ;

Shah, Mubarak .

2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, :2547-2554

← 1 2 3 →