Denstity Level Aware Network for Crowd Counting

被引:0
作者
Zhong, Wencai [1 ,2 ]
Wang, Wei [1 ]
Lu, Hongtao [1 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai, Peoples R China
[2] Alibaba Grp, Hangzhou, Peoples R China
来源
NEURAL INFORMATION PROCESSING, ICONIP 2020, PT I | 2020年 / 12532卷
关键词
Crowd counting; Denstity map estimation; Multi-level supervision; ATTENTION;
D O I
10.1007/978-3-030-63830-6_23
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Crowd counting has wide applications in video surveillance and public safety, while it remains an extremely challenging task due to large scale variation and diverse crowd distributions. In this paper, we present a novel method called Density Level Aware Network (DLA-Net) to improve the density map estimation in varying density scenes. Specifically, we divide the input into multiple regions according to their density levels and handle the regions independently. Dense regions (with small scale heads) require higher resolution features from shallow layers, while sparse regions (with large heads) need deep features with broader receptive filed. Based on this requirement, we propose to predict multiple density maps focusing on regions of varying density levels correspondingly. Inspired by the U-Net architecture, our density map estimators borrow features of shallow layers to improve the estimation of dense regions. Moreover, we design a Density Level Aware Loss (DLA-Loss) to better supervise those density maps in different regions. We conduct extensive experiments on three crowd counting datasets (ShanghaiTech, UCF-CC-50 and UCF-QNRF) to validate the effectiveness of the proposed method. The results demonstrate that our DLA-Net achieves the best performance compared with other state-of-the-art approaches.
引用
收藏
页码:266 / 277
页数:12
相关论文
共 21 条
[1]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[2]  
Ding XH, 2018, 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), P1942, DOI 10.1109/ICASSP.2018.8461772
[3]   Pay Attention to Deep Feature Fusion in Crowd Density Estimation [J].
Guo, Huimin ;
He, Fujin ;
Cheng, Xin ;
Ding, Xinghao ;
Huang, Yue .
NEURAL INFORMATION PROCESSING (ICONIP 2019), PT IV, 2019, 1142 :363-370
[4]   Composition Loss for Counting, Density Map Estimation and Localization in Dense Crowds [J].
Idrees, Haroon ;
Tayyab, Muhmmad ;
Athrey, Kishan ;
Zhang, Dong ;
Al-Maadeed, Somaya ;
Rajpoot, Nasir ;
Shah, Mubarak .
COMPUTER VISION - ECCV 2018, PT II, 2018, 11206 :544-559
[5]   Multi-Source Multi-Scale Counting in Extremely Dense Crowd Images [J].
Idrees, Haroon ;
Saleemi, Imran ;
Seibert, Cody ;
Shah, Mubarak .
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, :2547-2554
[6]   Crowd Counting and Density Estimation by Trellis Encoder-Decoder Networks [J].
Jiang, Xiaolong ;
Xiao, Zehao ;
Zhang, Baochang ;
Zhen, Xiantong ;
Cao, Xianbin ;
Doermann, David ;
Shao, Ling .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :6126-6135
[7]  
Lempitsky V., 2010, Adv. Neural Inf. Process. Syst, V23
[8]   CSRNet: Dilated Convolutional Neural Networks for Understanding the Highly Congested Scenes [J].
Li, Yuhong ;
Zhang, Xiaofan ;
Chen, Deming .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :1091-1100
[9]   ADCrowdNet: An Attention-Injective Deformable Convolutional Network for Crowd Understanding [J].
Liu, Ning ;
Long, Yongchao ;
Zou, Changqing ;
Niu, Qun ;
Pan, Li ;
Wu, Hefeng .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :3220-3229
[10]  
Marsden M., 2016, arXiv