Learning a deep network with cross-hierarchy aggregation for crowd counting

被引:21
作者
Guo, Qiang [1 ]
Zeng, Xin [1 ]
Hu, Shizhe [1 ]
Phoummixay, Sonephet [1 ]
Ye, Yangdong [1 ]
机构
[1] Zhengzhou Univ, Sch Informat Engn, Zhengzhou 450001, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Crowd counting; Cross-hierarchy aggregation; Density maps; DENSITY-ESTIMATION; FUSION;
D O I
10.1016/j.knosys.2020.106691
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Crowd counting, a significant but challenging task in computer vision, aims at estimating the number of people in an image or video. Recent methods for crowd counting have obtained promising performance due to deep neural networks but most of them ignore the abundant conducive information in hierarchical features. In this paper, a novel Cross-Hierarchy Aggregation Network (CHANet) is proposed to exploit multi-hierarchy information in the crowd features from each hierarchy and aggregate cross-hierarchy features to generate reasonable density maps. Firstly, we propose a CHA module to fully extract local hierarchical features and capture maximum information of the crowd features. The CHA module combines residual and dense connections without over-assigning parameters for feature reuse. Then, we utilize the global hierarchical features from the shallow hierarchies to obtain a more powerful representation ability with a global residual connection. Experimental evaluations on four publicly available crowd counting datasets (ShanghaiTech, UCF-QNRF, WorldExpo'10, and Beijing BRT) demonstrate that the proposed CHANet achieves superior performance compared to other state-of-the-art methods. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:9
相关论文
共 51 条
[1]   People detection and articulated pose estimation framework for crowded scenes [J].
Alyammahi, Sohailah ;
Bhaskar, Harish ;
Ruta, Dymitr ;
Al-Mualla, Mohammed .
KNOWLEDGE-BASED SYSTEMS, 2017, 131 :83-104
[2]  
[Anonymous], 2010, NIPS
[3]   Scale Aggregation Network for Accurate and Efficient Crowd Counting [J].
Cao, Xinkun ;
Wang, Zhipeng ;
Zhao, Yanyun ;
Su, Fei .
COMPUTER VISION - ECCV 2018, PT V, 2018, 11209 :757-773
[4]   Improving the Learning of Multi-column Convolutional Neural Network for Crowd Counting [J].
Cheng, Zhi-Qi ;
Li, Jun-Xiu ;
Dai, Qi ;
Wu, Xiao ;
He, Jun-Yan ;
Hauptmann, Alexander G. .
PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, :1897-1906
[5]   An Aggregated Multicolumn Dilated Convolution Network for Perspective-Free Counting [J].
Deb, Diptodip ;
Ventura, Jonathan .
PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, :308-317
[6]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[7]   Crowd Density Estimation Using Fusion of Multi-Layer Features [J].
Ding, Xinghao ;
He, Fujin ;
Lin, Zhirui ;
Wang, Yu ;
Guo, Huimin ;
Huang, Yue .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (08) :4776-4787
[8]  
Ding XH, 2018, 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), P1942, DOI 10.1109/ICASSP.2018.8461772
[9]   Fast crowd density estimation with convolutional neural networks [J].
Fu, Min ;
Xu, Pei ;
Li, Xudong ;
Liu, Qihe ;
Ye, Mao ;
Zhu, Ce .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2015, 43 :81-88
[10]   Fusion of multispectral data through illumination-aware deep neural networks for pedestrian detection [J].
Guan, Dayan ;
Cao, Yanpeng ;
Yang, Jiangxin ;
Cao, Yanlong ;
Yang, Michael Ying .
INFORMATION FUSION, 2019, 50 :148-157