Kernel-Based Density Map Generation for Dense Object Counting

被引:99
作者
Wan, Jia [1 ]
Wang, Qingzhong [1 ]
Chan, Antoni B. [1 ]
机构
[1] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
关键词
Crowd counting; vehicle counting; object counting; density map generation; density map estimation; deep learning; PEOPLE;
D O I
10.1109/TPAMI.2020.3022878
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Crowd counting is an essential topic in computer vision due to its practical usage in surveillance systems. The typical design of crowd counting algorithms is divided into two steps. First, the ground-truth density maps of crowd images are generated from the ground-truth dot maps (density map generation), e.g., by convolving with a Gaussian kernel. Second, deep learning models are designed to predict a density map from an input image (density map estimation). The density map based counting methods that incorporate density map as the intermediate representation have improved counting performance dramatically. However, in the sense of end-to-end training, the hand-crafted methods used for generating the density maps may not be optimal for the particular network or dataset used. To address this issue, we propose an adaptive density map generator, which takes the annotation dot map as input, and learns a density map representation for a counter. The counter and generator are trained jointly within an end-to-end framework. We also show that the proposed framework can be applied to general dense object counting tasks. Extensive experiments are conducted on 10 datasets for 3 applications: crowd counting, vehicle counting, and general object counting. The experiment results on these datasets confirm the effectiveness of the proposed learnable density map representations.
引用
收藏
页码:1357 / 1370
页数:14
相关论文
共 64 条
[31]   Leveraging Unlabeled Data for Crowd Counting by Learning to Rank [J].
Liu, Xialei ;
van de Weijer, Joost ;
Bagdanov, Andrew D. .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :7661-7669
[32]   Point in, Box out: Beyond Counting Persons in Crowds [J].
Liu, Yuting ;
Shi, Miaojing ;
Zhao, Qijun ;
Wang, Xiaofang .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :6462-6471
[33]   Bayesian Loss for Crowd Count Estimation with Point Supervision [J].
Ma, Zhiheng ;
Wei, Xing ;
Hong, Xiaopeng ;
Gong, Yihong .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :6141-6150
[34]   Fully Convolutional Crowd Counting on Highly Congested Scenes [J].
Marsden, Mark ;
McGuinness, Kevin ;
Little, Suzanne ;
O'Connor, Noel E. .
PROCEEDINGS OF THE 12TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISIGRAPP 2017), VOL 5, 2017, :27-33
[35]   A Large Contextual Dataset for Classification, Detection and Counting of Cars with Deep Learning [J].
Mundhenk, T. Nathan ;
Konjevod, Goran ;
Sakla, Wesam A. ;
Boakye, Kofi .
COMPUTER VISION - ECCV 2016, PT III, 2016, 9907 :785-800
[36]   Iterative Crowd Counting [J].
Ranjan, Viresh ;
Le, Hieu ;
Hoai, Minh .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :278-293
[37]   YOLO9000: Better, Faster, Stronger [J].
Redmon, Joseph ;
Farhadi, Ali .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6517-6525
[38]   You Only Look Once: Unified, Real-Time Object Detection [J].
Redmon, Joseph ;
Divvala, Santosh ;
Girshick, Ross ;
Farhadi, Ali .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :779-788
[39]   Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks [J].
Ren, Shaoqing ;
He, Kaiming ;
Girshick, Ross ;
Sun, Jian .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (06) :1137-1149
[40]  
Sam D. B., 2020, ARXIV190607538