Second-Order Convolutional Network for Crowd Counting

Cited by: 79
Authors
Wang, Luyang [1]
Zhai, Qiang [1]
Yin, Baoqun [1]
Bilal, Hazrat [1]
Affiliations
[1] Univ Sci & Technol China, Sch Informat Sci & Technol, Hefei 230027, Anhui, Peoples R China
Source
FOURTH INTERNATIONAL WORKSHOP ON PATTERN RECOGNITION | 2019 / Vol. 11198
Keywords
Crowd Counting; Computer Vision; Second-order CNN; Context Attention Module;
DOI
10.1117/12.2540362
Chinese Library Classification number
TP18 [Artificial Intelligence Theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Single-image crowd counting remains challenging, primarily because of large scale variations, perspective distortion, and non-uniform crowd distribution. In this paper, we propose a novel architecture, referred to as the Second-Order Convolutional Network (SOCN), that addresses this task by improving the feature transformation capability of the network. The proposed SOCN uses a convolutional neural network as its backbone. We introduce three cascaded second-order blocks after the backbone to augment the family of transformation operations and increase the nonlinearity of the network, allowing it to extract multi-scale, discriminative features. Furthermore, we design a context attention module (CAM) that includes dilated convolutions and assigns weights to the score map of each second-order block, so that the features that contribute to counting are highlighted. We conduct various experiments on the ShanghaiTech(1) and UCF_CC_50(2) datasets, and the results demonstrate the effectiveness of our method.
Pages: 6