Second-Order Convolutional Network for Crowd Counting

Cited by: 79

Authors:

Wang, Luyang [1]

Zhai, Qiang [1]

Yin, Baoqun [1]

Bilal, Hazrat [1]

Affiliations:

[1] Univ Sci & Technol China, Sch Informat Sci & Technol, Hefei 230027, Anhui, Peoples R China

Source:

FOURTH INTERNATIONAL WORKSHOP ON PATTERN RECOGNITION | 2019 / Vol. 11198

Keywords:

Crowd Counting; Computer Vision; Second-order CNN; Context Attention Module;

DOI:

10.1117/12.2540362

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Single-image crowd counting remains challenging, primarily because of large scale variations, perspective distortion, and non-uniform crowd distribution. In this paper, we propose a novel architecture, referred to as the Second-Order Convolutional Network (SOCN), that addresses this task by improving the feature transformation capability of the network. The proposed SOCN uses a convolutional neural network as its backbone. We introduce three cascaded second-order blocks behind the backbone to augment the family of transformation operations and increase the nonlinearity of the network, allowing it to extract multi-scale and discriminative features. Furthermore, we design a context attention module (CAM), which includes dilated convolutions, to assign weights to the score map of each second-order block so that the features that contribute to counting are highlighted. We conduct various experiments on the ShanghaiTech and UCF_CC_50 datasets, and the results demonstrate the effectiveness of our method.
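The abstract states that the CAM assigns weights to the score map of each second-order block but does not say how the weighted maps are combined. The following is a minimal sketch of one plausible reading, not the paper's implementation: it assumes the weights are softmax-normalised across the three blocks at each pixel and that the weighted maps are summed. The function names (`fuse_score_maps`, `estimated_count`) and the toy values are hypothetical, and summing a density map to obtain a count is a common crowd-counting convention that is assumed here.

```python
import math


def softmax(values):
    """Numerically stable softmax over a list of floats."""
    peak = max(values)
    exps = [math.exp(v - peak) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def fuse_score_maps(score_maps, attention_logits):
    """Fuse per-block score maps with per-pixel attention weights.

    Both arguments are lists (one entry per block) of equally sized
    2D lists. ASSUMPTION: weights are softmax-normalised across blocks
    at each pixel; the abstract does not specify the normalisation.
    """
    rows, cols = len(score_maps[0]), len(score_maps[0][0])
    fused = [[0.0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            weights = softmax([logits[r][c] for logits in attention_logits])
            fused[r][c] = sum(w * s[r][c] for w, s in zip(weights, score_maps))
    return fused


def estimated_count(density_map):
    """Sum a density map into a scalar count (assumed convention)."""
    return sum(sum(row) for row in density_map)


# Toy 2x2 score maps standing in for the outputs of three blocks.
score_maps = [
    [[1.0, 2.0], [3.0, 4.0]],
    [[2.0, 2.0], [2.0, 2.0]],
    [[0.0, 1.0], [0.0, 1.0]],
]
# Equal logits give equal weights, so the fused map is the per-pixel mean.
attention_logits = [[[0.0, 0.0], [0.0, 0.0]] for _ in range(3)]

fused = fuse_score_maps(score_maps, attention_logits)
count = estimated_count(fused)
```

With equal logits the fusion reduces to a plain average of the three maps; raising the logits of one block at a pixel shifts the fused value at that pixel towards that block's score, which is the sense in which attention can highlight the features that matter for counting.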

References

Pages: 6

18 entries in total

[1] [Anonymous], P 12 IEEE INT C COMP

[2] [Anonymous], P 3 INT C LEARNING R

[3] [Anonymous], P ICCV

[4] [Anonymous], P AAAI

[5] [Anonymous], P ICCV

[6] Deep Residual Learning for Image Recognition [J].
He, Kaiming; Zhang, Xiangyu; Ren, Shaoqing; Sun, Jian.
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016: 770-778

[7] Multi-Source Multi-Scale Counting in Extremely Dense Crowd Images [J].
Idrees, Haroon; Saleemi, Imran; Seibert, Cody; Shah, Mubarak.
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013: 2547-2554

[8] Kingma J., 2015, INT C LEARNING REPRE

[9] Lempitsky V., 2010, Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems, P1324, DOI 10.5555/2997189.2997337

[10] CSRNet: Dilated Convolutional Neural Networks for Understanding the Highly Congested Scenes [J].
Li, Yuhong; Zhang, Xiaofan; Chen, Deming.
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018: 1091-1100
