Crowd counting method based on cross column fusion attention mechanism

被引：1

作者：

Cui, Xiao ^{[1
]}

Zhang, Zhi-Feng ^{[1
]}

Zheng, Qian ^{[1
]}

Cao, Jie ^{[1
]}

机构：

[1] Zhengzhou Univ Light Ind, Software Engn Coll, Zhengzhou, Peoples R China

来源：

JOURNAL OF ELECTRONIC IMAGING | 2021年 / 30卷 / 03期

基金：

中国国家自然科学基金;

关键词：

crowd counting; cross column fusion attention module; shallow convolution module; density map;

D O I：

10.1117/1.JEI.30.3.033032

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Deep learning has made substantial progress in crowd density estimation, but there are still some problems in existing methods, such as large population density, background interference, and scale change, which makes it difficult to count people. To solve the above problems, we proposed a crowd counting method based on a cross column fusion attention mechanism. First, the first ten layers of VGG16 with good migration ability and feature extraction ability are used as the front-end network to preliminarily extract human head features. Then, a cross column fusion attention module is designed. In this module, feature maps are fused across columns to make the network contain richer deep and shallow features. At the same time, to alleviate the background interference, the attention mechanism is used to guide the network to focus on the head position in the picture, and different weights are assigned to different positions according to the attention score map, so as to highlight the crowd and weaken the background, and finally get a high-quality density map. In addition, a shallow convolution module is designed as another branch. The output feature map of the shallow convolution module and the output feature map of the attention module of cross column fusion are fused to solve the problem of scale change effectively. Finally, in the last layer of the network, the convolution layer of 1 x 1 is used to replace the full connection layer, and fewer network parameters are used to reduce the calculation and the population density map is regressed. The experimental results show that the mean absolute error and mean square error of the proposed algorithm are significantly reduced compared with the comparison algorithm. (C) 2021 SPIE and IS&T

引用

页数：12

共 27 条

[1]

[Anonymous], 2007, IN 2007 IEEE C COMP

[2]

Bansal A., 2015, ARXIV150708445

[3] Scale Aggregation Network for Accurate and Efficient Crowd Counting [J].

Cao, Xinkun ;

Wang, Zhipeng ;

Zhao, Yanyun ;

Su, Fei .

COMPUTER VISION - ECCV 2018, PT V, 2018, 11209 :757-773

[4] Bayesian Poisson Regression for Crowd Counting [J].

Chan, Antoni B. ;

Vasconcelos, Nuno .

2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009, :545-551

[5] Feature Mining for Localised Crowd Counting [J].

Chen, Ke ;

Loy, Chen Change ;

Gong, Shaogang ;

Xiang, Tao .

PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2012, 2012,

[6] Object Detection with Discriminatively Trained Part-Based Models [J].

Felzenszwalb, Pedro F. ;

Girshick, Ross B. ;

McAllester, David ;

Ramanan, Deva .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (09) :1627-1645

[7] Sharable and Individual Multi-View Metric Learning [J].

Hu, Junlin ;

Lu, Jiwen ;

Tan, Yap-Peng .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (09) :2281-2288

[8] Multi-Source Multi-Scale Counting in Extremely Dense Crowd Images [J].

Idrees, Haroon ;

Saleemi, Imran ;

Seibert, Cody ;

Shah, Mubarak .

2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, :2547-2554

[9] CSRNet: Dilated Convolutional Neural Networks for Understanding the Highly Congested Scenes [J].

Li, Yuhong ;

Zhang, Xiaofan ;

Chen, Deming .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :1091-1100

[10] Counting crowd flow based on feature points [J].

Liang, Ronghua ;

Zhu, Yuge ;

Wang, Haixia .

NEUROCOMPUTING, 2014, 133 :377-384

← 1 2 3 →