TRIPLE ATTENTION FOR ROBUST VIDEO CROWD COUNTING

被引：0

作者：

Wu, Qiyao ^{[1
]}

Zhang, Chongyang ^{[1
]}

Kong, Xiyu ^{[1
]}

Zhao, Muming ^{[1
]}

Chen, Yanjun ^{[1
]}

机构：

[1] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai 200240, Peoples R China

来源：

2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) | 2020年

关键词：

Crowd counting; Co-attention; Robustness;

D O I：

10.1109/icip40778.2020.9190701

中图分类号：

TB8 [摄影技术];

学科分类号：

0804 ;

摘要：

Tra.ditional static-image based crowd counting methods work well on public d.atasets. However, due to the complexity and variability of real-world scenarios, their performance tends to drop dramatically in practice. Aiming to solve the robust problem of crowd counting, we propose to use a co-attention mechanism to extract correlation features lying behind adjacent video frames which can enhance the distinguish-ability between background and foreground. Also, we combine co-attention with spatial attention and multi-scale self-attention. Three different and complementary attention-based modules jointly reinforce the robustness of the counting model. Experiments on two widely used video crowd datasets demonstrate the effectiveness of the proposed approach.

引用

页码：1966 / 1970

页数：5

共 22 条

[1] Feature Mining for Localised Crowd Counting
Chen, Ke
Loy, Chen Change
Gong, Shaogang
Xiang, Tao
[J]. PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2012, 2012,
[2] Scale Pyramid Network for Crowd Counting
Chen, Xinya
Bin, Yanrui
Sang, Nong
Gao, Changxin
[J]. 2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, : 1941 - 1950
[3] Hsieh TI, 2019, ADV NEUR IN, V32
[4] Kong XY, 2020, INT CONF ACOUST SPEE, P2722, DOI [10.1109/icassp40776.2020.9054258, 10.1109/ICASSP40776.2020.9054258]
[5] CSRNet: Dilated Convolutional Neural Networks for Understanding the Highly Congested Scenes
Li, Yuhong
Zhang, Xiaofan
Chen, Deming
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 1091 - 1100
[6] DecideNet: Counting Varying Density Crowds Through Attention Guided Detection and Density Estimation
Liu, Jiang
Gao, Chenqiang
Meng, Deyu
Hauptmann, Alexander G.
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 5197 - 5206
[7] ADCrowdNet: An Attention-Injective Deformable Convolutional Network for Crowd Understanding
Liu, Ning
Long, Yongchao
Zou, Changqing
Niu, Qun
Pan, Li
Wu, Hefeng
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3220 - 3229
[8] Liu Weizhe, 2019, IEEE C COMP VIS PATT
[9] Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
Ren, Shaoqing
He, Kaiming
Girshick, Ross
Sun, Jian
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (06) : 1137 - 1149
[10] Switching Convolutional Neural Network for Crowd Counting
Sam, Deepak Babu
Surya, Shiv
Babu, R. Venkatesh
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 4031 - 4039

← 1 2 3 →