TRIPLE ATTENTION FOR ROBUST VIDEO CROWD COUNTING

被引:0
作者
Wu, Qiyao [1 ]
Zhang, Chongyang [1 ]
Kong, Xiyu [1 ]
Zhao, Muming [1 ]
Chen, Yanjun [1 ]
机构
[1] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai 200240, Peoples R China
来源
2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) | 2020年
关键词
Crowd counting; Co-attention; Robustness;
D O I
10.1109/icip40778.2020.9190701
中图分类号
TB8 [摄影技术];
学科分类号
0804 ;
摘要
Tra.ditional static-image based crowd counting methods work well on public d.atasets. However, due to the complexity and variability of real-world scenarios, their performance tends to drop dramatically in practice. Aiming to solve the robust problem of crowd counting, we propose to use a co-attention mechanism to extract correlation features lying behind adjacent video frames which can enhance the distinguish-ability between background and foreground. Also, we combine co-attention with spatial attention and multi-scale self-attention. Three different and complementary attention-based modules jointly reinforce the robustness of the counting model. Experiments on two widely used video crowd datasets demonstrate the effectiveness of the proposed approach.
引用
收藏
页码:1966 / 1970
页数:5
相关论文
共 22 条
  • [1] Feature Mining for Localised Crowd Counting
    Chen, Ke
    Loy, Chen Change
    Gong, Shaogang
    Xiang, Tao
    [J]. PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2012, 2012,
  • [2] Scale Pyramid Network for Crowd Counting
    Chen, Xinya
    Bin, Yanrui
    Sang, Nong
    Gao, Changxin
    [J]. 2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, : 1941 - 1950
  • [3] Hsieh TI, 2019, ADV NEUR IN, V32
  • [4] Kong XY, 2020, INT CONF ACOUST SPEE, P2722, DOI [10.1109/icassp40776.2020.9054258, 10.1109/ICASSP40776.2020.9054258]
  • [5] CSRNet: Dilated Convolutional Neural Networks for Understanding the Highly Congested Scenes
    Li, Yuhong
    Zhang, Xiaofan
    Chen, Deming
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 1091 - 1100
  • [6] DecideNet: Counting Varying Density Crowds Through Attention Guided Detection and Density Estimation
    Liu, Jiang
    Gao, Chenqiang
    Meng, Deyu
    Hauptmann, Alexander G.
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 5197 - 5206
  • [7] ADCrowdNet: An Attention-Injective Deformable Convolutional Network for Crowd Understanding
    Liu, Ning
    Long, Yongchao
    Zou, Changqing
    Niu, Qun
    Pan, Li
    Wu, Hefeng
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3220 - 3229
  • [8] Liu Weizhe, 2019, IEEE C COMP VIS PATT
  • [9] Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
    Ren, Shaoqing
    He, Kaiming
    Girshick, Ross
    Sun, Jian
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (06) : 1137 - 1149
  • [10] Switching Convolutional Neural Network for Crowd Counting
    Sam, Deepak Babu
    Surya, Shiv
    Babu, R. Venkatesh
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 4031 - 4039