SAF: Semantic Attention Fusion Mechanism for Pedestrian Detection

被引:3
作者
Yu, Ruizhe [1 ]
Wang, Shunzhou [1 ]
Lu, Yao [1 ]
Di, Huijun [1 ]
Zhang, Lin [1 ]
Lu, Lihua [1 ]
机构
[1] Beijing Inst Technol, Sch Comp Sci & Technol, Beijing Lab Intelligent Informat Technol, Beijing 100081, Peoples R China
来源
PRICAI 2019: TRENDS IN ARTIFICIAL INTELLIGENCE, PT II | 2019年 / 11671卷
关键词
Pedestrian detection; Semantic attention; Background errors;
D O I
10.1007/978-3-030-29911-8_40
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Benefiting from deep learning methods, pedestrian detection has witnessed a great progress in recent years. However, many pedestrian detectors are prone to detect background instances, especially under urban scenes, which results in plenty of false positive detections. In this paper, we propose a semantic attention fusion mechanism (SAF) to increase the discriminability of detector. The SAF includes two key components, attention modules and reverse fusion blocks. Different from previous attention mechanisms which use attention modules for re-weighting the top features of network directly, the outputs of our attention modules are fused by reverse fusion blocks from high level layers to low level layers step by step, which aims at generating strong semantic features for pedestrian detections. Experiments on CityPersons dataset demonstrate the effectiveness of our SAF.
引用
收藏
页码:523 / 533
页数:11
相关论文
共 21 条
  • [1] [Anonymous], 2017, ARXIV170404861
  • [2] Beyond triplet loss: a deep quadruplet network for person re-identification
    Chen, Weihua
    Chen, Xiaotang
    Zhang, Jianguo
    Huang, Kaiqi
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1320 - 1329
  • [3] The Cityscapes Dataset for Semantic Urban Scene Understanding
    Cordts, Marius
    Omran, Mohamed
    Ramos, Sebastian
    Rehfeld, Timo
    Enzweiler, Markus
    Benenson, Rodrigo
    Franke, Uwe
    Roth, Stefan
    Schiele, Bernt
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3213 - 3223
  • [4] Pedestrian Detection: An Evaluation of the State of the Art
    Dollar, Piotr
    Wojek, Christian
    Schiele, Bernt
    Perona, Pietro
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (04) : 743 - 761
  • [5] Fast R-CNN
    Girshick, Ross
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1440 - 1448
  • [6] Hu J, 2018, PROC CVPR IEEE, P7132, DOI [10.1109/CVPR.2018.00745, 10.1109/TPAMI.2019.2913372]
  • [7] RON: Reverse Connection with Objectness Prior Networks for Object Detection
    Kong, Tao
    Sun, Fuchun
    Yao, Anbang
    Liu, Huaping
    Lu, Ming
    Chen, Yurong
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 5244 - 5252
  • [8] Graininess-Aware Deep Feature Learning for Pedestrian Detection
    Lin, Chunze
    Lu, Jiwen
    Wang, Gang
    Zhou, Jie
    [J]. COMPUTER VISION - ECCV 2018, PT IX, 2018, 11213 : 745 - 761
  • [9] Focal Loss for Dense Object Detection
    Lin, Tsung-Yi
    Goyal, Priya
    Girshick, Ross
    He, Kaiming
    Dollar, Piotr
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2999 - 3007
  • [10] Feature Pyramid Networks for Object Detection
    Lin, Tsung-Yi
    Dollar, Piotr
    Girshick, Ross
    He, Kaiming
    Hariharan, Bharath
    Belongie, Serge
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 936 - 944