Deep Pedestrian Detection Using Contextual Information and Multi-level Features

被引:6
作者
Kong, Weijie [1 ]
Li, Nannan [1 ]
Li, Thomas H. [2 ]
Li, Ge [1 ]
机构
[1] Peking Univ, Shenzhen Grad Sch, Sch Elect & Comp Engn, Shenzhen, Peoples R China
[2] Gpower Semicond Inc, Suzhou, Peoples R China
来源
MULTIMEDIA MODELING, MMM 2018, PT I | 2018年 / 10704卷
基金
美国国家科学基金会;
关键词
Pedestrian detection; Faster R-CNN; Contextual information; Multi-level features;
D O I
10.1007/978-3-319-73603-7_14
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recently, Faster R-CNN achieves great performance in deep learning based object detection. However, a major bottleneck of Faster R-CNN lies on the sharp performance deterioration when detecting objects that are small in size or have a similar appearance with their backgrounds. To address this problem, we present a new pedestrian detection approach based on Faster R-CNN, which combines contextual information with multi-level features. The contextual information is embedded by pooling information from a larger area around the original region of interest. It helps pedestrians detection from cluttered backgrounds. The multi-level features can be obtained by pooling proposal-specific features from several shallow but high-resolution layers. These features are more informative for detecting small-size pedestrians. Extensive experiments on the challenging Caltech dataset validate that our approach not only performs better than the baseline of Faster R-CNN but also boosts the detection performance when combined with contextual information and multi-level features. Meanwhile, compared with numerous pedestrian detection approaches, our combined method outperforms all of them and achieves a quite superior performance.
引用
收藏
页码:166 / 177
页数:12
相关论文
共 20 条
  • [1] [Anonymous], SCALE AWARE FAST R C
  • [2] Dai J., 2016, ADV NEURAL INFORM PR, V29, P379, DOI [DOI 10.1016/J.JPOWSOUR.2007.02.075, DOI 10.48550/ARXIV.1605.06409, DOI 10.1109/CVPR.2017.690]
  • [3] Histograms of oriented gradients for human detection
    Dalal, N
    Triggs, B
    [J]. 2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, : 886 - 893
  • [4] Fast Feature Pyramids for Object Detection
    Dollar, Piotr
    Appel, Ron
    Belongie, Serge
    Perona, Pietro
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (08) : 1532 - 1545
  • [5] Pedestrian Detection: An Evaluation of the State of the Art
    Dollar, Piotr
    Wojek, Christian
    Schiele, Bernt
    Perona, Pietro
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (04) : 743 - 761
  • [6] Dollar Piotr, 2009, Integral channel featuresJ
  • [7] Du X., 2016, Fused DNN: A deep neural network fusion approach to fast and robust pedestrian detection
  • [8] Fast R-CNN
    Girshick, Ross
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1440 - 1448
  • [9] Glorot X, 2010, P 13 INT C ART INT S, P249, DOI DOI 10.1109/LGRS.2016.2565705
  • [10] Liu W., 2015, Parsenet: Looking wider to see better