Illuminating Pedestrians via Simultaneous Detection & Segmentation

被引:198
作者
Brazil, Garrick [1 ]
Yin, Xi [1 ]
Liu, Xiaoming [1 ]
机构
[1] Michigan State Univ, E Lansing, MI 48824 USA
来源
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) | 2017年
关键词
D O I
10.1109/ICCV.2017.530
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Pedestrian detection is a critical problem in computer vision with significant impact on safety in urban autonomous driving. In this work, we explore how semantic segmentation can be used to boost pedestrian detection accuracy while having little to no impact on network efficiency. We propose a segmentation infusion network to enable joint supervision on semantic segmentation and pedestrian detection. When placed properly, the additional supervision helps guide features in shared layers to become more sophisticated and helpful for the downstream pedestrian detector. Using this approach, we find weakly annotated boxes to be sufficient for considerable performance gains. We provide an in-depth analysis to demonstrate how shared layers are shaped by the segmentation supervision. In doing so, we show that the resulting feature maps become more semantically meaningful and robust to shape and occlusion. Overall, our simultaneous detection and segmentation framework achieves a considerable gain over the state-of-the-art on the Caltech pedestrian dataset, competitive performance on KITTI, and executes 2x faster than competitive methods.
引用
收藏
页码:4960 / 4969
页数:10
相关论文
共 31 条
  • [1] [Anonymous], 2015, ARXIV150502438
  • [2] [Anonymous], 2012, PROC IEEE C COMPUTER
  • [3] [Anonymous], PROC CVPR IEEE
  • [4] [Anonymous], 2016, ARXIV160600915
  • [5] [Anonymous], The PASCAL Visual Object Classes Challenge 2012 (VOC2012) Results
  • [6] [Anonymous], 2016, ARXIV160707032
  • [7] [Anonymous], PROC CVPR IEEE
  • [8] [Anonymous], IEEE T PATTERN ANAL
  • [9] [Anonymous], ARXIV161003466
  • [10] Multiscale Combinatorial Grouping
    Arbelaez, Pablo
    Pont-Tuset, Jordi
    Barron, Jonathan T.
    Marques, Ferran
    Malik, Jitendra
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 328 - 335