Joint Deep Learning for Pedestrian Detection

被引:435
作者
Ouyang, Wanli [1 ]
Wang, Xiaogang [1 ]
机构
[1] Chinese Univ Hong Kong, Dept Elect Engn, Hong Kong, Hong Kong, Peoples R China
来源
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) | 2013年
关键词
FEATURES;
D O I
10.1109/ICCV.2013.257
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature extraction, deformation handling, occlusion handling, and classification are four important components in pedestrian detection. Existing methods learn or design these components either individually or sequentially. The interaction among these components is not yet well explored. This paper proposes that they should be jointly learned in order to maximize their strengths through cooperation. We formulate these four components into a joint deep learning framework and propose a new deep network architecture(1). By establishing automatic, mutual interaction among components, the deep model achieves a 9% reduction in the average miss rate compared with the current best-performing pedestrian detection approaches on the largest Caltech benchmark dataset.
引用
收藏
页码:2056 / 2063
页数:8
相关论文
共 56 条
  • [1] [Anonymous], 2009, BMVC
  • [2] [Anonymous], 2010, BRIT MACH VIS C AB U
  • [3] [Anonymous], 2007, CVPR
  • [4] [Anonymous], 2011, ICCV
  • [5] [Anonymous], CVPR
  • [6] [Anonymous], 2010, CVPR
  • [7] [Anonymous], 2012, ICML
  • [8] [Anonymous], 2007, NIPS
  • [9] [Anonymous], 2009, ICCV
  • [10] [Anonymous], CVPR