Bi-box Regression for Pedestrian Detection and Occlusion Estimation

Citations: 139
Authors
Zhou, Chunluan [1, 2]
Yuan, Junsong [2]
Affiliations
[1] Nanyang Technol Univ, Singapore, Singapore
[2] SUNY Buffalo, Buffalo, NY 14260 USA
Source
COMPUTER VISION - ECCV 2018, PT I | 2018 / Vol. 11205
Keywords
Pedestrian detection; Occlusion handling; Deep CNN;
DOI
10.1007/978-3-030-01246-5_9
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Occlusions present a great challenge for pedestrian detection in practical applications. In this paper, we propose a novel approach to simultaneous pedestrian detection and occlusion estimation that regresses two bounding boxes, one localizing the full body of a pedestrian and the other its visible part. For this purpose, we learn a deep convolutional neural network (CNN) consisting of two branches, one for full body estimation and the other for visible part estimation. The two branches are treated differently during training so that they learn to produce complementary outputs, which can be further fused to improve detection performance. The full body estimation branch is trained to regress full body regions for positive pedestrian proposals, while the visible part estimation branch is trained to regress visible part regions for both positive and negative pedestrian proposals. The visible part region of a negative pedestrian proposal is forced to shrink to its center. In addition, we introduce a new criterion for selecting positive training examples, which contributes substantially to the detection of heavily occluded pedestrians. We validate the effectiveness of the proposed bi-box regression approach on the Caltech and CityPersons datasets. Experimental results show that our approach achieves promising performance for detecting both non-occluded and occluded pedestrians, especially heavily occluded ones.
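The training targets described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names (`encode_box`, `bibox_targets`), the Faster R-CNN style center/log-size box encoding, and the `shrink_scale` value are all assumptions chosen for illustration. The abstract only states that positive proposals get both a full body and a visible part target, and that the visible part of a negative proposal is forced to shrink to its center.

```python
import numpy as np


def encode_box(proposal, box):
    """Encode `box` relative to `proposal`; both are (x1, y1, x2, y2).

    Assumed parametrization: center offsets normalized by the proposal
    size, plus log ratios of width and height.
    """
    pw, ph = proposal[2] - proposal[0], proposal[3] - proposal[1]
    px, py = proposal[0] + pw / 2.0, proposal[1] + ph / 2.0
    bw, bh = box[2] - box[0], box[3] - box[1]
    bx, by = box[0] + bw / 2.0, box[1] + bh / 2.0
    return np.array([(bx - px) / pw, (by - py) / ph,
                     np.log(bw / pw), np.log(bh / ph)])


def bibox_targets(proposal, full_box=None, visible_box=None, shrink_scale=0.05):
    """Return (full_body_target, visible_part_target) for one proposal.

    Positive proposal (both boxes given): both branches get a target.
    Negative proposal: the full body branch gets no target (None), and the
    visible part target is a tiny box at the proposal center.
    `shrink_scale` is a hypothetical value, not taken from the paper.
    """
    if full_box is not None and visible_box is not None:
        return encode_box(proposal, full_box), encode_box(proposal, visible_box)
    tiny = np.log(shrink_scale)
    return None, np.array([0.0, 0.0, tiny, tiny])


# A positive proposal whose lower half is occluded, then a negative proposal.
full_t, vis_t = bibox_targets((0, 0, 10, 20), (0, 0, 10, 20), (0, 0, 10, 10))
none_t, neg_t = bibox_targets((0, 0, 10, 20))
```

Under these assumptions, a predicted visible box that collapses toward the proposal center suggests a background proposal, which is one way the two outputs could complement each other when fused.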
Pages: 138-154
Page count: 17
Related papers
43 entries in total
[1]  
Angelova A., 2015, P BMVC
[2]  
Azizpour H, 2012, LECT NOTES COMPUT SC, V7572, P836, DOI 10.1007/978-3-642-33718-5_60
[3]   Seeking the strongest rigid detector [J].
Benenson, Rodrigo ;
Mathias, Markus ;
Tuytelaars, Tinne ;
Van Gool, Luc .
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, :3666-3673
[4]   Illuminating Pedestrians via Simultaneous Detection & Segmentation [J].
Brazil, Garrick ;
Yin, Xi ;
Liu, Xiaoming .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :4960-4969
[5]   A Unified Multi-scale Deep Convolutional Neural Network for Fast Object Detection [J].
Cai, Zhaowei ;
Fan, Quanfu ;
Feris, Rogerio S. ;
Vasconcelos, Nuno .
COMPUTER VISION - ECCV 2016, PT IV, 2016, 9908 :354-370
[6]   Learning Complexity-Aware Cascades for Deep Pedestrian Detection [J].
Cai, Zhaowei ;
Saberian, Mohammad ;
Vasconcelos, Nuno .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :3361-3369
[7]   Beyond triplet loss: a deep quadruplet network for person re-identification [J].
Chen, Weihua ;
Chen, Xiaotang ;
Zhang, Jianguo ;
Huang, Kaiqi .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1320-1329
[8]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[9]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[10]   Fast Feature Pyramids for Object Detection [J].
Dollar, Piotr ;
Appel, Ron ;
Belongie, Serge ;
Perona, Pietro .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (08) :1532-1545