Progressive Refinement Network for Occluded Pedestrian Detection

被引：36

作者：

Song, Xiaolin ^{[1
]}

Zhao, Kaili ^{[1
]}

Chu, Wen-Sheng ^{[2
]}

Zhang, Honggang ^{[1
]}

Guo, Jun ^{[1
]}

机构：

[1] Beijing Univ Posts & Telecommun, Beijing, Peoples R China

[2] Google, Mountain View, CA USA

来源：

COMPUTER VISION - ECCV 2020, PT XXIII | 2020年 / 12368卷

关键词：

Occluded pedestrian detection; Progressive Refinement Network; Anchor calibration; Occlusion loss; Receptive Field Backfeed;

D O I：

10.1007/978-3-030-58592-1_3

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We present Progressive Refinement Network (PRNet), a novel single-stage detector that tackles occluded pedestrian detection. Motivated by human's progressive process on annotating occluded pedestrians, PRNet achieves sequential refinement by three phases: Finding high-confident anchors of visible parts, calibrating such anchors to a full-body template derived from occlusion statistics, and then adjusting the calibrated anchors to final full-body regions. Unlike conventional methods that exploit predefined anchors, the confidence-aware calibration offers adaptive anchor initialization for detection with occlusions, and helps reduce the gap between visible-part and full-body detection. In addition, we introduce an occlusion loss to up-weigh hard examples, and a Receptive Field Backfeed (RFB) module to diversify receptive fields in early layers that commonly fire only on visible parts or small-size full-body regions. Experiments were performed within and across CityPersons, ETH, and Caltech datasets. Results show that PRNet can match the speed of existing single-stage detectors, consistently outperforms alternatives in terms of overall miss rate, and offers significantly better cross-dataset generalization. Code is available (https://github.com/sxlpris).

引用

页码：32 / 48

页数：17

共 44 条

[1] Pedestrian Detection with Autoregressive Network Phases [J].

Brazil, Garrick ;

Liu, Xiaoming .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :7224-7233

[2] Illuminating Pedestrians via Simultaneous Detection & Segmentation [J].

Brazil, Garrick ;

Yin, Xi ;

Liu, Xiaoming .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :4960-4969

[3] Beyond triplet loss: a deep quadruplet network for person re-identification [J].

Chen, Weihua ;

Chen, Xiaotang ;

Zhang, Jianguo ;

Huang, Kaiqi .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1320-1329

[4]

Chi C, 2019, AAAI CONF ARTIF INTE, P8231

[5]

Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848

[6] Pedestrian Detection: An Evaluation of the State of the Art [J].

Dollar, Piotr ;

Wojek, Christian ;

Schiele, Bernt ;

Perona, Pietro .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (04) :743-761

[7]

Duan GQ, 2010, LECT NOTES COMPUT SC, V6316, P238, DOI 10.1007/978-3-642-15567-3_18

[8] Multi-Cue Pedestrian Classification With Partial Occlusion Handling [J].

Enzweiler, Markus ;

Eigenstetter, Angela ;

Schiele, Bernt ;

Gavrila, Dariu M. .

2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, :990-997

[9]

Ess A, 2007, IEEE I CONF COMP VIS, P2065

[10] Vision meets robotics: The KITTI dataset [J].

Geiger, A. ;

Lenz, P. ;

Stiller, C. ;

Urtasun, R. .

INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2013, 32 (11) :1231-1237

← 1 2 3 4 5 →