Progressive Refinement Network for Occluded Pedestrian Detection

被引:36
作者
Song, Xiaolin [1 ]
Zhao, Kaili [1 ]
Chu, Wen-Sheng [2 ]
Zhang, Honggang [1 ]
Guo, Jun [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Beijing, Peoples R China
[2] Google, Mountain View, CA USA
来源
COMPUTER VISION - ECCV 2020, PT XXIII | 2020年 / 12368卷
关键词
Occluded pedestrian detection; Progressive Refinement Network; Anchor calibration; Occlusion loss; Receptive Field Backfeed;
D O I
10.1007/978-3-030-58592-1_3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present Progressive Refinement Network (PRNet), a novel single-stage detector that tackles occluded pedestrian detection. Motivated by human's progressive process on annotating occluded pedestrians, PRNet achieves sequential refinement by three phases: Finding high-confident anchors of visible parts, calibrating such anchors to a full-body template derived from occlusion statistics, and then adjusting the calibrated anchors to final full-body regions. Unlike conventional methods that exploit predefined anchors, the confidence-aware calibration offers adaptive anchor initialization for detection with occlusions, and helps reduce the gap between visible-part and full-body detection. In addition, we introduce an occlusion loss to up-weigh hard examples, and a Receptive Field Backfeed (RFB) module to diversify receptive fields in early layers that commonly fire only on visible parts or small-size full-body regions. Experiments were performed within and across CityPersons, ETH, and Caltech datasets. Results show that PRNet can match the speed of existing single-stage detectors, consistently outperforms alternatives in terms of overall miss rate, and offers significantly better cross-dataset generalization. Code is available (https://github.com/sxlpris).
引用
收藏
页码:32 / 48
页数:17
相关论文
共 44 条
[41]   Occlusion-Aware R-CNN: Detecting Pedestrians in a Crowd [J].
Zhang, Shifeng ;
Wen, Longyin ;
Bian, Xiao ;
Lei, Zhen ;
Li, Stan Z. .
COMPUTER VISION - ECCV 2018, PT III, 2018, 11207 :657-674
[42]   Bi-box Regression for Pedestrian Detection and Occlusion Estimation [J].
Zhou, Chunluan ;
Yuan, Junsong .
COMPUTER VISION - ECCV 2018, PT I, 2018, 11205 :138-154
[43]   Discriminative Feature Transformation for Occluded Pedestrian Detection [J].
Zhou, Chunluan ;
Yang, Ming ;
Yuan, Junsong .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :9556-9565
[44]   Multi-label Learning of Part Detectors for Heavily Occluded Pedestrian Detection [J].
Zhou, Chunluan ;
Yuan, Junsong .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :3506-3515