Coupled Network for Robust Pedestrian Detection With Gated Multi-Layer Feature Extraction and Deformable Occlusion Handling

被引:26
作者
Liu, Tianrui [1 ,2 ]
Luo, Wenhan [3 ]
Ma, Lin [4 ]
Huang, Jun-Jie [5 ]
Stathaki, Tania [5 ]
Dai, Tianhong [6 ]
机构
[1] Imperial Coll London ICL, Dept Elect & Elect Engn, London SW7 2AZ, England
[2] ICI PLC, Dept Comp, London SW7 2AZ, England
[3] Tencent AI Lab, Shenzhen 518000, Peoples R China
[4] Meituan, Beijing 100102, Peoples R China
[5] ICI PLC, Dept Elect & Elect Engn, London SW7 2AZ, England
[6] ICI PLC, Dept Biol Engn, London SW7 2AZ, England
关键词
Feature extraction; Logic gates; Proposals; Detectors; Task analysis; Neural networks; Forestry; Pedestrian detection; coupled network; gated feature extraction; squeeze network; multi-layer feature; occlusion handling; deformable RoI-pooling; TRACKING; SCALE;
D O I
10.1109/TIP.2020.3038371
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Pedestrian detection methods have been significantly improved with the development of deep convolutional neural networks. Nevertheless, detecting ismall-scaled pedestrians and occluded pedestrians remains a challenging problem. In this paper, we propose a pedestrian detection method with a couple-network to simultaneously address these two issues. One of the sub-networks, the gated multi-layer feature extraction sub-network, aims to adaptively generate discriminative features for pedestrian candidates in order to robustly detect pedestrians with large variations on scale. The second sub-network targets on handling the occlusion problem of pedestrian detection by using deformable regional region of interest (RoI)-pooling. We investigate two different gate units for the gated sub-network, namely, the channel-wise gate unit and the spatio-wise gate unit, which can enhance the representation ability of the regional convolutional features among the channel dimensions or across the spatial domain, repetitively. Ablation studies have validated the effectiveness of both the proposed gated multi-layer feature extraction sub-network and the deformable occlusion handling sub-network. With the coupled framework, our proposed pedestrian detector achieves promising results on both two pedestrian datasets, especially on detecting small or occluded pedestrians. On the CityPersons dataset, the proposed detector achieves the lowest missing rates (i.e. 40.78% and 34.60%) on detecting small and occluded pedestrians, surpassing the second best comparison method by 6.0% and 5.87%, respectively.
引用
收藏
页码:754 / 766
页数:13
相关论文
共 51 条
[1]   A Low-Complexity Pedestrian Detection Framework for Smart Video Surveillance Systems [J].
Bilal, Muhammad ;
Khan, Asim ;
Khan, Muhammad Umar Karim ;
Kyung, Chong-Min .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2017, 27 (10) :2260-2273
[2]   A Unified Multi-scale Deep Convolutional Neural Network for Fast Object Detection [J].
Cai, Zhaowei ;
Fan, Quanfu ;
Feris, Rogerio S. ;
Vasconcelos, Nuno .
COMPUTER VISION - ECCV 2016, PT IV, 2016, 9908 :354-370
[3]   Learning Complexity-Aware Cascades for Deep Pedestrian Detection [J].
Cai, Zhaowei ;
Saberian, Mohammad ;
Vasconcelos, Nuno .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :3361-3369
[4]   Learning Multilayer Channel Features for Pedestrian Detection [J].
Cao, Jiale ;
Pang, Yanwei ;
Li, Xuelong .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (07) :3210-3220
[5]  
Chen L. C., 2014, ICLR
[6]   Beyond triplet loss: a deep quadruplet network for person re-identification [J].
Chen, Weihua ;
Chen, Xiaotang ;
Zhang, Jianguo ;
Huang, Kaiqi .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1320-1329
[7]  
Cordts Marius, 2015, CVPR WORKSHOP FUTURE, V2
[8]   Semantic Channels for Fast Pedestrian Detection [J].
Costea, Arthur Daniel ;
Nedevschi, Sergiu .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :2360-2368
[9]   Deformable Convolutional Networks [J].
Dai, Jifeng ;
Qi, Haozhi ;
Xiong, Yuwen ;
Li, Yi ;
Zhang, Guodong ;
Hu, Han ;
Wei, Yichen .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :764-773
[10]  
Dai J, 2016, PROCEEDINGS 2016 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY (ICIT), P1796, DOI 10.1109/ICIT.2016.7475036