Graininess-Aware Deep Feature Learning for Robust Pedestrian Detection

被引:72
作者
Lin, Chunze [1 ,2 ,3 ]
Lu, Jiwen [1 ,2 ,3 ]
Wang, Gang [4 ]
Zhou, Jie [1 ,2 ,3 ]
机构
[1] Tsinghua Univ, State Key Lab Intelligent Technol & Syst, Beijing 100084, Peoples R China
[2] Tsinghua Univ, Beijing Res Ctr Informat Sci & Technol BNRIST, Beijing 100084, Peoples R China
[3] Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China
[4] Alibaba Grp, AI Labs, Hangzhou 310052, Peoples R China
基金
中国国家自然科学基金;
关键词
Pedestrian detection; attention; deep learning; graininess; OCCLUSION; NETWORK;
D O I
10.1109/TIP.2020.2966371
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a graininess-aware deep feature learning method for pedestrian detection. Unlike most existing methods which utilize the convolutional features without explicit distinction, we appropriately exploit multiple convolutional layers and dynamically select most informative features. Specifically, we train a multi-scale pedestrian attention via pixel-wise segmentation supervision to efficiently identify the pedestrian of particular scales. We encodes the fine-grained attention map into the feature maps of the detection layers to guide them to highlight the pedestrians of specific scale and avoid the background interference. The graininess-aware feature maps generated with our attention mechanism are more focused on pedestrians, and in particular on the small-scale and occluded targets. We further introduce a zoom-in-zoom-out module to enhances the features by incorporating local details and context information. Extensive experimental results on five challenging pedestrian detection benchmarks show that our method achieves very competitive or even better performance with the state-of-the-arts and is faster than most existing approaches.
引用
收藏
页码:3820 / 3834
页数:15
相关论文
共 82 条
[61]  
Tian YL, 2015, PROC CVPR IEEE, P5079, DOI 10.1109/CVPR.2015.7299143
[62]   Detecting pedestrians using patterns of motion and appearance [J].
Viola, P ;
Jones, MJ ;
Snow, D .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2005, 63 (02) :153-161
[63]   Residual Attention Network for Image Classification [J].
Wang, Fei ;
Jiang, Mengqing ;
Qian, Chen ;
Yang, Shuo ;
Li, Cheng ;
Zhang, Honggang ;
Wang, Xiaogang ;
Tang, Xiaoou .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6450-6458
[64]   Pedestrian Detection via Body Part Semantic and Contextual Information With DNN [J].
Wang, Shiguang ;
Cheng, Jian ;
Liu, Haijun ;
Wang, Feng ;
Zhou, Hui .
IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 20 (11) :3148-3159
[65]   An HOG-LBP Human Detector with Partial Occlusion Handling [J].
Wang, Xiaoyu ;
Han, Tony X. ;
Yan, Shuicheng .
2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009, :32-39
[66]   Repulsion Loss: Detecting Pedestrians in a Crowd [J].
Wang, Xinlong ;
Xiao, Tete ;
Jiang, Yuning ;
Shao, Shuai ;
Sun, Jian ;
Shen, Chunhua .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :7774-7783
[67]  
Xu K, 2015, PR MACH LEARN RES, V37, P2048
[68]   Exploit All the Layers: Fast and Accurate CNN Object Detector with Scale Dependent Pooling and Cascaded Rejection Classifiers [J].
Yang, Fan ;
Choi, Wongun ;
Lin, Yuanqing .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :2129-2137
[69]   An Extended Filtered Channel Framework for Pedestrian Detection [J].
You, Mingyu ;
Zhang, Yubin ;
Shen, Chunhua ;
Zhang, Xinyu .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2018, 19 (05) :1640-1651
[70]   POI: Multiple Object Tracking with High Performance Detection and Appearance Feature [J].
Yu, Fengwei ;
Li, Wenbo ;
Li, Quanquan ;
Liu, Yu ;
Shi, Xiaohua ;
Yan, Junjie .
COMPUTER VISION - ECCV 2016 WORKSHOPS, PT II, 2016, 9914 :36-42