FE-CSP: a fast and efficient pedestrian detector with center and scale prediction

被引:5
作者
Qin, Yugang [1 ]
Qian, Yurong [1 ,2 ,3 ]
Wei, Hongyang [1 ]
Fan, Yingying [2 ]
Feng, Peiyun [1 ]
机构
[1] Xinjiang Univ, Sch Software, Urumqi 830000, Peoples R China
[2] Key Lab Signal Detect & Proc Xinjiang Uygur Auton, Urumqi 830000, Peoples R China
[3] Xinjiang Univ, Coll Informat Sci & Engn, Urumqi 830000, Peoples R China
基金
中国国家自然科学基金;
关键词
Pedestrian detection; Channel attention; Spatial attention; Deformable convolution; Feature pyramid network; ATTENTION;
D O I
10.1007/s11227-022-04815-7
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
There are still many pressing problems in pedestrian detection, such as difficulty in detection due to severe pedestrian occlusion, difficulty in detecting small objects and low detection speed. In this paper, we propose A Fast and Efficient Pedestrian Detector with Center and Scale Prediction (FE-CSP). We combine channel attention with spatial attention, replace the traditional convolution with deformable convolution, and embed the backbone network to propose CSANet (Channel and Spatial Attention Network), which efficiently extracts the semantic features of the object, and then propose a feature pyramid network to replace the traditional concatenation to perform multi-scale feature detection, which effectively improves the detection speed. By conducting experiments on CityPersons, our method achieves 10.1%, 13.7% and 47.4% MR-2 at a speed of 0.21 s/img on the reasonable setting, small setting and heavy setting, respectively. On Caltech, our method achieves 5.2% MR-2 at a speed of 0.06 s/img on the Reasonable setting, further demonstrating the superiority and generalization ability of the proposed method.
引用
收藏
页码:4084 / 4104
页数:21
相关论文
共 54 条
[1]  
Bochkovskiy A., 2020, YOLOv4: Optimal speed and accuracy of object detection, DOI [DOI 10.48550/ARXIV.2004.10934, 10.48550/ARXIV.2004.10934]
[2]   Cascade R-CNN: High Quality Object Detection and Instance Segmentation [J].
Cai, Zhaowei ;
Vasconcelos, Nuno .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (05) :1483-1498
[3]   GCNet: Non-local Networks Meet Squeeze-Excitation Networks and Beyond [J].
Cao, Yue ;
Xu, Jiarui ;
Lin, Stephen ;
Wei, Fangyun ;
Hu, Han .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, :1971-1980
[4]   Channel and spatial attention based deep object co-segmentation [J].
Chen, Jia ;
Chen, Yasong ;
Li, Weihao ;
Ning, Guoqin ;
Tong, Mingwen ;
Hilton, Adrian .
KNOWLEDGE-BASED SYSTEMS, 2021, 211
[5]   You Only Look One-level Feature [J].
Chen, Qiang ;
Wang, Yingming ;
Yang, Tong ;
Zhang, Xiangyu ;
Cheng, Jian ;
Sun, Jian .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :13034-13043
[6]   Beyond triplet loss: a deep quadruplet network for person re-identification [J].
Chen, Weihua ;
Chen, Xiaotang ;
Zhang, Jianguo ;
Huang, Kaiqi .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1320-1329
[7]   Control of goal-directed and stimulus-driven attention in the brain [J].
Corbetta, M ;
Shulman, GL .
NATURE REVIEWS NEUROSCIENCE, 2002, 3 (03) :201-215
[8]  
Dai JF, 2016, ADV NEUR IN, V29
[9]   Deformable Convolutional Networks [J].
Dai, Jifeng ;
Qi, Haozhi ;
Xiong, Yuwen ;
Li, Yi ;
Zhang, Guodong ;
Hu, Han ;
Wei, Yichen .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :764-773
[10]   Pedestrian Detection: An Evaluation of the State of the Art [J].
Dollar, Piotr ;
Wojek, Christian ;
Schiele, Bernt ;
Perona, Pietro .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (04) :743-761