PF_YOLOv4: An Improved Small Object Pedestrian Detection Algorithm

Cited by: 7

Authors:

Li, Kaihui [1,2]

Zhuang, Yuan [1,2]

Lai, Jinling [1]

Zeng, Yunhui [1,2]

Affiliations:

[1] Qilu Univ Technol, Shandong Acad Sci, Fac Comp Sci & Technol, Jinan 250014, Peoples R China

[2] Qilu Univ Technol, Shandong Comp Sci Ctr, Natl Supercomp Ctr Jinan, Shandong Prov Key Lab Comp Networks, Shandong Acad, Jinan 250014, Peoples R China

Source:

IEEE Access | 2023 / Volume 11

Keywords:

small target pedestrian detection; soft thresholding; depthwise separable convolution; convolutional block attention module;

DOI:

10.1109/ACCESS.2023.3244981

Chinese Library Classification number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

With the development of deep convolutional neural networks, pedestrian detection performance has improved rapidly. However, small target pedestrian detection still faces many problems, such as interference from noise (for example, lighting), target occlusion, and low detection accuracy. To address these problems, this paper proposes PF_YOLOv4, an improved small target pedestrian detection algorithm based on YOLOv4. The algorithm improves on YOLOv4 in three respects. First, a soft thresholding module is added to the residual structure of the backbone network to reduce noise from interference factors such as lighting, which enhances the robustness of the algorithm. Second, depthwise separable convolution replaces the traditional convolution in the YOLOv4 residual structure to reduce the number of network model parameters. Finally, the Convolutional Block Attention Module (CBAM) is added after the output feature map of the backbone network to enhance the network's feature expression. Experimental results show that PF_YOLOv4 outperforms most state-of-the-art algorithms in detecting small target pedestrians. Its mean Average Precision (mAP) is 2.35% higher than that of YOLOv4 and 9.67% higher than that of YOLOv3, while its detection speed is slightly higher than that of YOLOv4.
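The abstract names soft thresholding and depthwise separable convolution but gives no formulas. As a rough illustration only, the sketch below uses the textbook definitions of both: soft thresholding shrinks a value toward zero by a threshold and zeroes anything within it, and a depthwise separable convolution splits a k x k convolution into a per-channel k x k step plus a 1 x 1 pointwise step. The function names, the threshold value, and the layer sizes are assumptions chosen for the example; they are not taken from the paper, and the paper's actual module (for instance, how its threshold is obtained) may differ.

```python
def soft_threshold(x: float, tau: float) -> float:
    """Textbook soft thresholding: shrink x toward zero by tau, zero if |x| <= tau."""
    if x > tau:
        return x - tau
    if x < -tau:
        return x + tau
    return 0.0


def conv_params(k: int, c_in: int, c_out: int) -> int:
    """Weight count of a standard k x k convolution (bias ignored)."""
    return k * k * c_in * c_out


def depthwise_separable_params(k: int, c_in: int, c_out: int) -> int:
    """Weight count of a k x k depthwise conv plus a 1 x 1 pointwise conv (bias ignored)."""
    return k * k * c_in + c_in * c_out


if __name__ == "__main__":
    # Hypothetical activations and threshold, chosen only for illustration.
    activations = (-2.0, -0.3, 0.0, 0.4, 1.5)
    print([soft_threshold(v, 0.5) for v in activations])

    # Hypothetical layer shape: 3 x 3 kernel, 128 input and 256 output channels.
    standard = conv_params(3, 128, 256)
    separable = depthwise_separable_params(3, 128, 256)
    print(standard, separable, round(standard / separable, 2))
```

For this assumed layer shape the separable form needs 33,920 weights against 294,912 for the standard convolution, which is the kind of parameter reduction the abstract refers to; the ratio depends on kernel size and channel counts.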

Citation

Pages: 17197-17206

Page count: 10

References: 37 in total

