PFYOLOv4: An Improved Small Object Pedestrian Detection Algorithm

被引:7
作者
Li, Kaihui [1 ,2 ]
Zhuang, Yuan [1 ,2 ]
Lai, Jinling [1 ]
Zeng, Yunhui [1 ,2 ]
机构
[1] Qilu Univ Technol, Shandong Acad Sci, Fac Comp Sci & Technol, Jinan 250014, Peoples R China
[2] Qilu Univ Technol, Shandong Comp Sci Ctr, Natl Supercomp Ctr Jinan, Shandong Prov Key Lab Comp Networks,Shandong Acad, Jinan 250014, Peoples R China
关键词
small target pedestrian detection; soft thresholding; depthwise separable convolution; convolutional block attention module;
D O I
10.1109/ACCESS.2023.3244981
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the development of deep convolutional neural networks, the effect of pedestrian detection has been rapidly improved. However, there are still many problems in small target pedestrian detection, for example noise (such as light) interference, target occlusion, and low detection accuracy. In order to solve the above problems, based on YOLOv4 algorithm, this paper proposes an improved small target pedestrian detection algorithm named PF_YOLOv4. The algorithm is improved in three aspects on the basis of the YOLOv4 algorithm: firstly, a soft thresholding module is added to the residual structure of the backbone network to perform noise reduction process on interference factors, such as light to enhance the robustness of the algorithm; secondly, the depthwise separable convolution replaces the traditional convolution in the YOLOv4 residual structure, to reduce the number of network model parameters; finally, the Convolutional Block Attention Module (CBAM) is added after the output feature map of the backbone network to enhance of the network feature expression. Experimental results show that the PF_YOLOv4 algorithm outperforms most of the state-of-the-art algorithms in detecting small target pedestrians. The mean Average Precision (mAP) of the PF_YOLOv4 algorithm is 2.35% higher than that of the YOLOv4 algorithm and 9.67% higher than that of the YOLOv3 algorithm, while the detection speed is slightly higher than that of YOLOv4 algorithm.
引用
收藏
页码:17197 / 17206
页数:10
相关论文
共 37 条
  • [1] Ansari M. F., 2021, Renewable Power for Sustainable Growth. Proceedings of International Conference on Renewal Power (ICRP 2020). Lecture Notes in Electrical Engineering (LNEE 723), P669, DOI 10.1007/978-981-33-4080-0_64
  • [2] Finding Tiny Faces in the Wild with Generative Adversarial Network
    Bai, Yancheng
    Zhang, Yongqiang
    Ding, Mingli
    Ghanem, Bernard
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 21 - 30
  • [3] Bochkovskiy A, 2020, Arxiv, DOI [arXiv:2004.10934, DOI 10.48550/ARXIV.2004.10934, 10.48550/arXiv.2004.10934]
  • [4] STDnet: Exploiting high resolution feature maps for small object detection
    Bosquet, Brais
    Mucientes, Manuel
    Brea, Victor M.
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2020, 91
  • [5] Feature-Fused SSD: Fast Detection for Small Objects
    Cao, Guimei
    Xie, Xuemei
    Yang, Wenzhe
    Liao, Quan
    Shi, Guangming
    Wu, Jinjian
    [J]. NINTH INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2017), 2018, 10615
  • [6] Attention Fusion for One-Stage Multispectral Pedestrian Detection
    Cao, Zhiwei
    Yang, Huihua
    Zhao, Juan
    Guo, Shuhong
    Li, Lingqiao
    [J]. SENSORS, 2021, 21 (12)
  • [7] Histograms of oriented gradients for human detection
    Dalal, N
    Triggs, B
    [J]. 2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, : 886 - 893
  • [8] Fu C-Y, 2017, arXiv
  • [9] Rich feature hierarchies for accurate object detection and semantic segmentation
    Girshick, Ross
    Donahue, Jeff
    Darrell, Trevor
    Malik, Jitendra
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 580 - 587
  • [10] Fast R-CNN
    Girshick, Ross
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1440 - 1448