Pedestrian Detection Method Based on FCOS-DEFPN Model

Cited by: 0
Authors
Chen, Feng [1]
Gu, Xiang [1]
Gao, Long [1]
Wang, Jin [1]
Affiliation
[1] Nantong Univ, Sch Informat Sci & Technol, Nantong 226019, Jiangsu, Peoples R China
Source
IEEE ACCESS | 2024 / Vol. 12
Funding
National Natural Science Foundation of China
Keywords
Pedestrians; Accuracy; Feature extraction; Prediction algorithms; Real-time systems; Detectors; Deep learning; Automatic driving; pedestrian detection; full convolutional one-stage target detection; small target detection; occlusion detection;
DOI
10.1109/ACCESS.2024.3434987
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline Classification Code
0812
Abstract
Autonomous driving technology places high demands on the accuracy and real-time performance of pedestrian identification and localization. Pedestrian detection is a basic and necessary function in vision-based pedestrian detection and collision warning systems, and it can help avoid traffic accidents and improve road safety. This paper proposes a lightweight solution for real-time pedestrian detection, the FCOS-DEFPN model, built on the FCOS model. The network is made lightweight by replacing the ResNet50 backbone with MobileNetV3 and by using depthwise separable convolution instead of ordinary convolution to compress parameters. To maintain detection accuracy, data augmentation methods such as Random Erasing and Mosaic are introduced to simulate pedestrian occlusion and small-target scenarios, which improves the robustness of the model. For the pedestrian occlusion scenario, a lightweight attention network, ECA, is introduced to help extract pedestrian features better. For small-target, multi-scale pedestrians, the DEFPN feature pyramid network is proposed, which acquires multi-scale feature information through attentional fusion of feature layers at different scales in the top-down, bottom-up, and front-back directions. The experimental results show that the proposed model improves detection accuracy for occluded and small-target pedestrians and meets real-time pedestrian detection requirements while remaining robust in complex scenes.
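The parameter compression mentioned in the abstract can be illustrated with a short calculation. The sketch below is not taken from the paper: it only compares the weight count of an ordinary k x k convolution with that of a depthwise separable one (a depthwise k x k convolution followed by a 1 x 1 pointwise convolution). The layer sizes and the function names are hypothetical, chosen for illustration, and bias terms are ignored.

```python
def conv_params(k, c_in, c_out):
    """Weight count of an ordinary k x k convolution (bias omitted)."""
    return k * k * c_in * c_out


def depthwise_separable_params(k, c_in, c_out):
    """Weight count of a depthwise k x k conv plus a 1 x 1 pointwise conv."""
    depthwise = k * k * c_in   # one k x k filter per input channel
    pointwise = c_in * c_out   # 1 x 1 conv that mixes channels
    return depthwise + pointwise


# Hypothetical layer sizes, not values reported in the paper.
k, c_in, c_out = 3, 256, 256

standard = conv_params(k, c_in, c_out)
separable = depthwise_separable_params(k, c_in, c_out)
ratio = separable / standard  # equals 1/c_out + 1/k**2

print(f"ordinary conv:        {standard} weights")
print(f"depthwise separable:  {separable} weights")
print(f"ratio:                {ratio:.3f}")
```

For these assumed sizes the separable layer needs roughly 11.5% of the weights of the ordinary layer, which is the general reason this substitution shrinks a network; the actual savings in FCOS-DEFPN depend on the layer shapes used in the paper.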
Pages: 144337 - 144349
Page count: 13