Pedestrian Detection Method Based on FCOS-DEFPN Model

Cited by: 0

Authors:

Chen, Feng [1]

Gu, Xiang [1]

Gao, Long [1]

Wang, Jin [1]

Affiliation:

[1] Nantong Univ, Sch Informat Sci & Technol, Nantong 226019, Jiangsu, Peoples R China

Source:

IEEE Access | 2024 / Vol. 12

Funding:

National Natural Science Foundation of China;

Keywords:

Pedestrians; Accuracy; Feature extraction; Prediction algorithms; Real-time systems; Detectors; Deep learning; Automatic driving; pedestrian detection; full convolutional one-stage target detection; small target detection; occlusion detection;

DOI:

10.1109/ACCESS.2024.3434987

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

Autonomous driving places high demands on the accuracy and real-time performance of pedestrian identification and localization. Pedestrian detection is a basic and necessary function in vision-based pedestrian detection and collision warning systems, where it can help avoid traffic accidents and improve road safety to a certain extent. This paper proposes FCOS-DEFPN, a lightweight model built on FCOS for real-time pedestrian detection. The network is made lightweight by replacing the ResNet50 backbone with MobileNetV3 and by using depthwise separable convolution in place of ordinary convolution to compress parameters. To maintain detection accuracy, data augmentation methods such as Random Erasing and Mosaic are introduced to simulate pedestrian occlusion and small-target scenarios, which improves the robustness of the model. For pedestrian occlusion, a lightweight attention network, ECA, is introduced to help extract pedestrian features better. For small-target, multi-scale pedestrians, the DEFPN feature pyramid network is proposed, which acquires feature information at multiple scales through attention-based fusion of feature layers at different scales in the top-down, bottom-up, and front-back directions. The experimental results show that the proposed model improves detection accuracy for occluded and small-target pedestrians and satisfies real-time pedestrian detection requirements while remaining robust in complex scenes.
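The parameter compression mentioned in the abstract can be illustrated with a small calculation. The sketch below is not taken from the paper: the function names and the layer size (256 input channels, 256 output channels, 3 x 3 kernel) are assumptions chosen only for illustration, and bias terms are ignored. It compares the weight count of an ordinary convolution with that of a depthwise separable convolution (a per-channel k x k depthwise step followed by a 1 x 1 pointwise step).

```python
def standard_conv_params(c_in: int, c_out: int, k: int) -> int:
    """Weights in an ordinary k x k convolution (bias ignored)."""
    return k * k * c_in * c_out


def depthwise_separable_params(c_in: int, c_out: int, k: int) -> int:
    """Weights in a k x k depthwise conv plus a 1 x 1 pointwise conv (bias ignored)."""
    depthwise = k * k * c_in   # one k x k filter per input channel
    pointwise = c_in * c_out   # 1 x 1 convolution that mixes channels
    return depthwise + pointwise


# Hypothetical layer size, chosen only for illustration (not from the paper).
c_in, c_out, k = 256, 256, 3
standard = standard_conv_params(c_in, c_out, k)
separable = depthwise_separable_params(c_in, c_out, k)
ratio = separable / standard   # algebraically equal to 1/c_out + 1/k**2

print(f"standard:  {standard} weights")
print(f"separable: {separable} weights")
print(f"ratio:     {ratio:.4f}")
```

For this assumed layer the separable form needs roughly 11.5% of the weights of the ordinary convolution. The ratio 1/c_out + 1/k^2 shows that the saving is governed mainly by the kernel size once the output channel count is large.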


Pages: 144337-144349

Number of pages: 13

