An Improved Target Detection General Framework Based on Yolov4

被引：2

作者：

Liu, Hao ^{[2
]}

Zhang, Lei ^{[1
]}

Xin, Shan ^{[3
]}

机构：

[1] Beijing Univ Civil Engn & Architecture, Dept Elect & Informat Engn, Beijing 102616, Peoples R China

[2] Beijing Univ Civil Engn & Architecture, Coll Elect & Informat Engn, Beijing 102616, Peoples R China

[3] Beijing Univ Civil Engn & Architecture, Dept Elect & Informat Engn, Beijing 102616, Peoples R China

来源：

2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (IEEE-ROBIO 2021) | 2021年

关键词：

Target Detection; Yolov4; Framework; Improved PANet; Improved CSPDarknet53;

D O I：

10.1109/ROBIO54168.2021.9739530

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The speed and precision of the target detection algorithm have received wide attention in the application. This paper proposes an improved network framework for the yolov4 algorithm, which reducing the parameters of the model without lowering too much accuracy. Firstly, the up-sampling and down-sampling links of the PANet are strengthened with an increased CBAM attention mechanism, which improve the algorithm ability to deal with the object occlusion. The depth separable convolution is introduced to reduce the amount of model parameters and improves the algorithm speed. Secondly, the Se-Net attention mechanism is added to the residual module of CSPDarknet53 to pay more attention to the channel. Thirdly, Soft-NMS is used to optimize the screening of the detection frame during the detection process. Experiments show that the comparison of VOC2007 dataset is 84.26%, and in the VOC2007+VOC2012 dataset is 91.2%. The detection speed of FPS has increased by 2.21.

引用

页码：1532 / 1536

页数：5

共 20 条

[1] Bochkovskiy A., 2020, IEEE C COMP VIS PATT, DOI DOI 10.48550/ARXIV.2004.10934
[2] Soft-NMS - Improving Object Detection With One Line of Code
Bodla, Navaneeth
Singh, Bharat
Chellappa, Rama
Davis, Larry S.
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 5562 - 5570
[3] Cao Jiale, 2021, IEEE T PATTERN ANAL
[4] Ge J., 2021, ARXIV PREPINT ARXIV2
[5] Rich feature hierarchies for accurate object detection and semantic segmentation
Girshick, Ross
Donahue, Jeff
Darrell, Trevor
Malik, Jitendra
[J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 580 - 587
[6] Searching for MobileNetV3
Howard, Andrew
Sandler, Mark
Chu, Grace
Chen, Liang-Chieh
Chen, Bo
Tan, Mingxing
Wang, Weijun
Zhu, Yukun
Pang, Ruoming
Vasudevan, Vijay
Le, Quoc V.
Adam, Hartwig
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1314 - 1324
[7] Hu J, 2018, PROC CVPR IEEE, P7132, DOI [10.1109/TPAMI.2019.2913372, 10.1109/CVPR.2018.00745]
[8] Jiang M., 2020, COMPUTER ENG SOFTWAR, P57
[9] Face detection techniques: a review
Kumar, Ashu
Kaur, Amandeep
Kumar, Munish
[J]. ARTIFICIAL INTELLIGENCE REVIEW, 2019, 52 (02) : 927 - 948
[10] Li K.W., 2020, COMPUTER SYSTEMS APP, P270

← 1 2 →