An Improved Target Detection General Framework Based on Yolov4

被引:2
作者
Liu, Hao [2 ]
Zhang, Lei [1 ]
Xin, Shan [3 ]
机构
[1] Beijing Univ Civil Engn & Architecture, Dept Elect & Informat Engn, Beijing 102616, Peoples R China
[2] Beijing Univ Civil Engn & Architecture, Coll Elect & Informat Engn, Beijing 102616, Peoples R China
[3] Beijing Univ Civil Engn & Architecture, Dept Elect & Informat Engn, Beijing 102616, Peoples R China
来源
2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (IEEE-ROBIO 2021) | 2021年
关键词
Target Detection; Yolov4; Framework; Improved PANet; Improved CSPDarknet53;
D O I
10.1109/ROBIO54168.2021.9739530
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The speed and precision of the target detection algorithm have received wide attention in the application. This paper proposes an improved network framework for the yolov4 algorithm, which reducing the parameters of the model without lowering too much accuracy. Firstly, the up-sampling and down-sampling links of the PANet are strengthened with an increased CBAM attention mechanism, which improve the algorithm ability to deal with the object occlusion. The depth separable convolution is introduced to reduce the amount of model parameters and improves the algorithm speed. Secondly, the Se-Net attention mechanism is added to the residual module of CSPDarknet53 to pay more attention to the channel. Thirdly, Soft-NMS is used to optimize the screening of the detection frame during the detection process. Experiments show that the comparison of VOC2007 dataset is 84.26%, and in the VOC2007+VOC2012 dataset is 91.2%. The detection speed of FPS has increased by 2.21.
引用
收藏
页码:1532 / 1536
页数:5
相关论文
共 20 条
  • [1] Bochkovskiy A., 2020, IEEE C COMP VIS PATT, DOI DOI 10.48550/ARXIV.2004.10934
  • [2] Soft-NMS - Improving Object Detection With One Line of Code
    Bodla, Navaneeth
    Singh, Bharat
    Chellappa, Rama
    Davis, Larry S.
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 5562 - 5570
  • [3] Cao Jiale, 2021, IEEE T PATTERN ANAL
  • [4] Ge J., 2021, ARXIV PREPINT ARXIV2
  • [5] Rich feature hierarchies for accurate object detection and semantic segmentation
    Girshick, Ross
    Donahue, Jeff
    Darrell, Trevor
    Malik, Jitendra
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 580 - 587
  • [6] Searching for MobileNetV3
    Howard, Andrew
    Sandler, Mark
    Chu, Grace
    Chen, Liang-Chieh
    Chen, Bo
    Tan, Mingxing
    Wang, Weijun
    Zhu, Yukun
    Pang, Ruoming
    Vasudevan, Vijay
    Le, Quoc V.
    Adam, Hartwig
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1314 - 1324
  • [7] Hu J, 2018, PROC CVPR IEEE, P7132, DOI [10.1109/TPAMI.2019.2913372, 10.1109/CVPR.2018.00745]
  • [8] Jiang M., 2020, COMPUTER ENG SOFTWAR, P57
  • [9] Face detection techniques: a review
    Kumar, Ashu
    Kaur, Amandeep
    Kumar, Munish
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2019, 52 (02) : 927 - 948
  • [10] Li K.W., 2020, COMPUTER SYSTEMS APP, P270