An improved YOLO-based method with lightweight C3 modules for object detection in resource-constrained environments

Citations: 0
Authors
Song, Jian [1]
Xie, Jie [1]
Wang, Qingwang [1]
Shen, Tao [1]
Affiliations
[1] Kunming Univ Sci & Technol, Fac Informat Engn & Automat, Kunming 650500, Yunnan, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Model lightweight design; Object detectors; Limited resources; YOLO
DOI
10.1007/s11227-025-07187-w
Chinese Library Classification
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
With the rapid advancement of deep learning algorithms, object detectors have achieved impressive performance in practical applications. An efficient detection framework is essential for performing detection tasks on devices with limited computational resources. However, current detection algorithms are often too complex for such devices, with large parameter counts and heavy computational demands. To overcome these challenges, this paper introduces a streamlined and effective detection method. Integrating the FasterNet Block into the Cross-Stage Partial Network (C3) of the backbone reduces computational and storage demands. Additionally, introducing cross-scale feature fusion in the neck network further decreases the computational load and parameter count during inference. Meanwhile, the dynamic head with multi-scale processing and Shape-IoU enhances detection accuracy and robustness, achieving a balance between lightweight design and performance. Compared to the original YOLOv5 models, the proposed lightweight method reduces the number of parameters by 29.4 to 43.0% and compresses the model size by 31.6 to 42.7% while maintaining a high mAP@0.5. Furthermore, the designed models achieve a faster inference speed, since computation is reduced by more than 30%. In robustness experiments under varying lighting conditions, the proposed model demonstrates stable detection performance even in challenging lighting scenarios, showing its reliability in real-world applications. In conclusion, our research offers considerable improvements in model accuracy, parameter efficiency, and size compared to mainstream object detection algorithms.
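The abstract credits part of the savings to placing a FasterNet Block inside the C3 module. As a rough illustration only, the sketch below compares the weight count and multiply-accumulate count of a standard convolution with those of a partial convolution (PConv), the operator introduced in the FasterNet paper, which convolves only a fraction of the channels and passes the rest through unchanged. The 1/4 channel ratio, the layer sizes, and all function names are assumptions chosen for illustration and are not taken from this paper; a full FasterNet block also adds pointwise convolutions that this sketch omits.

```python
def conv_params(c_in: int, c_out: int, k: int) -> int:
    """Weight count of a bias-free k x k convolution."""
    return c_in * c_out * k * k


def conv_flops(c_in: int, c_out: int, k: int, h: int, w: int) -> int:
    """Multiply-accumulates of a stride-1 conv producing an h x w output map."""
    return h * w * conv_params(c_in, c_out, k)


def pconv_params(channels: int, k: int, ratio: float = 0.25) -> int:
    """Weight count of a partial conv: only `ratio` of the channels are convolved.

    The 0.25 default is an illustrative assumption, not a value from this paper.
    """
    c_p = int(channels * ratio)  # channels that actually pass through the conv
    return conv_params(c_p, c_p, k)


def pconv_flops(channels: int, k: int, h: int, w: int, ratio: float = 0.25) -> int:
    """Multiply-accumulates of a partial conv on an h x w feature map."""
    return h * w * pconv_params(channels, k, ratio)


def reduction_percent(baseline: float, reduced: float) -> float:
    """Relative saving of `reduced` with respect to `baseline`, in percent."""
    return 100.0 * (baseline - reduced) / baseline


if __name__ == "__main__":
    # Hypothetical layer: 64 channels, 3x3 kernel, 40x40 feature map.
    channels, k, h, w = 64, 3, 40, 40
    full = conv_params(channels, channels, k)
    part = pconv_params(channels, k)
    print("standard conv weights:", full)
    print("partial conv weights:", part)
    print("per-layer saving (%):", reduction_percent(full, part))
    print("standard conv MACs:", conv_flops(channels, channels, k, h, w))
    print("partial conv MACs:", pconv_flops(channels, k, h, w))
```

This is a single-layer figure under the stated assumptions. It is not comparable to the whole-model reductions reported in the abstract, which also reflect the neck and head changes and the layers left unmodified.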
Pages: 23
Related papers
38 in total
  • [1] Recent trend in medical imaging modalities and their applications in disease diagnosis: a review
    Abhisheka, Barsha
    Biswas, Saroj Kumar
    Purkayastha, Biswajit
    Das, Dolly
    Escargueil, Alexandre
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (14) : 43035 - 43070
  • [2] Bochkovskiy A, 2020, arXiv, arXiv:2004.10934, DOI 10.48550/arXiv.2004.10934
  • [3] Carion Nicolas, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12346), P213, DOI 10.1007/978-3-030-58452-8_13
  • [4] Chang Y, 2023, IEEE Access
  • [5] Run, Don't Walk: Chasing Higher FLOPS for Faster Neural Networks
    Chen, Jierun
    Kao, Shiu-Hong
    He, Hao
    Zhuo, Weipeng
    Wen, Song
    Lee, Chul-Ho
    Chan, S. -H. Gary
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 12021 - 12031
  • [6] Dai JF, 2016, ADV NEUR IN, V29
  • [7] Dynamic Head: Unifying Object Detection Heads with Attentions
    Dai, Xiyang
    Chen, Yinpeng
    Xiao, Bin
    Chen, Dongdong
    Liu, Mengchen
    Yuan, Lu
    Zhang, Lei
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 7369 - 7378
  • [8] RepVGG: Making VGG-style ConvNets Great Again
    Ding, Xiaohan
    Zhang, Xiangyu
    Ma, Ningning
    Han, Jungong
    Ding, Guiguang
    Sun, Jian
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 13728 - 13737
  • [9] Dong C, 2024, Expert Systems with Applications
  • [10] The Pascal Visual Object Classes (VOC) Challenge
    Everingham, Mark
    Van Gool, Luc
    Williams, Christopher K. I.
    Winn, John
    Zisserman, Andrew
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) : 303 - 338