An improved YOLO-based method with lightweight C3 modules for object detection in resource-constrained environments

被引：0

作者：

Song, Jian ^{[1
]}

Xie, Jie ^{[1
]}

Wang, Qingwang ^{[1
]}

Shen, Tao ^{[1
]}

机构：

[1] Kunming Univ Sci & Technol, Fac Informat Engn & Automat, Kunming 650500, Yunnan, Peoples R China

来源：

JOURNAL OF SUPERCOMPUTING | 2025年 / 81卷 / 05期

基金：

中国国家自然科学基金;

关键词：

Model lightweight design; Object detectors; Limited resources; YOLO;

D O I：

10.1007/s11227-025-07187-w

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

With the rapid advancement of deep learning algorithms, object detectors have achieved impressive performance in practical applications. An efficient detection framework is essential for performing detection tasks on devices with limited computational resources. However, current detection algorithms often face challenges due to their complexity, including numerous parameters and significant computational demands. To overcome these challenges, this paper introduces a streamlined and effective detection method. The integration of the FasterNet Block into the Cross-Stage Partial Network (C3) of the backbone reduces computational and storage demands. Additionally, by introducing cross-scale feature fusion in the neck network, the computational load and parameter requirements during inference are further decreased. Meanwhile, the dynamic head with multi-scale processing and Shape-IoU enhances detection accuracy and robustness, achieving a balance between lightweight design and performance. Compared to the original YOLOv5 models, the proposed lightweight method reduces the number of parameters by 29.4 to 43.0%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\%$$\end{document} and compresses the size of the model by 31.6 to 42.7%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\%$$\end{document} while maintaining a high mAP@0.5. Furthermore, the designed models achieve a faster inference speed since the computations could be reduced by more than 30%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\%$$\end{document}. In robustness experiments under varying lighting conditions, the proposed model demonstrates stable detection performance even in challenging lighting scenarios, showing its reliability in real-world applications. In conclusion, our research offers considerable improvements in model accuracy, parameter efficiency, and size compared to the mainstream object detection algorithms.

引用

页数：23

共 38 条

[1] Recent trend in medical imaging modalities and their applications in disease diagnosis: a review
Abhisheka, Barsha
Biswas, Saroj Kumar
Purkayastha, Biswajit
Das, Dolly
Escargueil, Alexandre
[J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (14) : 43035 - 43070
[2] Bochkovskiy A, 2020, Arxiv, DOI [arXiv:2004.10934, 10.48550/arXiv.2004.10934, DOI 10.48550/ARXIV.2004.10934]
[3] Carion Nicolas, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12346), P213, DOI 10.1007/978-3-030-58452-8_13
[4] Chang Y, 2023, IEEE Access
[5] Run, Don't Walk: Chasing Higher FLOPS for Faster Neural Networks
Chen, Jierun
Kao, Shiu-Hong
He, Hao
Zhuo, Weipeng
Wen, Song
Lee, Chul-Ho
Chan, S. -H. Gary
[J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 12021 - 12031
[6] Dai JF, 2016, ADV NEUR IN, V29
[7] Dynamic Head: Unifying Object Detection Heads with Attentions
Dai, Xiyang
Chen, Yinpeng
Xiao, Bin
Chen, Dongdong
Liu, Mengchen
Yuan, Lu
Zhang, Lei
[J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 7369 - 7378
[8] RepVGG: Making VGG-style ConvNets Great Again
Ding, Xiaohan
Zhang, Xiangyu
Ma, Ningning
Han, Jungong
Ding, Guiguang
Sun, Jian
[J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 13728 - 13737
[9] Dong C, 2024, Expert Systems with Applications
[10] The Pascal Visual Object Classes (VOC) Challenge
Everingham, Mark
Van Gool, Luc
Williams, Christopher K. I.
Winn, John
Zisserman, Andrew
[J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) : 303 - 338

← 1 2 3 4 →