Improved lightweight infrared road target detection method based on YOLOv8

被引:0
作者
Yao, Jialong [1 ]
Xu, Sheng [1 ]
Feijiang, Huang [2 ]
Su, Chengyue [1 ]
机构
[1] Guangdong Univ Technol, Sch Phys & Optoelect Engn, Guangzhou 510006, Peoples R China
[2] Guangzhou Maritime Univ, Coll Informat & Commun Engn, Guangzhou 510725, Peoples R China
关键词
Deep learning; Infrared target detection; YOLOv8; Lightweight; Edge devices; NETWORK;
D O I
10.1016/j.infrared.2024.105497
中图分类号
TH7 [仪器、仪表];
学科分类号
0804 ; 080401 ; 081102 ;
摘要
Infrared-based road scene object detection algorithms often face issues with excessive parameters and computational demands, making them incompatible with edge devices having constrained computational capabilities. This paper introduces an enhanced lightweight infrared-based road object detection algorithm based on YOLOv8n. Firstly, a streamlined network architecture is devised by merging YOLOv8n's C2f module with PConv, creating a lighter module and reducing the neural network's downsampling rate of infrared images. This strategy reduces redundant computations and memory access, preventing the loss of fine details in infrared images caused by deep convolutional neural networks. Additionally, the model's accuracy in detecting infrared targets is significantly enhanced through the integration of the coordinate attention mechanism. Finally, replacing CIoU with Wise-IoU for bounding box regression in YOLOv8n accelerates the model's convergence. Empirical findings indicate that in contrast to the YOLOv8n algorithm, the optimized model showcases a 34.17 % reduction in model size, a 40.35 % decrease in parameters, and a 4.8 % increase in average detection accuracy. This enhanced algorithm not only achieves a lightweight profile but also delivers superior performance on embedded edge devices.
引用
收藏
页数:11
相关论文
共 41 条
[1]  
Bochkovskiy A, 2020, Arxiv, DOI arXiv:2004.10934
[2]   Run, Don't Walk: Chasing Higher FLOPS for Faster Neural Networks [J].
Chen, Jierun ;
Kao, Shiu-Hong ;
He, Hao ;
Zhuo, Weipeng ;
Wen, Song ;
Lee, Chul-Ho ;
Chan, S. -H. Gary .
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, :12021-12031
[3]   Xception: Deep Learning with Depthwise Separable Convolutions [J].
Chollet, Francois .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1800-1807
[4]  
Howard AG, 2017, Arxiv, DOI [arXiv:1704.04861, DOI 10.48550/ARXIV.1704.04861]
[5]  
Gevorgyan Z, 2022, Arxiv, DOI [arXiv:2205.12740, DOI 10.48550/ARXIV.2205.12740]
[6]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448
[7]   Rich feature hierarchies for accurate object detection and semantic segmentation [J].
Girshick, Ross ;
Donahue, Jeff ;
Darrell, Trevor ;
Malik, Jitendra .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :580-587
[8]   GhostNet: More Features from Cheap Operations [J].
Han, Kai ;
Wang, Yunhe ;
Tian, Qi ;
Guo, Jianyuan ;
Xu, Chunjing ;
Xu, Chang .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :1577-1586
[9]   Coordinate Attention for Efficient Mobile Network Design [J].
Hou, Qibin ;
Zhou, Daquan ;
Feng, Jiashi .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :13708-13717
[10]   Searching for MobileNetV3 [J].
Howard, Andrew ;
Sandler, Mark ;
Chu, Grace ;
Chen, Liang-Chieh ;
Chen, Bo ;
Tan, Mingxing ;
Wang, Weijun ;
Zhu, Yukun ;
Pang, Ruoming ;
Vasudevan, Vijay ;
Le, Quoc V. ;
Adam, Hartwig .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :1314-1324