An Improved YOLOv8 Algorithm for Real-World Road Vehicle Object Detection

被引：0

作者：

Song, Yuhan ^{[1
]}

Tao, Gan ^{[2
]}

Ding, Haoran ^{[1
]}

机构：

[1] Wuhan Univ Technol, Sch Comp & Artificial Intelligence, Wuhan, Peoples R China

[2] Wuhan Univ Technol, Sch Econ, Wuhan, Peoples R China

来源：

2024 IEEE 7TH INTERNATIONAL CONFERENCE ON AUTOMATION, ELECTRONICS AND ELECTRICAL ENGINEERING, AUTEEE | 2024年

关键词：

Deep learning; Vehicle detection; Computer vision; YOLOv8;

D O I：

10.1109/AUTEEE62881.2024.10869803

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Vehicle detection plays a pivotal role in intelligent transportation systems and autonomous driving, ensuring safety, optimizing traffic flow, and supporting advanced driver assistance functions. However, dense vehicle distributions, occlusions, and differing target sizes within complex road environments often challenge the performance of existing detection models. In response, this work introduces an improved vehicle detection framework founded on YOLOv8s. Our design integrates a dual attention mechanism-merging CBAM and SE modules-into the Backbone, thereby reinforcing feature extraction and enhancing detection for smaller targets. Additionally, a cross-layer multi-scale feature fusion strategy, built upon ReP-GFPN, is incorporated into the Neck to boost multi-scale information sharing and strengthen detection across various target dimensions. We further replace the traditional CIOU loss with Wise-IoU, enabling the model to better handle difficult samples and occlusion scenarios. Experiments on the UA-DETRAC dataset demonstrate a 4.8% improvement in mAP@0.5, underscoring the effectiveness of these enhancements in demanding traffic conditions. This work offers a promising avenue for advancing vehicle detection systems in intelligent transportation and autonomous driving contexts.

引用

页码：152 / 156

页数：5

共 9 条

[1]

Hu J, 2018, PROC CVPR IEEE, P7132, DOI [10.1109/TPAMI.2019.2913372, 10.1109/CVPR.2018.00745]

[2]

Li M., 2024, Comput. Eng. Appl., V22, P20

[3] You Only Look Once: Unified, Real-Time Object Detection [J].

Redmon, Joseph ;

Divvala, Santosh ;

Girshick, Ross ;

Farhadi, Ali .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :779-788

[4]

Tong ZJ, 2023, Arxiv, DOI [arXiv:2301.10051, DOI 10.48550/ARXIV.2301.10051]

[5] UA-DETRAC: A new benchmark and protocol for multi-object detection and tracking [J].

Wen, Longyin ;

Du, Dawei ;

Cai, Zhaowei ;

Lei, Zhen ;

Chang, Ming-Ching ;

Qi, Honggang ;

Lim, Jongwoo ;

Yang, Ming-Hsuan ;

Lyu, Siwei .

COMPUTER VISION AND IMAGE UNDERSTANDING, 2020, 193

[6] CBAM: Convolutional Block Attention Module [J].

Woo, Sanghyun ;

Park, Jongchan ;

Lee, Joon-Young ;

Kweon, In So .

COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :3-19

[7]

Xu XZ, 2022, Arxiv, DOI arXiv:2211.15444

[8] Focal and efficient IOU loss for accurate bounding box regression [J].

Zhang, Yi-Fan ;

Ren, Weiqiang ;

Zhang, Zhang ;

Jia, Zhen ;

Wang, Liang ;

Tan, Tieniu .

NEUROCOMPUTING, 2022, 506 (146-157) :146-157

[9] Enhancing Geometric Factors in Model Learning and Inference for Object Detection and Instance Segmentation [J].

Zheng, Zhaohui ;

Wang, Ping ;

Ren, Dongwei ;

Liu, Wei ;

Ye, Rongguang ;

Hu, Qinghua ;

Zuo, Wangmeng .

IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (08) :8574-8586

← 1 →