YOLOv8-MPEB small target detection algorithm based on UAV images

被引：15

作者：

Xu, Wenyuan ^{[1
]}

Cui, Chuang ^{[1
]}

Ji, Yongcheng ^{[1
]}

Li, Xiang ^{[1
]}

Li, Shuai ^{[1
]}

机构：

[1] Northeast Forestry Univ, Sch Civil Engn & Transportat, Harbin 150040, Peoples R China

来源：

HELIYON | 2024年 / 10卷 / 08期

关键词：

YOLOv8; MobileNetV3; Attention mechanism; BiFPN; Small target detection;

D O I：

10.1016/j.heliyon.2024.e29501

中图分类号：

O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Target detection in Unmanned Aerial Vehicle (UAV) aerial images has gained significance within UAV application scenarios. However, UAV aerial images present challenges, including large-scale changes, small target sizes, complex scenes, and variable external factors, resulting in missed or false detections. This study proposes an algorithm for small target detection in UAV images based on an enhanced YOLOv8 model termed YOLOv8-MPEB. Firstly, the Cross Stage Partial Darknet53 (CSPDarknet53) backbone network is substituted with the lightweight MobileNetV3 backbone network, consequently reducing model parameters and computational complexity, while also enhancing inference speed. Secondly, a dedicated small target detection layer is intricately designed to optimize feature extraction for multi-scale targets. Thirdly, the integration of the Efficient Multi-Scale Attention (EMA) mechanism within the Convolution to Feature (C2f) module aims to enhance the extraction of vital features and suppress superfluous ones. Lastly, the utilization of a bidirectional feature pyramid network (BiFPN) in the Neck segment serves to ameliorate detection errors stemming from scale variations and complex scenes, thereby augmenting model generalization. The study provides a thorough examination by conducting ablation experiments and comparing the results with alternative algorithms to substantiate the enhanced effectiveness of the proposed algorithm, with a particular focus on detection performance. The experimental outcomes illustrate that with a parameter count of 7.39 M and a model size of 14.5 MB, the algorithm attains a mean Average Precision (mAP) of 91.9 % on the custommade helmet and reflective clothing dataset. In comparison to standard YOLOv8 models, this algorithm elevates average accuracy by 2.2 percentage points, reduces model parameters by 34 %, and diminishes model size by 32 %. It outperforms other prevalent detection algorithms in terms of accuracy and speed.

引用

页数：18

共 45 条

[1]

Bai P., 2023, J. Eng. Sci., V45, P2108, DOI [10.13374/j.issn2095-9389.2022.11.11.006, DOI 10.13374/J.ISSN2095-9389.2022.11.11.006]

[2]

Bochkovskiy A., 2020, YOLOv4: Optimal Speed and Accuracy of Object Detection

[3]

Bresler Guy, 2020, arXiv

[4] Requirements and Limitations of Thermal Drones for Effective Search and Rescue in Marine and Coastal Areas [J].

Burke, Claire ;

McWhirter, Paul R. ;

Veitch-Michaelis, Josh ;

McAree, Owen ;

Pointon, Harry A. G. ;

Wich, Serge ;

Longmore, Steve .

DRONES, 2019, 3 (04) :1-14

[5] Towards Large-Scale Small Object Detection: Survey and Benchmarks [J].

Cheng, Gong ;

Yuan, Xiang ;

Yao, Xiwen ;

Yan, Kebing ;

Zeng, Qinghua ;

Xie, Xingxing ;

Han, Junwei .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (11) :13467-13488

[6]

Cheng H., 2023, Radiotehnika, P1

[7]

Courbariaux M, 2016, Arxiv, DOI [arXiv:1511.00363, DOI 10.48550/ARXIV.1511.00363]

[8]

Deng Z., 2023, Computer Engineering and Applications, P1

[9] New trends in visual inspection of buildings and structures: Study for the use of drones [J].

Falorca, Jorge Furtado ;

Miraldes, Joao P. N. D. ;

Goncalves Lanzinha, Joao Carlos .

OPEN ENGINEERING, 2021, 11 (01) :734-743

[10]

Howard AG, 2017, Arxiv, DOI [arXiv:1704.04861, 10.48550/arXiv.1704.04861]

← 1 2 3 4 5 →