M-YOLOv8s: An improved small target detection algorithm for UAV aerial photography

Cited by: 3

Authors:

Duan, Siyao [1]

Wang, Ting [2]

Li, Tao [1]

Yang, Wankou [3]

Affiliations:

[1] Nanjing Univ Aeronaut & Astronaut, Sch Automat Engn, Nanjing 211106, Jiangsu, Peoples R China

[2] Nanjing Forestry Univ, Sch Informat Sci & Technol, Nanjing 210037, Jiangsu, Peoples R China

[3] Southeast Univ, Sch Automat, Nanjing 210096, Jiangsu, Peoples R China

Source:

JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION | 2024 / Vol. 104

Keywords:

UAV; Small object detection; YOLOv8; Deep learning; Multi-scale fusion; Attention mechanism

DOI:

10.1016/j.jvcir.2024.104289

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology]

Discipline classification code:

0812

Abstract:

Targets in UAV imagery are usually small and set against complicated backgrounds. This paper proposes M-YOLOv8s, an object detection model for UAV aerial photography scenes. Firstly, to address the YOLOv8s model's poor adaptation to small target detection, a small target detection head (STDH) module is introduced to fuse the location and appearance feature information from the shallow layers of the backbone network. Secondly, Inner-Wise intersection over union (Inner-WIoU) is designed as the bounding box regression loss, and an auxiliary boundary calculation is used to speed up the model's regression. Thirdly, a multi-scale feature pyramid network (MS-FPN) structure effectively combines shallow network information with deep network information and improves the performance of the detection model. Furthermore, a multi-scale cross-spatial attention (MCSA) module is proposed that expands the feature space through multi-scale branches and then aggregates target features through cross-spatial interaction, which improves the model's ability to extract target features. Finally, experimental results show that the model not only has fewer parameters but also achieves mAP(0.5) values 6.6% and 5.4% higher than the baseline model on the VisDrone2019 validation and test datasets, respectively. In conclusion, M-YOLOv8s achieves better detection performance than several existing models, indicating that the proposed method is better suited to detecting small targets.
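
The abstract does not spell out how the auxiliary boundary is computed. As a rough illustration only, the sketch below assumes one common construction: both the predicted and ground-truth boxes are rescaled about their own centres by a ratio, and IoU is measured on those auxiliary boxes. The function names, the `ratio` parameter, and its default value are hypothetical and are not taken from the paper; the weighting that turns this into the full Inner-WIoU loss is not shown.

```python
def scaled_box(box, ratio):
    """Scale an (x1, y1, x2, y2) box about its own centre by `ratio`."""
    x1, y1, x2, y2 = box
    cx, cy = (x1 + x2) / 2.0, (y1 + y2) / 2.0
    half_w = (x2 - x1) * ratio / 2.0
    half_h = (y2 - y1) * ratio / 2.0
    return (cx - half_w, cy - half_h, cx + half_w, cy + half_h)


def iou(a, b):
    """Plain intersection over union of two (x1, y1, x2, y2) boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def auxiliary_iou(pred, target, ratio=0.75):
    """IoU measured on auxiliary boxes (assumed construction, see lead-in).

    ratio < 1 shrinks both boxes, ratio > 1 enlarges them,
    and ratio == 1 reduces to the plain IoU.
    """
    return iou(scaled_box(pred, ratio), scaled_box(target, ratio))


if __name__ == "__main__":
    pred = (0.0, 0.0, 10.0, 10.0)
    target = (5.0, 0.0, 15.0, 10.0)
    print(iou(pred, target))
    print(auxiliary_iou(pred, target, ratio=0.5))
    print(auxiliary_iou(pred, target, ratio=2.0))
```

Under this assumption, a ratio below 1 makes the overlap measure stricter for boxes that are already close, while a ratio above 1 still yields a non-zero overlap for boxes that are farther apart, which is one way an auxiliary box can change how quickly regression converges.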


Pages: 13
