M-YOLOv8s: An improved small target detection algorithm for UAV aerial photography☆

被引:3
作者
Duan, Siyao [1 ]
Wang, Ting [2 ]
Li, Tao [1 ]
Yang, Wankou [3 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Sch Automat Engn, Nanjing 211106, Jiangsu, Peoples R China
[2] Nanjing Forestry Univ, Sch Informat Sci & Technol, Nanjing 210037, Jiangsu, Peoples R China
[3] Southeast Univ, Sch Automat, Nanjing 210096, Jiangsu, Peoples R China
关键词
UAV; Small object detection; YOLOv8; Deep learning; Multi-scale fusion; Attention mechanism;
D O I
10.1016/j.jvcir.2024.104289
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The object of UAV target detection usually means small target with complicated backgrounds. In this paper, an object detection model M-YOLOv8s based on UAV aerial photography scene is proposed. Firstly, to solve the problem that the YOLOv8s model cannot adapt to small target detection, a small target detection head (STDH) module is introduced to fuse the location and appearance feature information of the shallow layers of the backbone network. Secondly, Inner-Wise intersection over union (Inner-WIoU) is designed as the boundary box regression loss, and auxiliary boundary calculation is used to accelerate the regression speed of the model. Thirdly, the structure of multi-scale feature pyramid network (MS-FPN) can effectively combine the shallow network information with the deep network information and improve the performance of the detection model. Furthermore, a multi-scale cross-spatial attention (MCSA) module is proposed to expand the feature space through multi-scale branch, and then achieves the aggregation of target features through cross-spatial interaction, which improves the ability of the model to extract target features. Finally, the experimental results show that our model does not only possess fewer parameters, but also the values of mAP(0.5) are 6.6% and 5.4% higher than the baseline model on the Visdrone2019 validation dataset and test dataset, respectively. Then, as a conclusion, the M-YOLOv8s model achieves better detection performance than some existing ones, indicating that our proposed method can be more suitable for detecting the small targets.
引用
收藏
页数:13
相关论文
共 57 条
  • [1] Bochkovskiy Alexey, YOLOv4: Optimal Speed and Accuracy of Object Detection
  • [2] UAV small target detection algorithm based on an improved YOLOv5s model
    Cao, Shihai
    Wang, Ting
    Li, Tao
    Mao, Zehui
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 97
  • [3] Case instance segmentation of small farmland based on Mask R-CNN of feature pyramid network with double attention mechanism in high resolution satellite images
    Cao, Yangyang
    Zhao, Zuoxi
    Huang, Yuan
    Lin, Xu
    Luo, Shuyuan
    Xiang, Borui
    Yang, Houcheng
    [J]. COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2023, 212
  • [4] Carion N., 2020, PROC EUR C COMPUT VI, P213
  • [5] R-CNN for Small Object Detection
    Chen, Chenyi
    Liu, Ming-Yu
    Tuzel, Oncel
    Xiao, Jianxiong
    [J]. COMPUTER VISION - ACCV 2016, PT V, 2017, 10115 : 214 - 230
  • [6] CHEN HC, 1995, J AM SOC INFORM SCI, V46, P194, DOI 10.1002/(SICI)1097-4571(199504)46:3<194::AID-ASI4>3.0.CO
  • [7] 2-S
  • [8] DSW-YOLO: A detection method for ground-planted strawberry fruits under different occlusion levels
    Du, Xiaoqiang
    Cheng, Hongchao
    Ma, Zenghong
    Lu, Wenwu
    Wang, Mengxiang
    Meng, Zhichao
    Jiang, Chengjie
    Hong, Fangwei
    [J]. COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2023, 214
  • [9] Object Detection with Discriminatively Trained Part-Based Models
    Felzenszwalb, Pedro F.
    Girshick, Ross B.
    McAllester, David
    Ramanan, Deva
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (09) : 1627 - 1645
  • [10] Ge Z., 2021, ARXIV, DOI [10.48550/arXiv.2107.08430, 10.48550/ARXIV.2107.08430]