SOD-YOLO: Small-Object-Detection Algorithm Based on Improved YOLOv8 for UAV Images

被引:12
作者
Li, Yangang [1 ]
Li, Qi [1 ,2 ]
Pan, Jie [2 ]
Zhou, Ying [1 ]
Zhu, Hongliang [1 ]
Wei, Hongwei [1 ]
Liu, Chong [1 ]
机构
[1] Qilu Aerosp Informat Res Inst, Jinan 250132, Peoples R China
[2] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing 100190, Peoples R China
关键词
object detection; UAV; small objects; feature fusion;
D O I
10.3390/rs16163057
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The rapid development of unmanned aerial vehicle (UAV) technology has contributed to the increasing sophistication of UAV-based object-detection systems, which are now extensively utilized in civilian and military sectors. However, object detection from UAV images has numerous challenges, including significant variations in the object size, changing spatial configurations, and cluttered backgrounds with multiple interfering elements. To address these challenges, we propose SOD-YOLO, an innovative model based on the YOLOv8 model, to detect small objects in UAV images. The model integrates the receptive field convolutional block attention module (RFCBAM) in the backbone network to perform downsampling, improving feature extraction efficiency and mitigating the spatial information sparsity caused by downsampling. Additionally, we developed a novel neck architecture called the balanced spatial and semantic information fusion pyramid network (BSSI-FPN) designed for multi-scale feature fusion. The BSSI-FPN effectively balances spatial and semantic information across feature maps using three primary strategies: fully utilizing large-scale features, increasing the frequency of multi-scale feature fusion, and implementing dynamic upsampling. The experimental results on the VisDrone2019 dataset demonstrate that SOD-YOLO-s improves the mAP50 indicator by 3% compared to YOLOv8s while reducing the number of parameters and computational complexity by 84.2% and 30%, respectively. Compared to YOLOv8l, SOD-YOLO-l improves the mAP50 indicator by 7.7% and reduces the number of parameters by 59.6%. Compared to other existing methods, SODA-YOLO-l achieves the highest detection accuracy, demonstrating the superiority of the proposed method.
引用
收藏
页数:26
相关论文
共 60 条
  • [1] Adaimi G., 2020, arXiv
  • [2] Bochkovskiy A, 2020, Arxiv, DOI [arXiv:2004.10934, 10.48550/arXiv.2004.10934]
  • [3] Vehicle Detection From UAV Imagery With Deep Learning: A Review
    Bouguettaya, Abdelmalek
    Zarzour, Hafed
    Kechida, Ahmed
    Taberkit, Amine Mohammed
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (11) : 6047 - 6067
  • [4] Deep Learning Approach in Aerial Imagery for Supporting Land Search and Rescue Missions
    Bozic-Stulic, Dunja
    Marusic, Zeljko
    Gotovac, Sven
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2019, 127 (09) : 1256 - 1278
  • [5] Road Traffic Monitoring from UAV Images Using Deep Learning Networks
    Byun, Sungwoo
    Shin, In-Kyoung
    Moon, Jucheol
    Kang, Jiyoung
    Choi, Sang-Il
    [J]. REMOTE SENSING, 2021, 13 (20)
  • [6] Cascade R-CNN: High Quality Object Detection and Instance Segmentation
    Cai, Zhaowei
    Vasconcelos, Nuno
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (05) : 1483 - 1498
  • [7] GCL-YOLO: A GhostConv-Based Lightweight YOLO Network for UAV Small Object Detection
    Cao, Jinshan
    Bao, Wenshu
    Shang, Haixing
    Yuan, Ming
    Cheng, Qian
    [J]. REMOTE SENSING, 2023, 15 (20)
  • [8] VisDrone-DET2021: The Vision Meets Drone Object detection Challenge Results
    Cao, Yaru
    He, Zhijian
    Wang, Lujia
    Wang, Wenguan
    Yuan, Yixuan
    Zhang, Dingwen
    Zhang, Jinglin
    Zhu, Pengfei
    Van Gool, Luc
    Han, Junwei
    Hoi, Steven
    Hu, Qinghua
    Liu, Ming
    Cheng, Chong
    Liu, Fanfan
    Cao, Guojin
    Li, Guozhen
    Wang, Hongkai
    He, Jianye
    Wan, Junfeng
    Wan, Qi
    Zhao, Qi
    Lyu, Shuchang
    Zhao, Wenzhe
    Lu, Xiaoqiang
    Zhu, Xingkui
    Liu, Yingjie
    Lv, Yixuan
    Ma, Yujing
    Yang, Yuting
    Wang, Zhe
    Xu, Zhenyu
    Luo, Zhipeng
    Zhang, Zhimin
    Zhang, Zhiguang
    Li, Zihao
    Zhang, Zixiao
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 2847 - 2854
  • [9] Chang YC, 2018, IEEE IMAGE PROC, P1917, DOI 10.1109/ICIP.2018.8451144
  • [10] Towards Large-Scale Small Object Detection: Survey and Benchmarks
    Cheng, Gong
    Yuan, Xiang
    Yao, Xiwen
    Yan, Kebing
    Zeng, Qinghua
    Xie, Xingxing
    Han, Junwei
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (11) : 13467 - 13488