SOD-YOLO: Small-Object-Detection Algorithm Based on Improved YOLOv8 for UAV Images

被引:33
作者
Li, Yangang [1 ]
Li, Qi [1 ,2 ]
Pan, Jie [2 ]
Zhou, Ying [1 ]
Zhu, Hongliang [1 ]
Wei, Hongwei [1 ]
Liu, Chong [1 ]
机构
[1] Qilu Aerosp Informat Res Inst, Jinan 250132, Peoples R China
[2] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing 100190, Peoples R China
关键词
object detection; UAV; small objects; feature fusion;
D O I
10.3390/rs16163057
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The rapid development of unmanned aerial vehicle (UAV) technology has contributed to the increasing sophistication of UAV-based object-detection systems, which are now extensively utilized in civilian and military sectors. However, object detection from UAV images has numerous challenges, including significant variations in the object size, changing spatial configurations, and cluttered backgrounds with multiple interfering elements. To address these challenges, we propose SOD-YOLO, an innovative model based on the YOLOv8 model, to detect small objects in UAV images. The model integrates the receptive field convolutional block attention module (RFCBAM) in the backbone network to perform downsampling, improving feature extraction efficiency and mitigating the spatial information sparsity caused by downsampling. Additionally, we developed a novel neck architecture called the balanced spatial and semantic information fusion pyramid network (BSSI-FPN) designed for multi-scale feature fusion. The BSSI-FPN effectively balances spatial and semantic information across feature maps using three primary strategies: fully utilizing large-scale features, increasing the frequency of multi-scale feature fusion, and implementing dynamic upsampling. The experimental results on the VisDrone2019 dataset demonstrate that SOD-YOLO-s improves the mAP50 indicator by 3% compared to YOLOv8s while reducing the number of parameters and computational complexity by 84.2% and 30%, respectively. Compared to YOLOv8l, SOD-YOLO-l improves the mAP50 indicator by 7.7% and reduces the number of parameters by 59.6%. Compared to other existing methods, SODA-YOLO-l achieves the highest detection accuracy, demonstrating the superiority of the proposed method.
引用
收藏
页数:26
相关论文
共 59 条
[1]  
Adaimi G., 2020, arXiv, DOI DOI 10.48550/ARXIV.2009.07611
[2]  
Bochkovskiy A, 2020, Arxiv, DOI arXiv:2004.10934
[3]   Vehicle Detection From UAV Imagery With Deep Learning: A Review [J].
Bouguettaya, Abdelmalek ;
Zarzour, Hafed ;
Kechida, Ahmed ;
Taberkit, Amine Mohammed .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (11) :6047-6067
[4]   Deep Learning Approach in Aerial Imagery for Supporting Land Search and Rescue Missions [J].
Bozic-Stulic, Dunja ;
Marusic, Zeljko ;
Gotovac, Sven .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2019, 127 (09) :1256-1278
[5]   Road Traffic Monitoring from UAV Images Using Deep Learning Networks [J].
Byun, Sungwoo ;
Shin, In-Kyoung ;
Moon, Jucheol ;
Kang, Jiyoung ;
Choi, Sang-Il .
REMOTE SENSING, 2021, 13 (20)
[6]   Cascade R-CNN: High Quality Object Detection and Instance Segmentation [J].
Cai, Zhaowei ;
Vasconcelos, Nuno .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (05) :1483-1498
[7]   GCL-YOLO: A GhostConv-Based Lightweight YOLO Network for UAV Small Object Detection [J].
Cao, Jinshan ;
Bao, Wenshu ;
Shang, Haixing ;
Yuan, Ming ;
Cheng, Qian .
REMOTE SENSING, 2023, 15 (20)
[8]   VisDrone-DET2021: The Vision Meets Drone Object detection Challenge Results [J].
Cao, Yaru ;
He, Zhijian ;
Wang, Lujia ;
Wang, Wenguan ;
Yuan, Yixuan ;
Zhang, Dingwen ;
Zhang, Jinglin ;
Zhu, Pengfei ;
Van Gool, Luc ;
Han, Junwei ;
Hoi, Steven ;
Hu, Qinghua ;
Liu, Ming ;
Cheng, Chong ;
Liu, Fanfan ;
Cao, Guojin ;
Li, Guozhen ;
Wang, Hongkai ;
He, Jianye ;
Wan, Junfeng ;
Wan, Qi ;
Zhao, Qi ;
Lyu, Shuchang ;
Zhao, Wenzhe ;
Lu, Xiaoqiang ;
Zhu, Xingkui ;
Liu, Yingjie ;
Lv, Yixuan ;
Ma, Yujing ;
Yang, Yuting ;
Wang, Zhe ;
Xu, Zhenyu ;
Luo, Zhipeng ;
Zhang, Zhimin ;
Zhang, Zhiguang ;
Li, Zihao ;
Zhang, Zixiao .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, :2847-2854
[9]  
Chang YC, 2018, IEEE IMAGE PROC, P1917, DOI 10.1109/ICIP.2018.8451144
[10]   Towards Large-Scale Small Object Detection: Survey and Benchmarks [J].
Cheng, Gong ;
Yuan, Xiang ;
Yao, Xiwen ;
Yan, Kebing ;
Zeng, Qinghua ;
Xie, Xingxing ;
Han, Junwei .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (11) :13467-13488