Real⁃time dense small object detection algorithm for UAV based on improved YOLOv5

被引：0

作者：

Feng Z. ^{[1
]}

Xie Z. ^{[1
]}

Bao Z. ^{[2
]}

Chen K. ^{[3
]}

机构：

[1] School of Information Science and Engineering, Ningbo University, Ningbo

[2] Ningbo JIWANG Information Technology Ltd, Ningbo

[3] School of Mechanical Engineering and Mechanics, Ningbo University, Ningbo

来源：

Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica | 2023年 / 44卷 / 07期

基金：

中国国家自然科学基金;

关键词：

attention mechanism; feature fusion; self-attention mechanism; small object detection; UAV;

D O I：

10.7527/S1000-6893.2022.27106

中图分类号：

学科分类号：

摘要：

UAV aerial images have more complex backgrounds and a large number of dense small targets compared with natural scene images, which impose higher requirements on the detection network. On the premise of ensuring real-time object detection, a YOLOv5-based UAV real-time dense small object detection algorithm is proposed for the problem of low accuracy of dense small object detection in UAV view. First, combining Spatial Attention Module (SAM) with Channel Attention Module (CAM), the fully connected layer after feature compression in CAM is improved to re⁃ duce the computational effort. In addition, the connection structure of CAM and SAM is changed to improve the spatial dimensional feature capture capability. In summary, a Spatial-Channel Attention Module (SCAM) is proposed to im⁃ prove the model's attention to the aggregated regions of small targets in the feature map; secondly, an SCAM- based Attentional Feature Fusion module (SC-AFF) is proposed to enhance the feature fusion efficiency of small targets by adaptively assigning attentional weights according to feature maps of different scales; finally, a backbone network is in⁃ troduced in the Transformer in the backbone network, and use the SC-AFF to improve the feature fusion at the original residual connections to better capture global information and rich contextual information, and improve the feature ex⁃ traction capability of dense small targets in complex backgrounds. Experiments are conducted on the VisDrone2021 dataset. The effects of different network scale parameters and different input resolutions on the detection accuracy and speed of YOLOv5 are first investigated. The analysis concludes that YOLOv5s is more suitable to be used as a bench⁃ mark model for UAV real-time object detection. Under the benchmark of YOLOv5s, the improved model improves mAP50 by 6. 4% and mAP75 by 5. 8%, and the FPS for high-resolution images can reach 46. The mAP50 of the model trained at an input resolution of 1504×1504 can reach 54. 5%, which is 11. 5% better than that of YOLOv4. The accuracy is improved while the detection speed FPS remains at 46, which is more suitable for real-time UAV object de⁃ tection in dense small target scenarios. © 2023 AAAS Press of Chinese Society of Aeronautics and Astronautics. All rights reserved.

引用

共 27 条

[1] JIANG B, LI Y D，, Et al., Object detection in UAV imagery based on deep learning：Review［J］, Acta Aeronautica et Astronautica Sinica, 42, 4, (2021)
[2] Faster RCNN：Towards real-time object detection with region proposal networks［J］, IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, 6, pp. 1137-1149, (2017)
[3] 2016 IEEE Conference on Computer Vision and Pattern Recognition, pp. 779-788, (2016)
[4] LIU W，, ANGUELOV D，, ERHAN D，, Et al., SSD：Single shot MultiBox detector［C］∥European Conference on Computer Vision （ECCV）, pp. 21-37, (2016)
[5] REDMON J，, FARHADI A., YOLO9000： Better，faster，stronger［C］∥2017 IEEE Conference on Computer Vision and Pattern Recognition, pp. 6517-6525, (2017)
[6] FARHADI A., YOLOv3：An incremental improvement［DB/OL］, (2018)
[7] WANG C Y, LIAO H Y M., YOLOv4：Optimal speed and accuracy of object detection ［DB/OL］, (2020)
[8] LI K C, WANG X Q，, LIN H, Et al., Survey of one-stage small object detection methods in deep learning［J］, Journal of Frontiers of Computer Science and Technology, 16, 1, pp. 41-58, (2022)
[9] WANG Q C，, ZHANG H，, HONG X G，, Et al., Small object detection based on modified FSSD and model compression［J］, 2021 IEEE 6th International Conference on Signal and Image Processing（ICSIP）, pp. 88-92, (2021)
[10] GONG Y Q, DING Y，, Et al., Effective fusion factor in FPN for tiny object detection［C］∥2021 IEEE Winter Conference on Applications of Computer Vision, pp. 1159-1167, (2021)

← 1 2 3 →