An efficient feature aggregation network for small object detection in UAV aerial images

被引:2
作者
Liu, Xiangqian [1 ]
Zhang, Guangwei [2 ]
Zhou, Bing [1 ]
机构
[1] Zhengzhou Univ, Coll Comp Sci & Artificial Intelligence, Zhengzhou 450000, Henan, Peoples R China
[2] Beijing Univ Posts & Telecommun, Sch Comp Sci, Beijing 100876, Peoples R China
关键词
Small objects; UAV aerial images; Multi-scale feature; Object detection;
D O I
10.1007/s11227-025-06987-4
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Unmanned aerial vehicles (UAVs) possess high mobility and a wide field of view, leading to challenges such as a high proportion of small objects, significant variation in object size, object aggregation, and complex backgrounds in aerial images. Existing object detection methods often overlook the texture information in high-level features, which is crucial for detecting small objects in complex backgrounds. To improve the detection performance of small objects in complex scenes, we propose an efficient feature aggregation network (EFA-Net) based on YOLOv7. The backbone of the network seamlessly integrates a lightweight hybrid feature extraction module (LHFE), which replaces traditional convolutions with depthwise convolutions and employs a hybrid channel attention mechanism to capture local and global information concurrently. This design can effectively reduce the parameters without sacrificing detection accuracy and enhance the network's representative capacity. In the neck, we design an innovative adaptive multi-scale feature fusion module (AMSFM) that improves the model's adaptability to small objects and complex backgrounds by fusing multi-scale features with high-level semantic information and capturing the texture information in high-level features. Additionally, we incorporate a residual spatial pyramid pooling (RSPP) module to strengthen information fusion from various receptive fields and reduce the interference of complex backgrounds on small object detection. To further improve the model's robustness and generalization ability, we propose an enhanced complete intersection over union (ECIoU) loss function to balance the influence of large and small objects during training. Experimental results demonstrate the effectiveness of the proposed method, achieving mAP50\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${mAP_{50}}$$\end{document} scores of 51.6% and 48.5%, and mAP scores of 29.6% and 29.5% on the VisDrone 2019 and UAVDT datasets, respectively.
引用
收藏
页数:26
相关论文
共 45 条
[21]   A CNN-Transformer Hybrid Model Based on CSWin Transformer for UAV Image Object Detection [J].
Lu, Wanjie ;
Lan, Chaozhen ;
Niu, Chaoyang ;
Liu, Wei ;
Lyu, Liang ;
Shi, Qunshan ;
Wang, Shiju .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 :1211-1231
[22]   LightUAV-YOLO: a lightweight object detection model for unmanned aerial vehicle image [J].
Lyu, Yifan ;
Zhang, Tianze ;
Li, Xin ;
Liu, Aixun ;
Shi, Gang .
JOURNAL OF SUPERCOMPUTING, 2025, 81 (01)
[23]   A survey on Image Data Augmentation for Deep Learning [J].
Shorten, Connor ;
Khoshgoftaar, Taghi M. .
JOURNAL OF BIG DATA, 2019, 6 (01)
[24]   GD-PAN: a multiscale fusion architecture applied to object detection in UAV aerial images [J].
Sun, Fengxi ;
He, Ning ;
Li, Runjie ;
Wang, Xin ;
Xu, Sunhan .
MULTIMEDIA SYSTEMS, 2024, 30 (03)
[25]   RSOD: Real-time small object detection algorithm in UAV-based traffic monitoring [J].
Sun, Wei ;
Dai, Liang ;
Zhang, Xiaorui ;
Chang, Pengshuai ;
He, Xiaozheng .
APPLIED INTELLIGENCE, 2022, 52 (08) :8448-8463
[26]   Small object change detection in UAV imagery via a Siamese network enhanced with temporal mutual attention and contextual features: A case study concerning solar water heaters [J].
Tao, Shikang ;
Yang, Mengyuan ;
Wang, Min ;
Yang, Rui ;
Shen, Qian .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2024, 218 :352-367
[27]  
Vaswani A, 2017, ADV NEUR IN, V30
[28]   A multi-center federated learning mechanism based on consortium blockchain for data secure sharing [J].
Wang, Bin ;
Tian, Zhao ;
Liu, Xinrui ;
Xia, Yujie ;
She, Wei ;
Liu, Wei .
KNOWLEDGE-BASED SYSTEMS, 2025, 310
[29]   YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors [J].
Wang, Chien-Yao ;
Bochkovskiy, Alexey ;
Liao, Hong-Yuan Mark .
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, :7464-7475
[30]  
Wang JW, 2018, 2018 21ST INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION), P439, DOI 10.23919/ICIF.2018.8455565