AEFFNet: Attention Enhanced Feature Fusion Network for Small Object Detection in UAV Imagery

Cited by: 3

Authors:

Nian, Zhaoyu [1]

Yang, Wenzhu [1,2]

Chen, Hao [1]

Affiliations:

[1] Hebei Univ, Sch Cyber Secur & Comp, Baoding 071002, Hebei, Peoples R China

[2] Hebei Machine Vis Engn Res Ctr, Baoding 071002, Hebei, Peoples R China

Source:

IEEE ACCESS | 2025 / Vol. 13

Keywords:

Feature extraction; Autonomous aerial vehicles; Head; Detectors; Attention mechanisms; Neck; Location awareness; Accuracy; YOLO; Semantics; Object detection; small object detection; attention mechanism; multi-scale feature fusion;

DOI:

10.1109/ACCESS.2025.3538873

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

The rapid advancement of unmanned aerial vehicle (UAV) technology has markedly increased the use of drone-captured imagery across various applications, which in turn demands higher accuracy and real-time performance in UAV image detection. To address the specific challenges posed by small and densely distributed objects in such images, we introduce an attention enhanced feature fusion network (AEFFNet) designed for small object detection in UAV imagery. First, a hybrid attention module with associated multi-axis frequency and spatial attention is designed to enhance feature extraction for small objects. Second, an adjacent layer feature fusion module is proposed to improve detection of small and occluded objects. Finally, a series of experiments is conducted on the VisDrone2023 dataset, which contains a large number of small objects photographed by drones. The results demonstrate substantial improvements over the YOLOv8m baseline model, with a 3.0% increase in mean Average Precision (mAP) and a 4.4% rise in AP50.

Citation

Pages: 26494-26505

Page count: 12
