Fast Remote Sensing Image Object Detection Algorithm Based on Attention Feature Fusion

Authors
Wu, Jiancheng [1]
Guo, Rongzuo [1]
Cheng, Jiawei [1]
Zhang, Hao [1]
Affiliations
[1] College of Computer Science, Sichuan Normal University, Chengdu
Keywords
attention mechanism; feature pyramid; object detection; remote sensing image; YOLO
DOI
10.3778/j.issn.1002-8331.2303-0375
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
To address the complex backgrounds, numerous small targets, and difficult feature extraction typical of remote sensing images, this paper proposes YOLO-Aff, a fast remote sensing image object detection algorithm based on attention feature fusion. The algorithm designs a backbone network module with channel attention (ECALAN) and a blur pool (BP) module to reduce the information loss caused by downsampling. In addition, a feature pyramid network without strided convolution (SPD-FPN) is combined with a SimAM attention feature fusion module (CBSA) to enhance cross-scale feature fusion. Finally, Wise-IoU is used as the network's coordinate loss to mitigate the sample imbalance problem. Experimental results show that YOLO-Aff achieves an mAP of 96% on the NWPU VHR-10 dataset, 2.9 percentage points higher than the original algorithm, providing a new solution for fast, high-precision object detection in remote sensing images. © 2019 Remedium Group Ltd. All rights reserved.
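The abstract does not spell out how SPD-FPN avoids strided convolution. As an illustration only, the sketch below assumes the usual space-to-depth rearrangement: each 2x2 spatial block is moved into the channel dimension, so resolution is halved without discarding any values. The function name `space_to_depth` and the nested-list tensor layout (channels x height x width) are hypothetical choices for this example, not taken from the paper.

```python
def space_to_depth(feature_map, block=2):
    """Rearrange a C x H x W nested list into (C*block*block) x H/block x W/block.

    Hypothetical helper for illustration; not the authors' implementation.
    """
    channels = len(feature_map)
    height = len(feature_map[0])
    width = len(feature_map[0][0])
    if height % block or width % block:
        raise ValueError("height and width must be divisible by block")
    out = []
    # Each (dy, dx) offset inside a block becomes its own group of channels.
    for dy in range(block):
        for dx in range(block):
            for c in range(channels):
                out.append([
                    [feature_map[c][y][x] for x in range(dx, width, block)]
                    for y in range(dy, height, block)
                ])
    return out


# One 4x4 channel holding the values 0..15.
x = [[[r * 4 + c for c in range(4)] for r in range(4)]]
y = space_to_depth(x)
print(len(y), len(y[0]), len(y[0][0]))  # channels, height, width after rearrangement
```

In contrast to a stride-2 convolution or pooling, every input value is still present in the output, which is one plausible reason such a layer helps with small targets; a non-strided convolution would typically follow to mix the enlarged channel dimension.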
Pages: 207-216
Page count: 9
Related papers
26 in total
[1] CAO Y, WANG J, JIN Y, et al., Few-shot object detection via association and discrimination, Advances in Neural Information Processing Systems, pp. 16570-16581, (2021)
[2] TIAN Z, SHEN C, CHEN H, et al., FCOS: fully convolutional one-stage object detection, Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 9627-9636, (2019)
[3] ZHU X, SU W, LU L, et al., Deformable DETR: deformable transformers for end-to-end object detection, (2020)
[4] LIU Z, LIN Y, CAO Y, et al., Swin transformer: hierarchical vision transformer using shifted windows, Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 10012-10022, (2021)
[5] JEUNE P L, MOKRAOUI A., A comparative attention framework for better few-shot object detection on aerial images, (2022)
[6] LI C, WANG K, DING C C, et al., Improved feature fusion network for small object detection in remote sensing images, Computer Engineering and Applications, 59, 17, pp. 232-241, (2023)
[7] XU X, FENG Z, CAO C, et al., An improved swin transformer-based model for remote sensing object detection and instance segmentation, Remote Sensing, 13, 23, (2021)
[8] REDMON J, DIVVALA S, GIRSHICK R, et al., You only look once: unified, real-time object detection, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 779-788, (2016)
[9] REDMON J, FARHADI A., YOLO9000: better, faster, stronger, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7263-7271, (2017)
[10] REDMON J, FARHADI A., YOLOv3: an incremental improvement, (2018)