Feature Pyramid Full Granularity Attention Network for Object Detection in Remote Sensing Imagery

被引:0
作者
Liu, Chang [1 ]
Qi, Xiao [1 ]
Yin, Hang [1 ]
Song, Bowei [1 ]
Li, Ke [1 ]
Shen, Fei [2 ]
机构
[1] China Elect Standardizat Inst, Beijing, Peoples R China
[2] Nanjing Univ Sci & Technol, Nanjing, Peoples R China
来源
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT X, ICIC 2024 | 2024年 / 14871卷
关键词
Object Detection; Remote Sensing; Convolutional Neural Networks; Attention;
D O I
10.1007/978-981-97-5609-4_26
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the rapid advancement of deep learning, particularly the emergence of attention mechanisms applied to convolutional neural networks (CNNs), object detection in high-resolution remote sensing images has seen significant progress. However, due to the CNNs' inability to capture long-range dependencies and the high computational cost of the attention mechanism, object detection in remote sensing images remains a challenging task. To address these issues, this paper introduces a novel feature pyramid full granularity attention module (FPFGAM) designed to learn long-range dependencies, dynamically attend to strongly correlated features, and reduce GPU memory overhead. Initially, we perform adaptive filtering of feature regions at the coarse-grained level. This process reduces the computational burden caused by weakly correlated features. Subsequently, we perform fine-grained pixel-level queries on several strongly correlated regions to enhance long-range dependent feature learning. We propose a feature pyramid full granularity attention network (FPFGANet) by embedding the feature pyramid full granularity attention module into the backbone network ResNet50 and the feature pyramid network (FPN). FPFGAM can be easily inserted into different layers to improve object detection accuracy in remote sensing images. Finally, we evaluate our method on three commonly used public remote sensing object detection datasets: NWPU VHR-10 and DIOR. The empirical results confirm the effectiveness of our approach.
引用
收藏
页码:332 / 353
页数:22
相关论文
共 79 条
[1]  
Beal J., 2020, arXiv, DOI [10.48550/arXiv.2012.09958, DOI 10.48550/ARXIV.2012.09958]
[2]  
Cao JX, 2020, Arxiv, DOI [arXiv:2005.11475, 10.48550/arXiv.2005.11475, DOI 10.48550/ARXIV.2005.11475]
[3]   Object Detection in Remote Sensing Images Based on a Scene-Contextual Feature Pyramid Network [J].
Chen, Chaoyue ;
Gong, Weiguo ;
Chen, Yongliang ;
Li, Weihong .
REMOTE SENSING, 2019, 11 (03)
[4]   Sparse attention block: Aggregating contextual information for object detection [J].
Chen, Chunlin ;
Yu, Jun ;
Ling, Qiang .
PATTERN RECOGNITION, 2022, 124
[5]   Multi-Scale Spatial and Channel-wise Attention for Improving Object Detection in Remote Sensing Imagery [J].
Chen, Jie ;
Wan, Li ;
Zhu, Jingru ;
Xu, Gang ;
Deng, Min .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2020, 17 (04) :681-685
[6]  
Chen OY, 2020, IEEE WINT CONF APPL, P756, DOI 10.1109/WACV45572.2020.9093499
[7]  
Chen K, 2019, Arxiv, DOI [arXiv:1906.07155, DOI 10.48550/ARXIV.1906.07155]
[8]   Cross-Scale Feature Fusion for Object Detection in Optical Remote Sensing Images [J].
Cheng, Gong ;
Si, Yongjie ;
Hong, Hailong ;
Yao, Xiwen ;
Guo, Lei .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2021, 18 (03) :431-435
[9]   Multi-class geospatial object detection and geographic image classification based on collection of part detectors [J].
Cheng, Gong ;
Han, Junwei ;
Zhou, Peicheng ;
Guo, Lei .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2014, 98 :119-132
[10]   Dynamic DETR: End-to-End Object Detection with Dynamic Attention [J].
Dai, Xiyang ;
Chen, Yinpeng ;
Yang, Jianwei ;
Zhang, Pengchuan ;
Yuan, Lu ;
Zhang, Lei .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :2968-2977