Channel Self-Attention Based Multiscale Spatial-Frequency Domain Network for Oriented Object Detection in Remote Sensing Imagery

被引:0
|
作者
Xu, Yang [1 ]
Pan, Yushan [1 ]
Wu, Zebin [1 ]
Wei, Zhihui [1 ]
Zhan, Tianming [2 ,3 ]
机构
[1] Nanjing Univ Sci & Technol NJUST, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China
[2] Nanjing Audit Univ, Jiangsu Key Construct Lab Audit Informat Engn, Nanjing 211815, Peoples R China
[3] Nanjing Audit Univ, Sch Informat Engn, Nanjing 211815, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷
基金
中国国家自然科学基金;
关键词
Feature extraction; Frequency-domain analysis; Detectors; Remote sensing; Object detection; Data mining; Attention mechanisms; Wavelet transforms; Convolution; Semantics; Fusion features; Haar wavelet transform; oriented object detection; remote sensing imagery; spatial-frequency domain;
D O I
10.1109/TGRS.2024.3500013
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
The detection of oriented objects in remote sensing images remains a daunting challenge due to their complex backgrounds, various sizes, and especially arbitrary orientations. However, most of the existing methods only model the structural features of the images in the spatial domain, while the horizontal convolution kernels limit the model's ability to perceive object direction information. Furthermore, the frequency features contain rich information about scale, texture, and angle, which can be a good complement to the spatial features. Inspired by this, we propose a multiscale spatial-frequency domain network (MSFN) to utilize spatial-frequency information for oriented object detection, which can be integrated into any convolutional neural network (CNN) architectures seamlessly and perform end-to-end training easily. Firstly, multiscale Haar wavelet transforms are leveraged to extract the multiscale frequency domain features from the image. Subsequently, channel alignment feature fusion module (CA-FFM) is proposed to fuse the high-level semantic features extracted by CNN with the low-level texture features extracted by the wavelet transform in multiscale. Finally, a channel self-attention (CSA)-based spatial-frequency feature perception module (SFPM) is designed to perform self-attention weighted aggregation on the fused features along the channel dimension, thereby constructing a novel spatial-frequency feature extraction backbone network for oriented object detector in remote sensing images. Experimental results on the DOTA and HRSC2016 datasets validate the effectiveness and universality of the proposed method.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] Object Detection by Channel and Spatial Exchange for Multimodal Remote Sensing Imagery
    Nan, Guozheng
    Zhao, Yue
    Fu, Liyong
    Ye, Qiaolin
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 8581 - 8593
  • [22] Attention-Based Mean-Max Balance Assignment for Oriented Object Detection in Optical Remote Sensing Images
    Lin, Qifeng
    Chen, Nuo
    Huang, Haibin
    Zhu, Daoye
    Fu, Gang
    Chen, Chuanxi
    Yu, Yuanlong
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63
  • [23] TSMGA: Temporal-Spatial Multiscale Graph Attention Network for Remote Sensing Change Detection
    Zhang, Xiaoyang
    Yuan, Genji
    Hua, Zhen
    Li, Jinjiang
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2025, 18 : 3696 - 3712
  • [24] Center-Boundary Dual Attention for Oriented Object Detection in Remote Sensing Images
    Liu, Shuai
    Zhang, Lu
    Lu, Huchuan
    He, You
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [25] Object Detection in Remote Sensing Imagery Based on Prototype Learning Network With Proposal Relation
    Ni, Kang
    Ma, Tengfei
    Zheng, Zhizhong
    Wang, Peng
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73
  • [26] Multi-Oriented Rotation-Equivariant Network for Object Detection on Remote Sensing Images
    Zhu, Kun
    Zhang, Xiaodong
    Chen, Guanzhou
    Li, Xianwei
    Cai, Peihua
    Liao, Puyun
    Wang, Tong
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [27] SFFNet: A Wavelet-Based Spatial and Frequency Domain Fusion Network for Remote Sensing Segmentation
    Yang, Yunsong
    Yuan, Genji
    Li, Jinjiang
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [28] Spatial-Temporal Based Multihead Self-Attention for Remote Sensing Image Change Detection
    Zhou, Yong
    Wang, Fengkai
    Zhao, Jiaqi
    Yao, Rui
    Chen, Silin
    Ma, Heping
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (10) : 6615 - 6626
  • [29] Learning Orientation Information From Frequency-Domain for Oriented Object Detection in Remote Sensing Images
    Zheng, Shangdong
    Wu, Zebin
    Xu, Yang
    Wei, Zhihui
    Plaza, Antonio
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [30] A fast self-attention cascaded network for object detection in large scene remote sensing images
    Hua, Xia
    Wang, Xinqing
    Rui, Ting
    Zhang, Haitao
    Wang, Dong
    APPLIED SOFT COMPUTING, 2020, 94 (94)