Remote Sensing Object Detection Based on Convolution and Swin Transformer

Cited by: 14

Authors:

Jiang, Xuzhao [1]

Wu, Yonghong [1]

Affiliations:

[1] Wuhan University of Technology, Department of Statistics, Wuhan 430070, People's Republic of China

Source:

IEEE Access | 2023, Vol. 11

Funding:

National Natural Science Foundation of China

Keywords:

Object detection; Feature extraction; Transformers; Remote sensing; Prediction algorithms; Detection algorithms; Classification algorithms; Remote sensing images; object detection; attention mechanism; swin transformer; multi-scale features;

DOI:

10.1109/ACCESS.2023.3267435

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

Remote sensing object detection is an essential task for surveying the earth. Target detection algorithms designed for natural scenes struggle to obtain satisfactory results on remote sensing images. In this paper, the RAST-YOLO (You only look once with Regin Attention and Swin Transformer) algorithm is proposed to address the problems of remote sensing object detection, such as significant differences in target scales, complex backgrounds, and tightly arranged small-size targets. To increase the information interaction range of the feature map, make full use of the background information of the object, and improve the detection accuracy for objects with complex backgrounds, the Regin Attention (RA) mechanism combined with Swin Transformer as the backbone is proposed to extract features. To improve the detection accuracy of small objects, the C3D module is used to fuse deep and shallow semantic information and address the multi-scale problem of remote sensing targets. To evaluate the performance of RAST-YOLO, extensive experiments are performed on the DIOR and TGRS-HRRSD datasets. The experimental results show that RAST-YOLO achieves state-of-the-art detection accuracy with high efficiency and robustness. Specifically, compared with the baseline network, the mean average precision (mAP) is improved by 5% and 2.3% on the DIOR and TGRS-HRRSD datasets, respectively, which demonstrates that RAST-YOLO is effective and superior. Moreover, the lightweight structure of RAST-YOLO maintains real-time detection speed while obtaining excellent detection results.

Pages: 38643-38656

Page count: 14
