RSMamba: Remote Sensing Image Classification With State Space Model

被引:120
作者
Chen, Keyan [1 ,2 ,3 ]
Chen, Bowen [1 ,2 ,3 ]
Liu, Chenyang [1 ,2 ,3 ]
Li, Wenyuan [4 ]
Zou, Zhengxia [3 ,5 ]
Shi, Zhenwei [1 ,2 ,3 ]
机构
[1] Beihang Univ, Image Proc Ctr, Sch Astronaut, Beijing 100191, Peoples R China
[2] Beihang Univ, State Key Lab Virtual Real Technol & Syst, Beijing 100191, Peoples R China
[3] Shanghai Artificial Intelligence Lab, Shanghai 100191, Peoples R China
[4] Univ Hong Kong, Dept Geog, Hong Kong, Peoples R China
[5] Beihang Univ, Sch Astronaut, Dept Guidance Nav & Control, Beijing 100191, Peoples R China
关键词
Backbone network; foundation model; image classification; Mamba; remote sensing images;
D O I
10.1109/LGRS.2024.3407111
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Remote sensing image classification forms the foundation of various understanding tasks, serving a crucial function in remote sensing image interpretation. The recent advancements of convolutional neural networks (CNNs) and transformers have markedly enhanced classification accuracy. Nonetheless, remote sensing scene classification remains a significant challenge, especially given the complexity and diversity of remote sensing scenarios and the variability of spatiotemporal resolutions. The capacity for whole-image understanding can provide more precise semantic cues for scene discrimination. In this letter, we introduce RSMamba, a novel architecture for remote sensing image classification. RSMamba is based on the state space model (SSM) and incorporates an efficient, hardware-aware design known as the Mamba. It integrates the advantages of both a global receptive field and linear modeling complexity. To overcome the limitation of the vanilla Mamba, which can only model causal sequences and is not adaptable to 2-D image data, we propose a dynamic multipath activation mechanism to augment Mamba's capacity to model noncausal data. Notably, RSMamba maintains the inherent modeling mechanism of the vanilla Mamba, yet exhibits superior performance across multiple remote sensing image classification datasets, e.g., F1 scores of 95.25, 92.63, and 95.18 on the UC Merced, AID, and RESISC45 classification datasets, respectively, exceeding those of concurrent Vim and VMamba. This indicates that RSMamba holds significant potential to function as the backbone of future visual foundation models. The code is available at https://github.com/KyanChen/RSMamba.
引用
收藏
页码:1 / 5
页数:5
相关论文
共 19 条
[1]   Contrastive Learning for Fine-Grained Ship Classification in Remote Sensing Images [J].
Chen, Jianqi ;
Chen, Keyan ;
Chen, Hao ;
Li, Wenyuan ;
Zou, Zhengxia ;
Shi, Zhenwei .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[2]  
Chen KY, 2023, IEEE T GEOSCI REMOTE, V61, DOI [10.1109/TGRS.2023.3283435, 10.1109/TGRS.2023.3272473]
[3]   Resolution-Agnostic Remote Sensing Scene Classification With Implicit Neural Representations [J].
Chen, Keyan ;
Li, Wenyuan ;
Chen, Jianqi ;
Zou, Zhengxia ;
Shi, Zhenwei .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
[4]   Building Extraction from Remote Sensing Images with Sparse Token Transformers [J].
Chen, Keyan ;
Zou, Zhengxia ;
Shi, Zhenwei .
REMOTE SENSING, 2021, 13 (21)
[5]   Remote Sensing Image Scene Classification: Benchmark and State of the Art [J].
Cheng, Gong ;
Han, Junwei ;
Lu, Xiaoqiang .
PROCEEDINGS OF THE IEEE, 2017, 105 (10) :1865-1883
[6]  
Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929
[7]  
Gu A, 2024, Arxiv, DOI [arXiv:2312.00752, 10.48550/arXiv.2312.00752]
[8]  
Gu AL, 2022, Arxiv, DOI [arXiv:2111.00396, DOI 10.48550/ARXIV.2111.00396, 10.48550/arXiv.2111.00396]
[9]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[10]   Deep learning for remote sensing image classification: A survey [J].
Li, Ying ;
Zhang, Haokui ;
Xue, Xizhe ;
Jiang, Yenan ;
Shen, Qiang .
WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2018, 8 (06)