RS-Mamba for Large Remote Sensing Image Dense Prediction

被引:61
作者
Zhao, Sijie [1 ]
Chen, Hao [2 ]
Zhang, Xueliang [1 ]
Xiao, Pengfeng [1 ]
Bai, Lei [2 ]
Ouyang, Wanli [2 ]
机构
[1] Nanjing Univ, Jiangsu Prov Key Lab Geog Informat Sci & Technol, Key Lab Land Satellite Remote Sensing Applicat, Sch Geog & Ocean Sci,Minist Nat Resources, Nanjing 210023, Peoples R China
[2] Shanghai Artificial Intelligence Lab, Shanghai 200000, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷
基金
中国国家自然科学基金;
关键词
Remote sensing; Task analysis; Context modeling; Transformers; Feature extraction; Predictive models; Complexity theory; Change detection (CD); deep learning; dense prediction; large remote sensing images; semantic segmentation (SS); state space model (SSM); very high resolution (VHR); NETWORK;
D O I
10.1109/TGRS.2024.3425540
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Context modeling is critical for remote sensing image dense prediction tasks. Nowadays, the growing size of very-high-resolution (VHR) remote sensing images poses challenges in effectively modeling context. While transformer-based models possess global modeling capabilities, they encounter computational challenges when applied to large VHR images due to their quadratic complexity. The conventional practice of cropping large images into smaller patches results in a notable loss of contextual information. To address these issues, we propose the remote sensing Mamba (RSM) for dense prediction tasks in large VHR remote sensing images. RSM is specifically designed to capture the global context of remote sensing images with linear complexity, facilitating the effective processing of large VHR images. Considering that the land covers in remote sensing images are distributed in arbitrary spatial directions due to characteristics of remote sensing over-head imaging, the RSM incorporates an omnidirectional selective scan module (OSSM) to globally model the context of images in multiple directions, capturing large spatial features from various directions. We designed simple yet effective models based on RSM, achieving state-of-the-art performance on dense prediction tasks in VHR remote sensing images without fancy training strategies. Extensive experiments on semantic segmentation (SS) and change detection (CD) tasks across various land covers demonstrate the effectiveness of the proposed RSM. Leveraging the linear complexity and global modeling capabilities, RSM achieves better efficiency and accuracy than transformer-based models on large remote sensing images. Interestingly, we also demonstrated that our model generally performs better with a larger image size on dense prediction tasks.
引用
收藏
页数:14
相关论文
共 66 条
[21]   Change Guiding Network: Incorporating Change Prior to Guide Change Detection in Remote Sensing Imagery [J].
Han, Chengxi ;
Wu, Chen ;
Guo, Haonan ;
Hu, Meiqi ;
Li, Jiepan ;
Chen, Hongruixuan .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 :8395-8407
[22]   HANet: A Hierarchical Attention Network for Change Detection With Bitemporal Very-High-Resolution Remote Sensing Images [J].
Han, Chengxi ;
Wu, Chen ;
Guo, Haonan ;
Hu, Meiqi ;
Chen, Hongruixuan .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 :3867-3878
[23]  
Hasani R., 2022, arXiv
[24]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[25]  
IEEE J., 2022, IEEE J. Sel. Topics Appl. Earth Observ. Remote Sens., V15, P4297
[26]   Fully Convolutional Networks for Multisource Building Extraction From an Open Aerial and Satellite Imagery Data Set [J].
Ji, Shunping ;
Wei, Shiqing ;
Lu, Meng .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2019, 57 (01) :574-586
[27]   RoadFormer: Pyramidal deformable vision transformers for road network extraction with remote sensing images [J].
Jiang, Xiaoling ;
Li, Yinyin ;
Jiang, Tao ;
Xie, Junhao ;
Wu, Yilong ;
Cai, Qianfeng ;
Jiang, Jinhui ;
Xu, Jiaming ;
Zhang, Hui .
INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2022, 113
[28]  
Kalman R.E., 1960, Journal of Basic Engineering, V82, P35, DOI DOI 10.1115/1.3662552
[29]  
Kingsbury D, 2015, P1, DOI [DOI 10.1021/bk-2015-1214.ch001, DOI 10.48550/ARXIV.1412.6980]
[30]   Hybrid-Scale Self-Similarity Exploitation for Remote Sensing Image Super-Resolution [J].
Lei, Sen ;
Shi, Zhenwei .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60