CSTSUNet: A Cross Swin Transformer-Based Siamese U-Shape Network for Change Detection in Remote Sensing Images

Cited by: 15
Authors
Wu, Yaping [1]
Li, Lu [1]
Wang, Nan [2]
Li, Wei [2]
Fan, Junfang [1]
Tao, Ran [2]
Wen, Xin [1]
Wang, Yanfeng [3]
Affiliations
[1] Beijing Informat Sci & Technol Univ, Sch Automat, Beijing 100192, Peoples R China
[2] Beijing Inst Technol, Sch Informat & Elect, Beijing 100081, Peoples R China
[3] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing 100094, Peoples R China
Source
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2023 / Vol. 61
Funding
Beijing Natural Science Foundation; National Natural Science Foundation of China;
Keywords
Feature extraction; Transformers; Remote sensing; Task analysis; Decoding; Convolutional neural networks; Computer architecture; Change detection (CD); deep learning; remote sensing (RS) image; transformer;
DOI
10.1109/TGRS.2023.3326813
Chinese Library Classification number
P3 [Geophysics]; P59 [Geochemistry];
Discipline classification code
0708 ; 070902 ;
Abstract
Change detection (CD) in remote sensing (RS) images is a critical task in which deep learning has achieved significant success. Current networks often employ pixel-based differencing, ratio-based, classification-based, or feature-concatenation methods to represent changes of interest. However, these methods fail to detect the desired changes effectively, because they are highly sensitive to factors such as atmospheric conditions, lighting variations, and phenological variations, which leads to detection errors. Inspired by the transformer structure, we adopt a cross-attention mechanism to extract feature differences between bitemporal images more robustly. The method rests on the assumption that if there is no change between an image pair, the semantic features of one temporal image can be well represented by the semantic features of the other. Conversely, if there is a change, the reconstruction errors are significant. Therefore, a Cross Swin transformer-based Siamese U-shaped network, namely CSTSUNet, is proposed for RS CD. CSTSUNet consists of an encoder, a difference feature extraction stage, and a decoder. The encoder is based on a hierarchical residual network (ResNet) with a Siamese U-Net structure, allowing parallel processing of the bitemporal images and extraction of multiscale features. The difference feature extraction stage consists of four difference feature extraction modules that compute difference features at multiple scales. A Cross Swin transformer is employed in each of these modules to exchange information between the bitemporal images. The decoder takes the multiscale difference features as input and injects details and boundaries iteratively, level by level, progressively refining the change map.
We conduct experiments on three public datasets, and the experimental results demonstrate that the proposed CSTSUNet outperforms other state-of-the-art methods in both qualitative and quantitative analyses. Our code is available at https://github.com/l7170/CSTSUNet.git.
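The reconstruction assumption stated in the abstract can be illustrated with a small sketch. This is not the authors' CSTSUNet code: it is a toy, pure-Python example that assumes plain scaled dot-product cross-attention over a handful of hand-made feature vectors, with no learned projections and no Swin-style windowing. The function names and the feature values are hypothetical. Each feature of the first image is reconstructed as an attention-weighted sum of the features of the second image; a feature that has a counterpart in the second image is reconstructed almost exactly, while a feature without a counterpart leaves a large residual.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attention_reconstruct(queries, memory):
    """Reconstruct each query vector as an attention-weighted sum of memory vectors.

    Assumption: keys and values are both the raw memory vectors
    (no learned projections), scaled dot-product attention.
    """
    d = len(memory[0])
    scale = math.sqrt(d)
    reconstructions = []
    for q in queries:
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / scale for k in memory]
        weights = softmax(scores)
        recon = [sum(w * k[i] for w, k in zip(weights, memory)) for i in range(d)]
        reconstructions.append(recon)
    return reconstructions


def reconstruction_errors(feats_t1, feats_t2):
    """L2 distance between each time-1 feature and its reconstruction from time-2 features."""
    recon = cross_attention_reconstruct(feats_t1, feats_t2)
    return [
        math.sqrt(sum((a - b) ** 2 for a, b in zip(f, r)))
        for f, r in zip(feats_t1, recon)
    ]


# Hypothetical toy features: the first two time-1 features also occur at time 2
# (unchanged), the third has no counterpart at time 2 (changed).
feats_t1 = [[4.0, 0.0, 0.0, 0.0], [0.0, 4.0, 0.0, 0.0], [0.0, 0.0, 0.0, 4.0]]
feats_t2 = [[4.0, 0.0, 0.0, 0.0], [0.0, 4.0, 0.0, 0.0], [0.0, 0.0, 4.0, 0.0]]

errors = reconstruction_errors(feats_t1, feats_t2)
for index, err in enumerate(errors):
    print(f"feature {index}: reconstruction error = {err:.4f}")
```

In this toy setting the first two errors are close to zero and the third is large, which mirrors the intuition that reconstruction error is a signal of change. The published network computes such differences with Cross Swin transformer blocks at four scales, which this sketch does not attempt to reproduce.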
Pages: 15