TransUNetCD: A Hybrid Transformer Network for Change Detection in Optical Remote-Sensing Images

Citations: 242
Authors
Li, Qingyang [1 ]
Zhong, Ruofei [1 ]
Du, Xin [1 ]
Du, Yu [1 ]
Affiliation
[1] Capital Normal Univ, Coll Resource Environm & Tourism, Beijing 10048, Peoples R China
Source
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2022 / Vol. 60
Funding
National Natural Science Foundation of China;
Keywords
Feature extraction; Transformers; Remote sensing; Semantics; Data mining; Image segmentation; Sensors; Change detection (CD); optical remote-sensing image; transformer; UNet; FUSION NETWORK; DATASET;
DOI
10.1109/TGRS.2022.3169479
Chinese Library Classification number
P3 [Geophysics]; P59 [Geochemistry];
Subject classification codes
0708 ; 070902 ;
Abstract
In the change detection (CD) task, the UNet architecture has achieved superior results. However, due to the inherent limitation of convolution operations, UNet is inadequate at learning global context and long-range spatial relations. Transformers can capture long-range feature dependencies, but their lack of low-level details may result in limited localization capabilities. Therefore, this article proposes an end-to-end encoding-decoding hybrid transformer model for CD, TransUNetCD, which has the advantages of both transformers and UNet. The model encodes the tokenized image patches from the convolutional neural network (CNN) feature map to extract rich global context information. The decoder upsamples the encoded features, connects them with higher-resolution multiscale features through skip connections to learn local-global semantic features, and restores the full spatial resolution of the feature map to achieve precise localization. The proposed model not only solves the problem that redundant information is generated when extracting low-level features under the UNet framework, but also solves the problem that the relationships between feature layers cannot be fully modeled and the optimal feature difference representation cannot be obtained. On this basis, we introduce a difference enhancement module to generate a difference feature map containing rich change information. By weighting each pixel and selectively aggregating features, the effectiveness of the network and the accuracy of extracting change features are improved. The results on multiple datasets demonstrate that, compared to state-of-the-art methods, TransUNetCD further reduces false alarms and missed detections and delineates the edges of changed areas more accurately. The model achieves the highest score on every metric among the compared baseline models and shows robust generalization ability.
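The abstract says the difference enhancement module weights each pixel of a difference feature map, but it does not give the formula. The following is only a minimal sketch of that general idea, not the authors' module: the function name, the absolute-difference operator, and the sigmoid gate over the mean channel difference are all assumptions chosen for illustration.

```python
import math


def pixelwise_weighted_difference(feat_a, feat_b):
    """Hypothetical sketch of per-pixel weighting of a difference feature map.

    feat_a, feat_b: nested lists shaped [H][W][C], standing in for the
    feature maps of the two acquisition dates.
    Returns (weighted_diff, weights) shaped [H][W][C] and [H][W].
    """
    weighted_diff, weights = [], []
    for row_a, row_b in zip(feat_a, feat_b):
        out_row, weight_row = [], []
        for pix_a, pix_b in zip(row_a, row_b):
            # Assumed difference operator: channel-wise absolute difference.
            diff = [abs(a - b) for a, b in zip(pix_a, pix_b)]
            # Assumed gate: sigmoid of the mean channel difference, so pixels
            # with larger change receive a weight closer to 1.
            mean_diff = sum(diff) / len(diff)
            weight = 1.0 / (1.0 + math.exp(-mean_diff))
            out_row.append([weight * d for d in diff])
            weight_row.append(weight)
        weighted_diff.append(out_row)
        weights.append(weight_row)
    return weighted_diff, weights


# Toy 1x2 feature maps with 2 channels: the first pixel is unchanged,
# the second differs in one channel.
feat_t1 = [[[0.0, 0.0], [1.0, 1.0]]]
feat_t2 = [[[0.0, 0.0], [1.0, 5.0]]]
weighted, gate = pixelwise_weighted_difference(feat_t1, feat_t2)
```

In this toy case the unchanged pixel keeps a zero difference vector, while the changed pixel receives the larger gate value. A learned module would replace the fixed sigmoid gate with trainable parameters.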
Pages: 19