SAAN: Similarity-Aware Attention Flow Network for Change Detection With VHR Remote Sensing Images

被引:17
作者
Guo, Haonan [1 ]
Su, Xin [2 ]
Wu, Chen [1 ]
Du, Bo [3 ,4 ]
Zhang, Liangpei [1 ]
机构
[1] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & Re, Wuhan 430079, Peoples R China
[2] Wuhan Univ, Sch Remote Sensing & Informat Engn, Wuhan 430079, Peoples R China
[3] Wuhan Univ, Inst Artificial Intelligence, Natl Engn Res Ctr Multimedia Software, Sch Comp Sci, Wuhan 430079, Peoples R China
[4] Wuhan Univ, Hubei Key Lab Multimedia & Network Commun Engn, Wuhan 430079, Peoples R China
关键词
Remote sensing image; change detection; similarity measurement; attention mechanism; MAD;
D O I
10.1109/TIP.2024.3349868
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Change detection (CD) is a fundamental and important task for monitoring the land surface dynamics in the earth observation field. Existing deep learning-based CD methods typically extract bi-temporal image features using a weight-sharing Siamese encoder network and identify change regions using a decoder network. These CD methods, however, still perform far from satisfactorily as we observe that 1) deep encoder layers focus on irrelevant background regions; and 2) the models' confidence in the change regions is inconsistent at different decoder stages. The first problem is because deep encoder layers cannot effectively learn from imbalanced change categories using the sole output supervision, while the second problem is attributed to the lack of explicit semantic consistency preservation. To address these issues, we design a novel similarity-aware attention flow network (SAAN). SAAN incorporates a similarity-guided attention flow module with deeply supervised similarity optimization to achieve effective change detection. Specifically, we counter the first issue by explicitly guiding deep encoder layers to discover semantic relations from bi-temporal input images using deeply supervised similarity optimization. The extracted features are optimized to be semantically similar in the unchanged regions and dissimilar in the changing regions. The second drawback can be alleviated by the proposed similarity-guided attention flow module, which incorporates similarity-guided attention modules and attention flow mechanisms to guide the model to focus on discriminative channels and regions. We evaluated the effectiveness and generalization ability of the proposed method by conducting experiments on a wide range of CD tasks. The experimental results demonstrate that our method achieves excellent performance on several CD tasks, with discriminative features and semantic consistency preserved.
引用
收藏
页码:2599 / 2613
页数:15
相关论文
共 61 条
[21]   Deep building footprint update network: A semi-supervised method for updating existing building footprint from bi-temporal remote sensing images [J].
Guo, Haonan ;
Shi, Qian ;
Marinoni, Andrea ;
Du, Bo ;
Zhang, Liangpei .
REMOTE SENSING OF ENVIRONMENT, 2021, 264
[22]   Scene-Driven Multitask Parallel Attention Network for Building Extraction in High-Resolution Remote Sensing Images [J].
Guo, Haonan ;
Shi, Qian ;
Du, Bo ;
Zhang, Liangpei ;
Wang, Dongzhi ;
Ding, Huaxiang .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (05) :4287-4306
[23]   Deep Multiscale Siamese Network With Parallel Convolutional Structure and Self-Attention for Change Detection [J].
Guo, Qingle ;
Zhang, Junping ;
Zhu, Shengyu ;
Zhong, Chongxiao ;
Zhang, Ye .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[24]  
Gupta Ritwik, 2019, arXiv
[25]  
Hadsell R., 2006, P IEEE COMP SOC C CO, V2, P1735
[26]  
He KM, 2015, Arxiv, DOI arXiv:1512.03385
[27]  
Hu J, 2019, Arxiv, DOI [arXiv:1709.01507, 10.48550/ARXIV.1709.01507]
[28]  
Kim M, 2022, Arxiv, DOI arXiv:2204.00964
[29]  
Kingma Diederik P, 2014, ARXIV PREPRINT ARXIV
[30]  
Lebedev M. A., 2018, Int. Arch. Photogramm. Remote. Sens. Spat. Inf. Sci., V42, P565, DOI [DOI 10.5194/ISPRS-ARCHIVES-XLII-2-565-2018, DOI 10.5194/ISPRS-ARCHIVES-XLII-2-565, 10.5194/isprs-archives-XLII-2]