MTSCD-Net: A network based on multi-task learning for semantic change detection of bitemporal remote sensing images

Cited by: 47
Authors
Cui, Fengzhi [1, 2]
Jiang, Jie [1, 2]
Affiliations
[1] Beihang Univ, Sch Instrumentat & Optoelect Engn, Beijing 100191, Peoples R China
[2] Beihang Univ, Key Lab Precis Optomech Technol, Minist Educ, Beijing 100191, Peoples R China
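Funding
National Natural Science Foundation of China
Keywords
Remote sensing; Multi-task learning; Siamese network; Deep learning; Semantic change detection
DOI
10.1016/j.jag.2023.103294
Chinese Library Classification number
TP7 [Remote sensing technology]
Discipline classification codes
081102; 0816; 081602; 083002; 1404
Abstract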
In recent years, change detection has been one of the most active research topics in remote sensing. Previous studies have concentrated on binary change detection (BCD), which does not meet current needs. Semantic change detection (SCD), which identifies the specific type of change as well as the changed areas, is therefore gradually developing. In this paper, we propose a multi-task learning method (MTSCD-Net) for the SCD task. The SCD task is decoupled into two related subtasks, semantic segmentation (SS) and BCD, which are unified under the same framework. Multi-scale features are extracted by a Siamese semantic-aware encoder based on Swin Transformer, and an aggregation module is designed to combine the features. A change information extraction module is then designed to enhance feature expression by fully integrating the two-level difference features generated from the fused features. Moreover, in the decoder stage, a spatial attention weight map is obtained from the features of the BCD subtask and provides location prior information for the features of the SS subtask, which helps fully explore the correlation between the two subtasks. The loss functions of the two subtasks are weighted to train MTSCD-Net. Comparative experimental results on two typical SCD datasets confirm the advantage of MTSCD-Net for the SCD task. For the SeK index, MTSCD-Net achieves 3.96% and 20.57% on the HRSCD and SECOND datasets, respectively, outperforming other compared methods such as Bi-SRNet (4.86% and 1.47% higher on the two datasets, respectively). The same is true for the Score metric. Moreover, the ablation experiment results confirm the effectiveness of the key modules.
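The two coupling mechanisms named in the abstract (a spatial attention map derived from BCD features that gates SS features, and a weighted sum of the two subtask losses) can be illustrated with a minimal sketch. This is not the authors' implementation: the channel-average-plus-sigmoid attention, the function names, and the default weights are all assumptions chosen for illustration, and plain nested lists stand in for feature tensors.

```python
import math


def sigmoid(x):
    """Logistic function mapping any real value into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def spatial_attention_map(bcd_feat):
    """Build an H x W weight map from BCD features shaped [C][H][W].

    Assumption: channels are averaged and squashed with a sigmoid.
    The paper may use a learned layer instead.
    """
    channels = len(bcd_feat)
    height, width = len(bcd_feat[0]), len(bcd_feat[0][0])
    return [
        [
            sigmoid(sum(bcd_feat[c][i][j] for c in range(channels)) / channels)
            for j in range(width)
        ]
        for i in range(height)
    ]


def apply_location_prior(ss_feat, attn):
    """Scale every SS channel ([C][H][W]) element-wise by the attention map."""
    return [
        [[value * attn[i][j] for j, value in enumerate(row)]
         for i, row in enumerate(channel)]
        for channel in ss_feat
    ]


def weighted_multitask_loss(loss_ss, loss_bcd, w_ss=1.0, w_bcd=1.0):
    """Weighted sum of the two subtask losses (weights are hypothetical)."""
    return w_ss * loss_ss + w_bcd * loss_bcd


# Toy usage: two BCD channels and one SS channel on a 1 x 2 grid.
bcd = [[[0.0, 0.0]], [[0.0, 0.0]]]
ss = [[[2.0, 4.0]]]
attn = spatial_attention_map(bcd)
gated = apply_location_prior(ss, attn)
total = weighted_multitask_loss(1.0, 2.0, w_ss=0.5, w_bcd=0.25)
```

In this toy case the BCD features are all zero, so every attention weight is 0.5 and the SS features are halved. In a real network the map would be larger where change is likely, which is how it can supply location prior information to the SS branch.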
Pages: 12