MTSCD-Net: A network based on multi-task learning for semantic change detection of bitemporal remote sensing images

被引:35
作者
Cui, Fengzhi [1 ,2 ]
Jiang, Jie [1 ,2 ]
机构
[1] Beihang Univ, Sch Instrumentat & Optoelect Engn, Beijing 100191, Peoples R China
[2] Beihang Univ, Key Lab Precis Optomech Technol, Minist Educ, Beijing 100191, Peoples R China
基金
中国国家自然科学基金;
关键词
Remote sensing; Multi -task learning; Siamese network; Deep learning; Semantic change detection;
D O I
10.1016/j.jag.2023.103294
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
In recent years, change detection has been one of the hot research topics within the field of remote sensing. Previous studies have concentrated on binary change detection (BCD), but it doesn't meet the current needs. Therefore, semantic change detection (SCD) is also gradually developing, which focuses on determining the specific changed type while obtaining changed areas. In the paper, we propose a multi-task learning method (MTSCD-Net) for SCD task. The SCD task is decoupled into two related subtasks, semantic segmentation (SS) and BCD, then unifies them under the same framework. Multi-scale features are extracted using the Siamese semantic-aware encoder based on Swin Transformer, and the aggregation module is designed to combine features. Then, the change information extraction module is designed to enhance the capacity to express features by fully integrating the two-level difference features that are generated from fused features. Moreover, in the decoder stage, the spatial attention weight map is obtained using the features of the BCD subtask, which provides location prior information for the features of the SS subtask. It helps fully explore the correlation between the two subtasks. The two loss functions of subtasks are weighted to train MTSCD-Net. The comparative experiments results on two typical SCD datasets confirm the advantage of MTSCD-Net for SCD task. For the SeK index, MTSCD-Net achieves 3.96% and 20.57% on HRSCD and SECOND datasets, respectively. This outperforms other comparative methods such as Bi-SRNet (which achieves 4.86% and 1.47% higher on two datasets, respectively). The same is true for the Score metric. Moreover, the ablation experiment results confirm the effectiveness of key modules.
引用
收藏
页数:12
相关论文
共 32 条
  • [1] Semantic Segmentation of Earth Observation Data Using Multimodal and Multi-scale Deep Networks
    Audebert, Nicolas
    Le Saux, Bertrand
    Lefevre, Sebastien
    [J]. COMPUTER VISION - ACCV 2016, PT I, 2017, 10111 : 180 - 196
  • [2] A Bayesian information theoretic model of learning to learn via multiple task sampling
    Baxter, J
    [J]. MACHINE LEARNING, 1997, 28 (01) : 7 - 39
  • [3] Multitask learning
    Caruana, R
    [J]. MACHINE LEARNING, 1997, 28 (01) : 41 - 75
  • [4] Chen LC, 2017, Arxiv, DOI arXiv:1706.05587
  • [5] Multitask learning for large-scale semantic change detection
    Daudt, Rodrigo Caye
    Le Saux, Bertrand
    Boulch, Alexandre
    Gousseau, Yann
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2019, 187
  • [6] Daudt RC, 2018, IEEE IMAGE PROC, P4063, DOI 10.1109/ICIP.2018.8451652
  • [7] Bi-Temporal Semantic Reasoning for the Semantic Change Detection in HR Remote Sensing Images
    Ding, Lei
    Guo, Haitao
    Liu, Sicong
    Mou, Lichao
    Zhang, Jing
    Bruzzone, Lorenzo
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [8] Dosovitskiy A, 2021, Arxiv, DOI [arXiv:2010.11929, DOI 10.48550/ARXIV.2010.11929]
  • [9] Hanxiang Hao, 2021, 2021 IEEE International Geoscience and Remote Sensing Symposium IGARSS, P4396, DOI 10.1109/IGARSS47720.2021.9554054
  • [10] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778