ChangeMask: Deep multi-task encoder-transformer-decoder architecture for semantic change detection

Citations: 132
Authors
Zheng, Zhuo [1]
Zhong, Yanfei [1, 2]
Tian, Shiqi [1]
Ma, Ailong [1]
Zhang, Liangpei [1, 2]
Affiliations
[1] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & R, Wuhan, Peoples R China
[2] Wuhan Univ, Hubei Prov Engn Res Ctr Nat Resources Remote Sens, Wuhan 430079, Peoples R China
Funding
National Natural Science Foundation of China; China Postdoctoral Science Foundation;
Keywords
Multi-task learning; Temporal symmetry; Change detection; Deep learning; Remote sensing; Multi-temporal; Semantic segmentation; LAND-COVER; CLASSIFICATION;
DOI
10.1016/j.isprsjprs.2021.10.015
Chinese Library Classification number
P9 [Physical Geography];
Discipline classification codes
0705 ; 070501 ;
Abstract
Multi-temporal high spatial resolution earth observation makes it possible to detect complex urban land surface changes, which is a significant and challenging task in the remote sensing community. Previous works mainly focus on binary change detection (BCD) based on modern technologies, e.g., deep fully convolutional networks (FCN), whereas deep network architectures for semantic change detection (SCD) are insufficiently explored in the current literature. In this paper, we propose a deep multi-task encoder-transformer-decoder architecture (ChangeMask) designed by exploring two important inductive biases: the semantic-change causal relationship and temporal symmetry. ChangeMask decouples SCD into a temporal-wise semantic segmentation and a BCD, and then integrates these two tasks into a general encoder-transformer-decoder framework. In the encoder part, we design a semantic-aware encoder to model the semantic-change causal relationship. This encoder is used only to learn the semantic representation; the change representation is then learned from the semantic representation via a later transformer module. In this way, the change representation can constrain the semantic representation during training, which introduces a regularization that reduces the risk of overfitting. To learn a robust change representation from the semantic representation, we propose a temporal-symmetric transformer (TST) to guarantee temporal symmetry for the change representation and keep it discriminative. Based on the above semantic representation and change representation, we adopt simple multi-task decoders to output the semantic change map. Benefiting from the differentiable building blocks, ChangeMask can be trained end to end with a multi-task loss function, which significantly simplifies the whole pipeline of applying ChangeMask. Comprehensive experimental results on two large-scale SCD datasets confirm the effectiveness and superiority of ChangeMask in SCD.
Besides, to demonstrate the potential value in real-world applications, e.g., automatic urban analysis and decision-making, we deploy ChangeMask to map a large geographic area covering 30 km² with 300 million pixels. Code will be made available.
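The two ideas named in the abstract, decoupling SCD into per-date segmentation plus BCD and enforcing temporal symmetry, can be illustrated with a small toy sketch. This is not the paper's implementation: the names `compose_semantic_change`, `symmetrize`, and `one_sided_gain` are hypothetical, the inputs are plain Python lists rather than network features, and summing both input orders is only one assumed way to obtain an order-invariant function, not necessarily how the TST is built.

```python
from typing import Callable, List, Optional, Sequence, Tuple

Label = int
Transition = Optional[Tuple[Label, Label]]
Vector = Sequence[float]


def compose_semantic_change(
    sem_t1: Sequence[Sequence[Label]],
    sem_t2: Sequence[Sequence[Label]],
    change_mask: Sequence[Sequence[int]],
) -> List[List[Transition]]:
    """Merge two per-date class maps and a binary change mask.

    Changed pixels get a (class_t1, class_t2) pair; unchanged pixels get None.
    """
    return [
        [(a, b) if changed else None for a, b, changed in zip(row1, row2, row_m)]
        for row1, row2, row_m in zip(sem_t1, sem_t2, change_mask)
    ]


def symmetrize(
    f: Callable[[Vector, Vector], List[float]]
) -> Callable[[Vector, Vector], List[float]]:
    """Wrap an order-dependent function so that g(x1, x2) == g(x2, x1)."""

    def g(x1: Vector, x2: Vector) -> List[float]:
        forward = f(x1, x2)   # date 1 -> date 2
        backward = f(x2, x1)  # date 2 -> date 1
        return [u + v for u, v in zip(forward, backward)]

    return g


def one_sided_gain(x1: Vector, x2: Vector) -> List[float]:
    """Toy, deliberately order-dependent feature: only increases count."""
    return [max(b - a, 0.0) for a, b in zip(x1, x2)]


# Toy 2x2 scene: class labels at two dates and a binary change mask.
sem_t1 = [[0, 1], [2, 2]]
sem_t2 = [[0, 3], [2, 1]]
change_mask = [[0, 1], [0, 1]]
scd_map = compose_semantic_change(sem_t1, sem_t2, change_mask)

# Toy feature vectors for the two dates.
symmetric_change = symmetrize(one_sided_gain)
feat_t1 = [1.0, 5.0]
feat_t2 = [4.0, 2.0]
```

Here `scd_map` records a class transition only where the mask flags change, and `symmetric_change` returns the same vector whichever date is passed first, whereas `one_sided_gain` alone does not. The sketch does not handle the case where the mask flags a pixel whose two class labels are equal.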
Pages: 228-239
Number of pages: 12