MDFA-Net: Multi-Scale Differential Feature Self-Attention Network for Building Change Detection in Remote Sensing Images

Times cited: 0
Authors
Li, Yuanling [1, 2]
Zou, Shengyuan [1, 2]
Zhao, Tianzhong [1, 2]
Su, Xiaohui [1, 2]
Affiliations
[1] Beijing Forestry Univ, Sch Informat Sci & Technol, Beijing 100083, Peoples R China
[2] Engn Res Ctr Forestry Oriented Intelligent Informa, Beijing 100083, Peoples R China
Keywords
change detection; multi-scale feature extraction; self-attention mechanism; DEEP;
DOI
10.3390/rs16183466
Chinese Library Classification
X [Environmental Science, Safety Science];
Discipline classification codes
08 ; 0830 ;
Abstract
Building change detection (BCD) from remote sensing images is essential to urban studies. In this well-developed field, Convolutional Neural Networks (CNNs) and Transformers have been leveraged to help BCD models handle multi-scale information. However, current models still struggle to detect subtle changes accurately, which has been the main bottleneck to improving detection accuracy. In this paper, a multi-scale differential feature self-attention network (MDFA-Net) is proposed to effectively integrate CNNs and Transformers by balancing the global receptive field of the self-attention mechanism against the local receptive field of convolutions. MDFA-Net introduces two modules. First, a hierarchical multi-scale dilated convolution (HMDConv) module extracts local features with hybrid dilated convolutions, which mitigates the CNN's local bias. Second, a differential feature self-attention (DFA) module applies the self-attention mechanism to multi-scale difference feature maps, addressing the loss of local details in the Transformer's global receptive field. MDFA-Net achieves state-of-the-art accuracy compared with related works, e.g., USSFC-Net, on three open datasets: WHU-CD, CDD-CD, and LEVIR-CD. In the experiments, MDFA-Net significantly exceeds other models in F1 score, IoU, and overall accuracy; its F1 scores are 93.81%, 95.52%, and 91.21% on the WHU-CD, CDD-CD, and LEVIR-CD datasets, respectively. Furthermore, MDFA-Net ranked first or second in precision and recall in testing on all three datasets, which indicates a better balance between precision and recall than other models. We also found that subtle changes, i.e., small-sized building changes and irregular boundary changes, are better detected thanks to the introduction of HMDConv and DFA.
In summary, with its better ability to leverage multi-scale differential information than traditional methods, MDFA-Net provides a novel and effective avenue for integrating CNNs and Transformers in BCD. Further studies could focus on reducing the model's sensitivity to hyperparameters and improving its generalizability in practical applications.
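The abstract reports precision, recall, F1 score, IoU, and overall accuracy. As a reading aid, the sketch below shows how these metrics are conventionally computed for binary change maps, treating "changed" as the positive class. It is not the authors' evaluation code: the names `ConfusionCounts`, `confusion_counts`, and `change_metrics` are hypothetical, and the flat 0/1 label sequences are an assumed input format.

```python
from dataclasses import dataclass


@dataclass
class ConfusionCounts:
    """Pixel-level counts for the 'changed' class (hypothetical container)."""
    tp: int  # predicted changed, actually changed
    fp: int  # predicted changed, actually unchanged
    fn: int  # predicted unchanged, actually changed
    tn: int  # predicted unchanged, actually unchanged


def confusion_counts(pred, truth):
    """Count TP/FP/FN/TN over two equally long flat sequences of 0/1 labels."""
    if len(pred) != len(truth):
        raise ValueError("pred and truth must have the same length")
    tp = fp = fn = tn = 0
    for p, t in zip(pred, truth):
        if p == 1 and t == 1:
            tp += 1
        elif p == 1 and t == 0:
            fp += 1
        elif p == 0 and t == 1:
            fn += 1
        else:
            tn += 1
    return ConfusionCounts(tp, fp, fn, tn)


def change_metrics(c):
    """Return the standard binary metrics; a zero denominator yields 0.0."""
    total = c.tp + c.fp + c.fn + c.tn
    precision = c.tp / (c.tp + c.fp) if (c.tp + c.fp) else 0.0
    recall = c.tp / (c.tp + c.fn) if (c.tp + c.fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    iou = c.tp / (c.tp + c.fp + c.fn) if (c.tp + c.fp + c.fn) else 0.0
    oa = (c.tp + c.tn) / total if total else 0.0
    return {"precision": precision, "recall": recall,
            "f1": f1, "iou": iou, "oa": oa}


if __name__ == "__main__":
    # Toy labels for illustration only; not data from the paper.
    pred = [1, 1, 0, 0, 1, 0]
    truth = [1, 0, 0, 1, 1, 0]
    print(change_metrics(confusion_counts(pred, truth)))
```

Because F1 and IoU ignore true negatives, they are more informative than overall accuracy when changed pixels are a small fraction of the image, which is typically the case in building change detection.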
Pages: 23