MSHFormer: A Multiscale Hybrid Transformer Network With Boundary Enhancement for VHR Remote Sensing Image Building Extraction

被引:0
作者
Zhu, Panpan [1 ]
Song, Zhichao [1 ]
Liu, Jiale [1 ]
Yan, Jiazheng [1 ]
Luo, Xiaobo [1 ]
Tao, Yuxiang [1 ]
机构
[1] Chongqing Univ Posts & Telecommun, Coll Comp Sci & Technol, Chongqing 400065, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2025年 / 63卷
基金
中国国家自然科学基金;
关键词
Buildings; Feature extraction; Transformers; Data mining; Accuracy; Decoding; Semantic segmentation; Image edge detection; Remote sensing; Convolutional neural networks; Building footprint extraction; edge enhancement; multiscale; remote sensing (RS) images; transformer;
D O I
10.1109/TGRS.2025.3545919
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Accurate and complete extraction of buildings from very high-resolution (VHR) remote sensing (RS) images is highly important for urban planning and land management. However, owing to the limited information available for small buildings and building boundaries, as well as challenges such as the spectral similarity of ground objects, tree occlusions, and shadow interference, automatically extracting buildings from VHR images remains challenging. These issues may result in building extraction errors such as misclassification, small building omissions, blurred boundaries, and incorrect segmentation. To address these challenges, we propose a multiscale hybrid transformer (MSHFormer) with boundary enhancement. This approach incorporates a hybrid encoder that combines a multiscale local perception (MSLP) module and a global perception module (GPM), combining the strengths of convolutional neural networks (CNNs) and transformers to achieve efficient synergy between global modeling and local feature extraction. In addition, we developed an edge enhancement module (EHM) to enhance boundary information, significantly improving building boundary segmentation accuracy. Finally, we design a group alignment feature fusion module (GAFFM) to efficiently integrate low-level features from the encoder with high-level features from the decoder, reducing feature space misalignment. The experimental results on three public datasets demonstrate the effectiveness of MSHFormer. Specifically, the proposed method achieves intersection-over-union (IoU) values of 89.1%, 73.6%, and 89.5% on the Potsdam, Massachusetts, and WHU datasets, respectively.
引用
收藏
页数:16
相关论文
共 59 条
  • [1] Alexey D, 2020, arXiv, DOI [arXiv:2010.11929, DOI 10.48550/ARXIV.2010.11929]
  • [2] [Anonymous], 2013, Machine learning for aerial image labeling
  • [3] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
    Badrinarayanan, Vijay
    Kendall, Alex
    Cipolla, Roberto
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
  • [4] Chen J, 2021, arXiv, DOI DOI 10.48550/ARXIV.2102.04306
  • [5] Building Extraction from Remote Sensing Images with Sparse Token Transformers
    Chen, Keyan
    Zou, Zhengxia
    Shi, Zhenwei
    [J]. REMOTE SENSING, 2021, 13 (21)
  • [6] Chen LC, 2017, Arxiv, DOI arXiv:1706.05587
  • [7] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs
    Chen, Liang-Chieh
    Papandreou, George
    Kokkinos, Iasonas
    Murphy, Kevin
    Yuille, Alan L.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) : 834 - 848
  • [8] A benchmark GaoFen-7 dataset for building extraction from satellite images
    Chen, Peimin
    Huang, Huabing
    Ye, Feng
    Liu, Jinying
    Li, Weijia
    Wang, Jie
    Wang, Zixuan
    Liu, Chong
    Zhang, Ning
    [J]. SCIENTIFIC DATA, 2024, 11 (01)
  • [9] Chu XX, 2021, ADV NEUR IN
  • [10] Personalized neoantigen pulsed dendritic cell vaccine for advanced lung cancer
    Ding, Zhenyu
    Li, Qing
    Zhang, Rui
    Xie, Li
    Shu, Yang
    Gao, Song
    Wang, Peipei
    Su, Xiaoqing
    Qin, Yun
    Wang, Yuelan
    Fang, Juemin
    Zhu, Zhongzheng
    Xia, Xuyang
    Wei, Guochao
    Wang, Hui
    Qian, Hong
    Guo, Xianling
    Gao, Zhibo
    Wang, Yu
    Wei, Yuquan
    Xu, Qing
    Xu, Heng
    Yang, Li
    [J]. SIGNAL TRANSDUCTION AND TARGETED THERAPY, 2021, 6 (01)