Multi-Scale Frequency-Spatial Domain Attention Fusion Network for Building Extraction in Remote Sensing Images

被引:1
作者
Liu, Jia [1 ]
Chen, Hao [1 ]
Li, Zuhe [1 ]
Gu, Hang [1 ]
机构
[1] Zhengzhou Univ Light Ind, Sch Comp Sci & Technol, Zhengzhou 450002, Peoples R China
关键词
remote sensing; building extraction; dual-domain learning; multi-scale fusion; CLASSIFICATION; INFORMATION;
D O I
10.3390/electronics13234642
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Building extraction from remote sensing images holds significant importance in the fields of land resource management, urban planning, and disaster assessment. Encoder-decoder deep learning models are increasingly favored due to their advanced feature representation capabilities in image analysis. However, because of the diversity of architectural styles and issues such as tree occlusion, traditional methods often result in building omissions and blurred boundaries when extracting building footprints. Given these limitations, this paper proposes a cutting-edge Multi-Scale Frequency-Spatial Domain Attention Fusion Network (MFSANet), which consists of two principal modules, named Frequency-Spatial Domain Attention Fusion Module (FSAFM) and Attention-Guided Multi-scale Fusion Upsampling Module (AGMUM). FSAFM introduces frequency domain attention and spatial attention separately to enhance the feature maps, thereby strengthening the model's boundary-detection capabilities and ultimately improving the accuracy of building extraction. AGMUM first resizes and concatenates attention enhancement maps to enhance contextual understanding and applies attention guidance to further improve prediction accuracy. Our model demonstrates superior performance compared to existing semantic segmentation methods on both the WHU building data set and the Inria aerial image data set.
引用
收藏
页数:17
相关论文
共 43 条
[1]  
Ali S, 2024, Arxiv, DOI arXiv:2404.03991
[2]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[3]   Urban Green Space Planning Based on Remote Sensing and Geographic Information Systems [J].
Bai, Hua ;
Li, Ziwei ;
Guo, Hanlong ;
Chen, Haopeng ;
Luo, Pingping .
REMOTE SENSING, 2022, 14 (17)
[4]  
Cao H., 2021, P ECCV WORKSH MONTR
[5]   Res2-Unet, a New Deep Architecture for Building Detection From High Spatial Resolution Images [J].
Chen, Fang ;
Wang, Ning ;
Yu, Bo ;
Wang, Lei .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 :1494-1501
[6]   Fourier domain structural relationship analysis for unsupervised multimodal change detection [J].
Chen, Hongruixuan ;
Yokoya, Naoto ;
Chini, Marco .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2023, 198 :99-114
[7]   DCTC: Fast and Accurate Contour-Based Instance Segmentation With DCT Encoding for High-Resolution Remote Sensing Images [J].
Chen, Zhong ;
Liu, Tianhang ;
Xu, Xueru ;
Leng, Junsong ;
Chen, Zhenxue .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 :8697-8709
[8]   The potential of nighttime light remote sensing data to evaluate the development of digital economy: A case study of China at the city level [J].
Chen, Zuoqi ;
Wei, Ye ;
Shi, Kaifang ;
Zhao, Zhiyuan ;
Wang, Congxiao ;
Wu, Bin ;
Qiu, Bingwen ;
Yu, Bailang .
COMPUTERS ENVIRONMENT AND URBAN SYSTEMS, 2022, 92
[9]  
Dong B, 2023, AAAI CONF ARTIF INTE, P516
[10]   Semantic classification of urban buildings combining VHR image and GIS data: An improved random forest approach [J].
Du, Shihong ;
Zhang, Fangli ;
Zhang, Xiuyuan .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2015, 105 :107-119