Dual-Domain Fusion Network Based on Wavelet Frequency Decomposition and Fuzzy Spatial Constraint for Remote Sensing Image Segmentation

被引:3
作者
Wei, Guangyi [1 ]
Xu, Jindong [1 ]
Yan, Weiqing [1 ]
Chong, Qianpeng [2 ]
Xing, Haihua [3 ]
Ni, Mengying [1 ]
机构
[1] Yantai Univ, Sch Comp & Control Engn, Yantai 264005, Peoples R China
[2] Beijing Normal Univ, Sch Artificial Intelligence, Beijing 100875, Peoples R China
[3] Hainan Normal Univ, Sch Informat Sci & Technol, Haikou 571158, Peoples R China
基金
中国国家自然科学基金;
关键词
remote sensing; semantic segmentation; wavelet transform; type-2; fuzzy; SEMANTIC SEGMENTATION; CHALLENGES; CLASSIFICATION; FRAMEWORK;
D O I
10.3390/rs16193594
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Semantic segmentation is crucial for a wide range of downstream applications in remote sensing, aiming to classify pixels in remote sensing images (RSIs) at the semantic level. The dramatic variations in grayscale and the stacking of categories within RSIs lead to unstable inter-class variance and exacerbate the uncertainty around category boundaries. However, existing methods typically emphasize spatial information while overlooking frequency insights, making it difficult to achieve desirable results. To address these challenges, we propose a novel dual-domain fusion network that integrates both spatial and frequency features. For grayscale variations, a multi-level wavelet frequency decomposition module (MWFD) is introduced to extract and integrate multi-level frequency features to enhance the distinctiveness between spatially similar categories. To mitigate the uncertainty of boundaries, a type-2 fuzzy spatial constraint module (T2FSC) is proposed to achieve flexible higher-order fuzzy modeling to adaptively constrain the boundary features in the spatial by constructing upper and lower membership functions. Furthermore, a dual-domain feature fusion (DFF) module bridges the semantic gap between the frequency and spatial features, effectively realizes semantic alignment and feature fusion between the dual domains, which further improves the accuracy of segmentation results. We conduct comprehensive experiments and extensive ablation studies on three well-known datasets: Vaihingen, Potsdam, and GID. In these three datasets, our method achieved 74.56%, 73.60%, and 81.01% mIoU, respectively. Quantitative and qualitative results demonstrate that the proposed method significantly outperforms state-of-the-art methods, achieving an excellent balance between segmentation accuracy and computational overhead.
引用
收藏
页数:24
相关论文
共 70 条
[31]   Wavelet Transform Feature Enhancement for Semantic Segmentation of Remote Sensing Images [J].
Li, Yifan ;
Liu, Ziqian ;
Yang, Junli ;
Zhang, Haopeng .
REMOTE SENSING, 2023, 15 (24)
[32]   ConvFormer: Plug-and-Play CNN-Style Transformers for Improving Medical Image Segmentation [J].
Lin, Xian ;
Yan, Zengqiang ;
Deng, Xianbo ;
Zheng, Chuansheng ;
Yu, Li .
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT IV, 2023, 14223 :642-651
[33]   Adaptive Fourier Convolution Network for Road Segmentation in Remote Sensing Images [J].
Liu, Huajun ;
Wang, Cailing ;
Zhao, Jinding ;
Chen, Suting ;
Kong, Hui .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 :1-14
[34]   Stair Fusion Network With Context-Refined Attention for Remote Sensing Image Semantic Segmentation [J].
Liu, Jia ;
Hua, Wenyi ;
Zhang, Wenhua ;
Liu, Fang ;
Xiao, Liang .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 :1-17
[35]   Swin Transformer: Hierarchical Vision Transformer using Shifted Windows [J].
Liu, Ze ;
Lin, Yutong ;
Cao, Yue ;
Hu, Han ;
Wei, Yixuan ;
Zhang, Zheng ;
Lin, Stephen ;
Guo, Baining .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :9992-10002
[36]  
Long J, 2015, PROC CVPR IEEE, P3431, DOI 10.1109/CVPR.2015.7298965
[37]   Land Cover Change Detection With Heterogeneous Remote Sensing Images: Review, Progress, and Perspective [J].
Lv, ZhiYong ;
Huang, HaiTao ;
Li, Xinghua ;
Zhao, MingHua ;
Benediktsson, Jon Atli ;
Sun, WeiWei ;
Falco, Nicola .
PROCEEDINGS OF THE IEEE, 2022, 110 (12) :1976-1991
[38]   End-to-End Optimized Versatile Image Compression With Wavelet-Like Transform [J].
Ma, Haichuan ;
Liu, Dong ;
Yan, Ning ;
Li, Houqiang ;
Wu, Feng .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (03) :1247-1263
[39]   A Multilevel Multimodal Fusion Transformer for Remote Sensing Semantic Segmentation [J].
Ma, Xianping ;
Zhang, Xiaokang ;
Pun, Man-On ;
Liu, Ming .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 :1-15
[40]   A comprehensive review on type 2 fuzzy logic applications: Past, present and future [J].
Mittal, Kanika ;
Jain, Amita ;
Vaisla, Kunwar Singh ;
Castillo, Oscar ;
Kacprzyk, Janusz .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2020, 95