Frequency-aware robust multidimensional information fusion framework for remote sensing image segmentation

被引:8
作者
Fan, Junyu [1 ]
Li, Jinjiang [2 ]
Liu, Yepeng [2 ]
Zhang, Fan [2 ]
机构
[1] Shandong Technol & Business Univ, Inst Network Technol ICT, Sch Informat & Elect Engn, Yantai 264005, Peoples R China
[2] Shandong Technol & Business Univ, Sch Comp Sci & Technol, Yantai 264005, Peoples R China
基金
中国国家自然科学基金;
关键词
Remote sensing; Semantic segmentation; Frequency information; Transformer; SEMANTIC SEGMENTATION; NETWORK;
D O I
10.1016/j.engappai.2023.107638
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Urban scene image segmentation is an important research area in high-resolution remote sensing image processing. However, due to its complex three-dimensional structure, interference factors such as occlusion, shadow, intra-class inconsistency, and inter-class indistinction affect segmentation performance. Many methods have combined local and global information using CNNs and Transformers to achieve high performance in remote sensing image segmentation tasks. However, these methods are not stable when dealing with these interference factors. Recent studies have found that semantic segmentation is highly sensitive to frequency information, so we introduced frequency information to make the model learn more comprehensively about different categories of targets from multiple dimensions. By modeling the target with local features, global information, and frequency information, the target features can be learned in multiple dimensions to reduce the impact of interference factors on the model and improve its robustness. In this paper, we consider frequency information in addition to combining CNNs and Transformers for modeling and propose a Multidimensional Information Fusion Network (MIFNet) for high-resolution remote sensing image segmentation of urban scenes. Specifically, we design an information fusion Transformer module that can adaptively associate local features, global semantic information, and frequency information and a relevant semantic aggregation module for aggregating features at different scales to construct the decoder. By aggregating image features at different depths, the specific representation of the target and the correlation between targets can be modeled in multiple dimensions, allowing the network to better recognize and understand the features of each class of targets to resist various interference factors that affect segmentation performance. We conducted extensive ablation experiments and comparative experiments on the ISPRS Vaihingen and ISPRS Potsdam benchmarks to verify our proposed method. In a large number of experiments, our method achieved the best results, with 84.53% and 87.3% mIoU scores on the Vaihingen and Potsdam datasets, respectively, proving the superiority of our method. The source code will be available at https://github.com/JunyuFan/MIFNet.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] Implicit Ray Transformers for Multiview Remote Sensing Image Segmentation
    Qi, Zipeng
    Chen, Hao
    Liu, Chenyang
    Shi, Zhenwei
    Zou, Zhengxia
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [22] Dual-Domain Fusion Network Based on Wavelet Frequency Decomposition and Fuzzy Spatial Constraint for Remote Sensing Image Segmentation
    Wei, Guangyi
    Xu, Jindong
    Yan, Weiqing
    Chong, Qianpeng
    Xing, Haihua
    Ni, Mengying
    REMOTE SENSING, 2024, 16 (19)
  • [23] Efficient Transformer for Remote Sensing Image Segmentation
    Xu, Zhiyong
    Zhang, Weicun
    Zhang, Tianxiang
    Yang, Zhifang
    Li, Jiangyun
    REMOTE SENSING, 2021, 13 (18)
  • [24] Image Segmentation in a Quaternion Framework for Remote Sensing Applications
    Voronin, V.
    Semenishchev, E.
    Zelensky, A.
    Tokareva, O.
    Agaian, S.
    MOBILE MULTIMEDIA/IMAGE PROCESSING, SECURITY, AND APPLICATIONS 2020, 2020, 11399
  • [25] Multi-scale Wavelet Frequency Channel Attention for Remote Sensing Image Segmentation
    Su, Yu-Chen
    Liu, Tsung-Jung
    Liuy, Kuan-Hsien
    2022 IEEE 14TH IMAGE, VIDEO, AND MULTIDIMENSIONAL SIGNAL PROCESSING WORKSHOP (IVMSP), 2022,
  • [26] FDGSNet: A Multimodal Gated Segmentation Network for Remote Sensing Image Based on Frequency Decomposition
    Cui, Jian
    Liu, Jiahang
    Ni, Yue
    Wang, Jinjin
    Li, Manchun
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 19756 - 19770
  • [27] Feature Fusion through Multitask CNN for Large-scale Remote Sensing Image Segmentation
    Sun, Shihao
    Yang, Lei
    Liu, Wenjie
    Li, Ruirui
    2018 10TH IAPR WORKSHOP ON PATTERN RECOGNITION IN REMOTE SENSING (PRRS), 2018,
  • [28] LMFNet: Lightweight Multimodal Fusion Network for high-resolution remote sensing image segmentation
    Wang, Tong
    Chen, Guanzhou
    Zhang, Xiaodong
    Liu, Chenxi
    Wang, Jiaqi
    Tan, Xiaoliang
    Zhou, Wenlin
    He, Chanjuan
    PATTERN RECOGNITION, 2025, 164
  • [29] Remote Sensing Image Semantic Segmentation Based on Edge Information Guidance
    He, Chu
    Li, Shenglin
    Xiong, Dehui
    Fang, Peizhang
    Liao, Mingsheng
    REMOTE SENSING, 2020, 12 (09)
  • [30] Remote Sensing Image Segmentation Network Based on Multi-Level Feature Refinement and Fusion
    Jian Yongsheng
    Zhu Daming
    Fu Zhitao
    Wen Shiya
    LASER & OPTOELECTRONICS PROGRESS, 2023, 60 (04)