GFFNet: Global Feature Fusion Network for Semantic Segmentation of Large-Scale Remote Sensing Images

Cited by: 2
Authors
Cao, Yong [1 ,2 ]
Huo, Chunlei [1 ,2 ]
Xiang, Shiming [1 ,2 ]
Pan, Chunhong [1 ,2 ]
Affiliations
[1] Chinese Acad Sci, Inst Automat, Natl Key Lab Multimodal Artificial Intelligence Sy, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100190, Peoples R China
Keywords
Cross feature fusion (CFF); global context learning; group transformer; semantic segmentation;
DOI
10.1109/JSTARS.2024.3359656
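Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology];
Discipline classification code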
0808 ; 0809 ;
Abstract
Semantic segmentation plays a pivotal role in interpreting high-resolution remote sensing images (RSIs), where contextual information is essential for achieving accurate segmentation. Despite the common practice of partitioning large RSIs into smaller patches for deep model input, existing methods often rely on adaptations from natural image semantic segmentation techniques, limiting their contextual scope to individual images. To address this limitation and harness a broader range of contextual information from original large-scale RSIs, this study introduces a global feature fusion network (GFFNet). GFFNet employs a novel approach by incorporating a group transformer structure alternated with group convolution, forming a lightweight global context learning branch. This design facilitates the extraction of global contextual features from the large-scale RSIs. In addition, we propose a cross feature fusion module that seamlessly integrates local features obtained from the convolutional network with the global contextual features. GFFNet serves as a versatile plugin for existing RSI semantic segmentation models, particularly beneficial when the target dataset involves cropping. This integration enhances the model's performance, especially in terms of segmenting large-scale objects. Experimental results on the ISPRS and GID-15 datasets validate the effectiveness of GFFNet in improving segmentation capabilities for large-scale objects in RSIs.
Pages: 4222-4234
Number of pages: 13