GFFNet: Global Feature Fusion Network for Semantic Segmentation of Large-Scale Remote Sensing Images

Cited by: 2

Authors:

Cao, Yong [1,2]

Huo, Chunlei [1,2]

Xiang, Shiming [1,2]

Pan, Chunhong [1,2]

Affiliations:

[1] Chinese Acad Sci, Inst Automat, Natl Key Lab Multimodal Artificial Intelligence Sy, Beijing 100190, Peoples R China

[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100190, Peoples R China

Source:

IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing | 2024 / Vol. 17

Keywords:

Cross feature fusion (CFF); global context learning; group transformer; semantic segmentation

DOI:

10.1109/JSTARS.2024.3359656

Chinese Library Classification (CLC):

TM [Electrical engineering]; TN [Electronics and communication technology]

Discipline classification codes:

0808; 0809

Abstract:

Semantic segmentation plays a pivotal role in interpreting high-resolution remote sensing images (RSIs), where contextual information is essential for achieving accurate segmentation. Despite the common practice of partitioning large RSIs into smaller patches for deep model input, existing methods often rely on adaptations from natural image semantic segmentation techniques, limiting their contextual scope to individual images. To address this limitation and harness a broader range of contextual information from original large-scale RSIs, this study introduces a global feature fusion network (GFFNet). GFFNet employs a novel approach by incorporating a group transformer structure alternated with group convolution, forming a lightweight global context learning branch. This design facilitates the extraction of global contextual features from the large-scale RSIs. In addition, we propose a cross feature fusion module that seamlessly integrates local features obtained from the convolutional network with the global contextual features. GFFNet serves as a versatile plugin for existing RSI semantic segmentation models, particularly beneficial when the target dataset involves cropping. This integration enhances the model's performance, especially in terms of segmenting large-scale objects. Experimental results on the ISPRS and GID-15 datasets validate the effectiveness of GFFNet in improving segmentation capabilities for large-scale objects in RSIs.
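The patch-partitioning practice mentioned in the abstract can be illustrated with a minimal sketch. It is not taken from the paper: the function name `tile_windows` and the `patch` and `stride` parameters are hypothetical, and the abstract does not state which crop size or overlap the authors use. The sketch only enumerates crop windows that cover a large image, shifting the last window back so the border is included.

```python
def tile_windows(height, width, patch, stride):
    """Return (top, left, bottom, right) crop windows covering a height x width image.

    Hypothetical helper for illustration; not the authors' preprocessing code.
    """
    if patch <= 0 or stride <= 0:
        raise ValueError("patch and stride must be positive")

    def starts(size):
        # An axis shorter than one patch gets a single window starting at 0.
        if size <= patch:
            return [0]
        positions = list(range(0, size - patch + 1, stride))
        # Add a final window aligned to the border if the regular grid misses it.
        if positions[-1] != size - patch:
            positions.append(size - patch)
        return positions

    return [
        (top, left, min(top + patch, height), min(left + patch, width))
        for top in starts(height)
        for left in starts(width)
    ]


if __name__ == "__main__":
    # Assumed sizes, chosen only for the example.
    for window in tile_windows(1000, 600, patch=512, stride=512):
        print(window)
```

A model that receives one such window at a time sees only the pixels inside it, which is the contextual limitation the abstract says the global context learning branch is meant to address.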


Pages: 4222-4234

Page count: 13
