A Local-Global Framework for Semantic Segmentation of Multisource Remote Sensing Images

被引：6

作者：

Qiu, Luyi ^{[1
]}

Yu, Dayu ^{[2
]}

Zhang, Chenxiao ^{[2
]}

Zhang, Xiaofeng ^{[3
]}

机构：

[1] Univ Elect Sci & Technol China, Sch Informat & Software Engn, Chengdu 610054, Peoples R China

[2] Wuhan Univ, Sch Remote Sensing & Informat Engn, Wuhan 430070, Peoples R China

[3] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai 200240, Peoples R China

来源：

REMOTE SENSING | 2023年 / 15卷 / 01期

基金：

中国博士后科学基金;

关键词：

semantic segmentation; deep learning; multisource image; global-local feature fusion; contrastive learning; CLASSIFICATION;

D O I：

10.3390/rs15010231

中图分类号：

X [环境科学、安全科学];

学科分类号：

08 ; 0830 ;

摘要：

Recently, deep learning has been widely used in the segmentation tasks of remote sensing images. However, the existing deep learning method most focus on local contextual information and has limited field of perception, which makes it difficult to capture the long-range contextual feature of objects at large scales form very-high-resolution (VHR) images. In this paper, we present a novel Local-global Framework consisting of the dual-source fusion network and local-global transformer modules, which efficiently utilize features extracted from multiple sources and fully capture features of local and global regions. The dual-source fusion network is an encoder designed to extract features from multiple sources such as spectra, synthetic aperture radar, and elevations, which selective fuse features from multiple sources and reduce the interference of redundant features. The local-global transformer module is proposed to capture fine-grained local features and coarse-grained global features, which enables the framework to focus on recognizing multiple-scale objects from the local and global regions. Moreover, we propose a pixelwise contrastive loss, which could encourage that the prediction is pulled closer to the ground truth. The Local-global Framework achieves state-of-the-art performance with 90.45% mean f1 score on the ISPRS Vaihingen dataset and 93.20% mean f1 score on the ISPRS Potsdam dataset.

引用

页数：22

共 43 条

[41] Context Encoding for Semantic Segmentation
Zhang, Hang
Dana, Kristin
Shi, Jianping
Zhang, Zhongyue
Wang, Xiaogang
Tyagi, Ambrish
Agrawal, Amit
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7151 - 7160
[42] Multi-Stage Fusion and Multi-Source Attention Network for Multi-Modal Remote Sensing Image Segmentation
Zhao, Jiaqi
Zhou, Yong
Shi, Boyu
Yang, Jingsong
Zhang, Di
Yao, Rui
[J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2021, 12 (06)
[43] Class-Guided Feature Decoupling Network for Airborne Image Segmentation
Zhou, Feng
Hang, Renlong
Liu, Qingshan
[J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (03): : 2245 - 2255

← 1 2 3 4 5 →