A Local-Global Framework for Semantic Segmentation of Multisource Remote Sensing Images

被引：6

作者：

Qiu, Luyi ^{[1
]}

Yu, Dayu ^{[2
]}

Zhang, Chenxiao ^{[2
]}

Zhang, Xiaofeng ^{[3
]}

机构：

[1] Univ Elect Sci & Technol China, Sch Informat & Software Engn, Chengdu 610054, Peoples R China

[2] Wuhan Univ, Sch Remote Sensing & Informat Engn, Wuhan 430070, Peoples R China

[3] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai 200240, Peoples R China

来源：

REMOTE SENSING | 2023年 / 15卷 / 01期

基金：

中国博士后科学基金;

关键词：

semantic segmentation; deep learning; multisource image; global-local feature fusion; contrastive learning; CLASSIFICATION;

D O I：

10.3390/rs15010231

中图分类号：

X [环境科学、安全科学];

学科分类号：

08 ; 0830 ;

摘要：

Recently, deep learning has been widely used in the segmentation tasks of remote sensing images. However, the existing deep learning method most focus on local contextual information and has limited field of perception, which makes it difficult to capture the long-range contextual feature of objects at large scales form very-high-resolution (VHR) images. In this paper, we present a novel Local-global Framework consisting of the dual-source fusion network and local-global transformer modules, which efficiently utilize features extracted from multiple sources and fully capture features of local and global regions. The dual-source fusion network is an encoder designed to extract features from multiple sources such as spectra, synthetic aperture radar, and elevations, which selective fuse features from multiple sources and reduce the interference of redundant features. The local-global transformer module is proposed to capture fine-grained local features and coarse-grained global features, which enables the framework to focus on recognizing multiple-scale objects from the local and global regions. Moreover, we propose a pixelwise contrastive loss, which could encourage that the prediction is pulled closer to the ground truth. The Local-global Framework achieves state-of-the-art performance with 90.45% mean f1 score on the ISPRS Vaihingen dataset and 93.20% mean f1 score on the ISPRS Potsdam dataset.

引用

页数：22

共 43 条

[1] Context-driven fusion of high spatial and spectral resolution images based on oversampled multiresolution analysis
Aiazzi, B
Alparone, L
Baronti, S
Garzelli, A
[J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2002, 40 (10): : 2300 - 2312
[2] Ao Luo, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12357), P346, DOI 10.1007/978-3-030-58610-2_21
[3] A comprehensive survey on support vector machine classification: Applications, challenges and trends
Cervantes, Jair
Garcia-Lamont, Farid
Rodriguez-Mazahua, Lisbeth
Lopez, Asdrubal
[J]. NEUROCOMPUTING, 2020, 408 : 189 - 215
[4] Chen J., 2021, TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation, P1, DOI DOI 10.1038/s41566-021-00828-5
[5] Superpixel-Based Attention Graph Neural Network for Semantic Segmentation in Aerial Images
Diao, Qi
Dai, Yaping
Zhang, Ce
Wu, Yan
Feng, Xiaoxue
Pan, Feng
[J]. REMOTE SENSING, 2022, 14 (02)
[6] LANet: Local Attention Embedding to Improve the Semantic Segmentation of Remote Sensing Images
Ding, Lei
Tang, Hao
Bruzzone, Lorenzo
[J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (01): : 426 - 435
[7] Dosovitskiy A., 2020, ICLR, V20, DOI 10.48550/arXiv.2010.11929
[8] Grill J.-B., 2020, P ADV NEUR INF PROC, V33, P21271, DOI DOI 10.48550/ARXIV.2006.07733
[9] Learning longitudinal classification-regression model for infant hippocampus segmentation
Guo Y.
Wu Z.
Shen D.
[J]. Neurocomputing, 2022, 391 : 191 - 198
[10] Multi-Source Domain Adaptation with Collaborative Learning for Semantic Segmentation
He, Jianzhong
Jia, Xu
Chen, Shuaijun
Liu, Jianzhuang
[J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 11003 - 11012

← 1 2 3 4 5 →