A Local-Global Framework for Semantic Segmentation of Multisource Remote Sensing Images

Cited by: 6
Authors
Qiu, Luyi [1]
Yu, Dayu [2]
Zhang, Chenxiao [2]
Zhang, Xiaofeng [3]
Affiliations
[1] Univ Elect Sci & Technol China, Sch Informat & Software Engn, Chengdu 610054, Peoples R China
[2] Wuhan Univ, Sch Remote Sensing & Informat Engn, Wuhan 430070, Peoples R China
[3] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai 200240, Peoples R China
Funding
China Postdoctoral Science Foundation
Keywords
semantic segmentation; deep learning; multisource image; global-local feature fusion; contrastive learning; CLASSIFICATION;
DOI
10.3390/rs15010231
Chinese Library Classification
X [Environmental Science, Safety Science]
Discipline classification codes
08; 0830
Abstract
Recently, deep learning has been widely used in segmentation tasks for remote sensing images. However, most existing deep learning methods focus on local contextual information and have a limited receptive field, which makes it difficult to capture long-range contextual features of large-scale objects from very-high-resolution (VHR) images. In this paper, we present a novel Local-global Framework consisting of a dual-source fusion network and local-global transformer modules, which efficiently utilizes features extracted from multiple sources and fully captures features of local and global regions. The dual-source fusion network is an encoder designed to extract features from multiple sources such as spectra, synthetic aperture radar, and elevation; it selectively fuses these features and reduces the interference of redundant features. The local-global transformer module is proposed to capture fine-grained local features and coarse-grained global features, which enables the framework to recognize multiscale objects in both local and global regions. Moreover, we propose a pixelwise contrastive loss that encourages the prediction to be pulled closer to the ground truth. The Local-global Framework achieves state-of-the-art performance with a 90.45% mean F1 score on the ISPRS Vaihingen dataset and a 93.20% mean F1 score on the ISPRS Potsdam dataset.
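For readers unfamiliar with the metric reported above, the following is a minimal sketch of how a mean F1 score is conventionally computed from a per-class confusion matrix. It uses the standard definition (per-class F1 = 2TP / (2TP + FP + FN), averaged over classes); the abstract does not specify the paper's exact evaluation protocol, so the function name `mean_f1`, the choice to average over all classes, and the toy two-class matrix are assumptions made for illustration only.

```python
def mean_f1(confusion):
    """Return (mean F1, per-class F1 list) for a square confusion matrix.

    confusion[i][j] is the number of pixels whose true class is i and
    whose predicted class is j. This is the textbook definition, not
    necessarily the exact protocol used in the paper.
    """
    n = len(confusion)
    per_class = []
    for c in range(n):
        tp = confusion[c][c]
        # Pixels predicted as class c that belong to another class.
        fp = sum(confusion[r][c] for r in range(n)) - tp
        # Pixels of class c that were predicted as another class.
        fn = sum(confusion[c]) - tp
        denom = 2 * tp + fp + fn
        per_class.append(2 * tp / denom if denom else 0.0)
    return sum(per_class) / n, per_class


# Hypothetical toy example with two classes (not data from the paper).
toy = [[8, 2],
       [1, 9]]
score, per_class = mean_f1(toy)
print(f"mean F1 = {score:.4f}")
```

Averaging per-class scores, rather than pooling all pixels, keeps small classes from being dominated by large ones, which is the usual reason this metric is reported for segmentation benchmarks.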
Pages: 22