DSHNet: A Semantic Segmentation Model of Remote Sensing Images Based on Dual Stream Hybrid Network

被引:8
作者
Fu, Yujia [1 ]
Zhang, Xiangrong [2 ]
Wang, Mingyang [1 ]
机构
[1] Northeast Forestry Univ, Coll Comp & Control Engn, Harbin 150040, Peoples R China
[2] Heilongjiang Inst Technol, Coll Econ & Business Adm, Harbin 150050, Peoples R China
关键词
Semantics; Feature extraction; Streaming media; Remote sensing; Semantic segmentation; Transformers; Data mining; Boundary detection; cross-fusion; dual-stream remote sensing images; semantic segmentation; CLASSIFICATION;
D O I
10.1109/JSTARS.2024.3355943
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Semantic segmentation is an important issue in intelligent interpretation of remote sensing, playing an important role in applications such as Earth observation and land data update. However, remote sensing images often contain complex ground objects and the boundaries between them are blurred, which poses a huge challenge to the semantic segmentation task of remote sensing images. This article proposes a dual stream hybrid network (DSHNet) model, which focuses on parallel extraction of semantic and boundary features in remote sensing images, and improves the performance of semantic segmentation by fully integrating dual stream information. In the semantic stream, the ViT model pretrained on remote sensing images is used as the backbone network for feature extraction. In the boundary stream, the boundary detection operator Sobel is used to capture the boundaries of different ground objects in the image, and a boundary enhancement mechanism is taken to optimize and enhance the feature representation of ground object boundaries. In addition, DSHNet designs a feature fusion module to cross-aggregate features from both semantic and boundary streams. Compared with the state-to-art semantic segmentation methods, DSHNet model has achieved the best performance on two datasets of Yellow River Estuary Wetland and Gaofen image dataset.
引用
收藏
页码:4164 / 4175
页数:12
相关论文
共 49 条
  • [1] Cao Hu, 2023, Computer Vision - ECCV 2022 Workshops: Proceedings. Lecture Notes in Computer Science (13803), P205, DOI 10.1007/978-3-031-25066-8_9
  • [2] Chen LC, 2017, Arxiv, DOI arXiv:1706.05587
  • [3] Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
    Chen, Liang-Chieh
    Zhu, Yukun
    Papandreou, George
    Schroff, Florian
    Adam, Hartwig
    [J]. COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 : 833 - 851
  • [4] Dosovitskiy A., 2021, P INT C LEARN REPR, P1
  • [5] Dual Attention Network for Scene Segmentation
    Fu, Jun
    Liu, Jing
    Tian, Haijie
    Li, Yong
    Bao, Yongjun
    Fang, Zhiwei
    Lu, Hanqing
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3141 - 3149
  • [6] Super-Resolution Reconstruction of Remote Sensing Images Using Generative Adversarial Network With Shallow Information Enhancement
    Fu, Yujia
    Zhang, Xiangrong
    Wang, Mingyang
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 : 8529 - 8540
  • [7] UTNet: A Hybrid Transformer Architecture for Medical Image Segmentation
    Gao, Yunhe
    Zhou, Mu
    Metaxas, Dimitris N.
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT III, 2021, 12903 : 61 - 71
  • [8] Optical remotely sensed time series data for land cover classification: A review
    Gomez, Cristina
    White, Joanne C.
    Wulder, Michael A.
    [J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2016, 116 : 55 - 72
  • [9] Guo B., 2022, IEEE Trans. Geosci. Remote Sens., V60, P1
  • [10] Swin Transformer Embedding UNet for Remote Sensing Image Semantic Segmentation
    He, Xin
    Zhou, Yong
    Zhao, Jiaqi
    Zhang, Di
    Yao, Rui
    Xue, Yong
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60