DSHNet: A Semantic Segmentation Model of Remote Sensing Images Based on Dual Stream Hybrid Network

被引:12
作者
Fu, Yujia [1 ]
Zhang, Xiangrong [2 ]
Wang, Mingyang [1 ]
机构
[1] Northeast Forestry Univ, Coll Comp & Control Engn, Harbin 150040, Peoples R China
[2] Heilongjiang Inst Technol, Coll Econ & Business Adm, Harbin 150050, Peoples R China
关键词
Semantics; Feature extraction; Streaming media; Remote sensing; Semantic segmentation; Transformers; Data mining; Boundary detection; cross-fusion; dual-stream remote sensing images; semantic segmentation; CLASSIFICATION;
D O I
10.1109/JSTARS.2024.3355943
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Semantic segmentation is an important issue in intelligent interpretation of remote sensing, playing an important role in applications such as Earth observation and land data update. However, remote sensing images often contain complex ground objects and the boundaries between them are blurred, which poses a huge challenge to the semantic segmentation task of remote sensing images. This article proposes a dual stream hybrid network (DSHNet) model, which focuses on parallel extraction of semantic and boundary features in remote sensing images, and improves the performance of semantic segmentation by fully integrating dual stream information. In the semantic stream, the ViT model pretrained on remote sensing images is used as the backbone network for feature extraction. In the boundary stream, the boundary detection operator Sobel is used to capture the boundaries of different ground objects in the image, and a boundary enhancement mechanism is taken to optimize and enhance the feature representation of ground object boundaries. In addition, DSHNet designs a feature fusion module to cross-aggregate features from both semantic and boundary streams. Compared with the state-to-art semantic segmentation methods, DSHNet model has achieved the best performance on two datasets of Yellow River Estuary Wetland and Gaofen image dataset.
引用
收藏
页码:4164 / 4175
页数:12
相关论文
共 49 条
[1]  
Cao Hu, 2023, Computer Vision - ECCV 2022 Workshops: Proceedings. Lecture Notes in Computer Science (13803), P205, DOI 10.1007/978-3-031-25066-8_9
[2]  
Chen J., 2021, arXiv
[3]  
Chen LC, 2017, Arxiv, DOI [arXiv:1706.05587, 10.48550/arXiv.1706.05587]
[4]   Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].
Chen, Liang-Chieh ;
Zhu, Yukun ;
Papandreou, George ;
Schroff, Florian ;
Adam, Hartwig .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851
[5]  
Dosovitskiy A., 2021, P 9 INT C LEARN REPR
[6]   Dual Attention Network for Scene Segmentation [J].
Fu, Jun ;
Liu, Jing ;
Tian, Haijie ;
Li, Yong ;
Bao, Yongjun ;
Fang, Zhiwei ;
Lu, Hanqing .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :3141-3149
[7]   Super-Resolution Reconstruction of Remote Sensing Images Using Generative Adversarial Network With Shallow Information Enhancement [J].
Fu, Yujia ;
Zhang, Xiangrong ;
Wang, Mingyang .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 :8529-8540
[8]   UTNet: A Hybrid Transformer Architecture for Medical Image Segmentation [J].
Gao, Yunhe ;
Zhou, Mu ;
Metaxas, Dimitris N. .
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT III, 2021, 12903 :61-71
[9]   Optical remotely sensed time series data for land cover classification: A review [J].
Gomez, Cristina ;
White, Joanne C. ;
Wulder, Michael A. .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2016, 116 :55-72
[10]  
Guo B., 2022, IEEE Trans. Geosci. Remote Sens., V60, P1