Dual-Path Feature Aware Network for Remote Sensing Image Semantic Segmentation

被引:7
作者
Geng, Jie [1 ]
Song, Shuai [1 ]
Jiang, Wen [1 ]
机构
[1] Northwestern Polytech Univ, Sch Elect & Informat, Xian 710072, Peoples R China
基金
中国国家自然科学基金;
关键词
Semantics; Task analysis; feature fusion; transformer; remote sensing image; attention mechanism; ENCODER-DECODER;
D O I
10.1109/TCSVT.2023.3317937
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Semantic segmentation is a significant task for remote sensing interpretation, which takes advantage of contextual semantic information to classify each pixel into a specific category. Most current methods apply convolutional neural networks (CNN) to learn feature representation from remote sensing images, which may ignore the global dependencies due to the limitation of convolutional kernels. Inspired by the global feature learning ability of Transformer, we propose a novel deep model called dual-path feature aware network (DPFANet), which combines the structure of CNN and Transformer for semantic segmentation of remote sensing images. DPFANet aims to learn effective modeling ability from local to global features of images. Simultaneously, an adaptive feature fusion network is developed to fuse features from dual-path networks. Moreover, an edge optimization block is applied to constrain the edge features, whose purpose is to obtain more representative features for segmentation. Experimental results on three public remote sensing datasets verify that our proposed network yields better segmentation performance compared to other related methods.
引用
收藏
页码:3674 / 3686
页数:13
相关论文
共 67 条
[1]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[2]   MsanlfNet: Semantic Segmentation Network With Multiscale Attention and Nonlocal Filters for High-Resolution Remote Sensing Images [J].
Bai, Lin ;
Lin, Xiangyuan ;
Ye, Zhen ;
Xue, Dongling ;
Yao, Cheng ;
Hui, Meng .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
[3]   All-Higher-Stages-In Adaptive Context Aggregation for Semantic Edge Detection [J].
Bo, Qihan ;
Ma, Wei ;
Lai, Yu-Kun ;
Zha, Hongbin .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (10) :6778-6791
[4]  
Cao Hu, 2023, Computer Vision - ECCV 2022 Workshops: Proceedings. Lecture Notes in Computer Science (13803), P205, DOI 10.1007/978-3-031-25066-8_9
[5]   A Multi-Phase Camera-LiDAR Fusion Network for 3D Semantic Segmentation With Weak Supervision [J].
Chang, Xuepeng ;
Pan, Huihui ;
Sun, Weichao ;
Gao, Huijun .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (08) :3737-3746
[6]  
Chen J., 2021, arXiv, DOI [DOI 10.48550/ARXIV.2102.04306, 10.48550/arXiv.2102.04306]
[7]  
Chen LC, 2017, Arxiv, DOI [arXiv:1706.05587, DOI 10.48550/ARXIV.1706.05587]
[8]   Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].
Chen, Liang-Chieh ;
Zhu, Yukun ;
Papandreou, George ;
Schroff, Florian ;
Adam, Hartwig .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851
[9]   Multi-Scale Semantic Segmentation and Spatial Relationship Recognition of Remote Sensing Images Based on an Attention Model [J].
Cui, Wei ;
Wang, Fei ;
He, Xin ;
Zhang, Dongyou ;
Xu, Xuxiang ;
Yao, Meng ;
Wang, Ziwei ;
Huang, Jiejun .
REMOTE SENSING, 2019, 11 (09)
[10]   Looking Outside the Window: Wide-Context Transformer for the Semantic Segmentation of High-Resolution Remote Sensing Images [J].
Ding, Lei ;
Lin, Dong ;
Lin, Shaofu ;
Zhang, Jing ;
Cui, Xiaojie ;
Wang, Yuebin ;
Tang, Hao ;
Bruzzone, Lorenzo .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60