Edge Detection Guide Network for Semantic Segmentation of Remote-Sensing Images

被引:62
作者
Jin, Jianhui [1 ]
Zhou, Wujie [1 ]
Yang, Rongwang [2 ]
Ye, Lv [1 ]
Yu, Lu [3 ]
机构
[1] Zhejiang Univ Sci & Technol, Sch Informat & Elect Engn, Hangzhou 310023, Peoples R China
[2] Zhejiang Univ, Childrens Hosp, Sch Med, Hangzhou 310030, Peoples R China
[3] Zhejiang Univ, Inst Informat & Commun Engn, Hangzhou 310027, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Semantics; Semantic segmentation; Remote sensing; Image edge detection; Convolution; Optical sensors; Edge detection; multi-level; multimodal; semantic segmentation;
D O I
10.1109/LGRS.2023.3234257
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
The acquisition of high-resolution satellite and airborne remote sensing images has been significantly simplified due to the rapid development of sensor technology. Several practical applications of high-resolution remote sensing images (HRRSIs) are based on semantic segmentation. However, single-modal HRRSIs are difficult to classify accurately in the complex situation of some scene objects; therefore, the semantic segmentation of multi-source information fusion is gaining popularity. The inherent difference between multimodal features and the semantic gap between multi-level features typically affect the performance of existing multi-mode fusion methods. We propose a multimodal fusion network based on edge detection to address these issues. This method aids multimodal information fusion by utilizing spatial information contained in the boundary. An edge detection guide module is included in the feature extraction stage to realize the boundary information through the fusion of details and semantics between high-level and low-level features. The boundary information is extended into the well-designed multimodal adaptive fusion block (MAFB) to obtain the multimodal fusion features. Furthermore, a residual adaptive fusion block (RAFB) and a spatial position module (SPM) in the feature decoding stage have been designed to fuse multi-level features from the standpoint of local and global dependence. We compared our method to several state-of-the-art (SOTA) methods using the International Society for Photogrammetry and Remote Sensing's (ISPRS) Vaihingen and Potsdam datasets. The final results demonstrate that our method achieves excellent performance.
引用
收藏
页数:5
相关论文
共 26 条
[1]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[2]   Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].
Chen, Liang-Chieh ;
Zhu, Yukun ;
Papandreou, George ;
Schroff, Florian ;
Adam, Hartwig .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851
[3]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[4]  
Hu XX, 2019, IEEE IMAGE PROC, P1440, DOI [10.1109/ICIP.2019.8803025, 10.1109/icip.2019.8803025]
[5]  
ISPRS, ABOUT US
[6]  
Jiang JD, 2018, Arxiv, DOI arXiv:1806.01054
[7]  
Li R., 2021, IEEE Trans. Geosci. Remote Sens., V60
[8]  
Long J., 2015, P IEEE C COMP VIS PA, P3431, DOI DOI 10.48550/ARXIV.1411.4038
[9]   Classification with an edge: Improving semantic with boundary detection [J].
Marmanis, D. ;
Schindler, K. ;
Wegner, J. D. ;
Galliani, S. ;
Datcu, M. ;
Stilla, U. .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2018, 135 :158-172
[10]   A Hierarchical Building Detection Method for Very High Resolution Remotely Sensed Images Combined with DSM Using Graph Cut Optimization [J].
Qin, Rongjun ;
Fang, Wei .
PHOTOGRAMMETRIC ENGINEERING AND REMOTE SENSING, 2014, 80 (09) :873-883