MDSNet: a multiscale decoupled supervision network for semantic segmentation of remote sensing images

被引:3
作者
Feng, Jiangfan [1 ]
Chen, Panyu [1 ]
Gu, Zhujun [2 ,3 ]
Zeng, Maimai [2 ]
Zheng, Wei [1 ]
机构
[1] Chongqing Univ Posts & Telecommun, Sch Comp Sci & Technol, Chongqing, Peoples R China
[2] Pearl River Water Resources Commiss, Pearl River Water Resources Res Inst, Guangzhou, Peoples R China
[3] Room 1906,Tianshou Bldg,105 Tianshou Rd, Guangzhou, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
Semantic segmentation; remote sensing images; edge supervision; multiscale; y;
D O I
10.1080/17538947.2023.2241435
中图分类号
P9 [自然地理学];
学科分类号
0705 ; 070501 ;
摘要
Recent deep-learning successes have led to a new wave of semantic segmentation in remote sensing (RS) applications. However, most approaches rarely distinguish the role of the body and edge of RS ground objects; thus, our understanding of these semantic parts has been frustrated by the lack of detailed geometry and appearance. Here we present a multiscale decoupled supervision network for RS semantic segmentation. Our proposed framework extends a densely supervised encoder-decoder network with a feature decoupling module that can decouple semantic features with different scales into distinct body and edge components. We further conduct multiscale supervision of the original and decoupled body and edge features to enhance inner consistency and spatial boundaries in remote sensing image (RSI) ground objects, enabling new segmentation designs and semantic components that can learn to perform multiscale geometry and appearance. Our results outperform the previous algorithm and are robust to different datasets. These results demonstrate that decoupled supervision is an effective solution to semantic segmentation tasks of RS images.
引用
收藏
页码:2844 / 2861
页数:18
相关论文
共 32 条
[1]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[2]   Semantic Segmentation with Boundary Neural Fields [J].
Bertasius, Gedas ;
Shi, Jianbo ;
Torresani, Lorenzo .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3602-3610
[3]  
Chen J., 2021, arXiv
[4]   Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].
Chen, Liang-Chieh ;
Zhu, Yukun ;
Papandreou, George ;
Schroff, Florian ;
Adam, Hartwig .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851
[5]   Looking Outside the Window: Wide-Context Transformer for the Semantic Segmentation of High-Resolution Remote Sensing Images [J].
Ding, Lei ;
Lin, Dong ;
Lin, Shaofu ;
Zhang, Jing ;
Cui, Xiaojie ;
Wang, Yuebin ;
Tang, Hao ;
Bruzzone, Lorenzo .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[6]  
Jaderberg M, 2016, Arxiv, DOI [arXiv:1506.02025, DOI 10.48550/ARXIV:1506.02025]
[7]  
Kirillov A, 2023, Arxiv, DOI [arXiv:2304.02643, 10.48550/arXiv.2304.02643]
[8]  
Lee CY, 2014, Arxiv, DOI arXiv:1409.5185
[9]   A Review of Remote Sensing for Environmental Monitoring in China [J].
Li, Jun ;
Pei, Yanqiu ;
Zhao, Shaohua ;
Xiao, Rulin ;
Sang, Xiao ;
Zhang, Chengye .
REMOTE SENSING, 2020, 12 (07)
[10]  
Li XT, 2020, Img Proc Comp Vis Re, V12362, P435, DOI 10.1007/978-3-030-58520-4_26