Multi-Resolution Learning and Semantic Edge Enhancement for Super-Resolution Semantic Segmentation of Urban Scene Images

Cited by: 4

Authors:

Shu, Ruijun [1,2]

Zhao, Shengjie [1,3]

Affiliations:

[1] Tongji Univ, Coll Elect & Informat Engn, Shanghai 201804, Peoples R China

[2] Chinese Acad Sci, Shanghai Inst Microsyst & Informat Technol, Shanghai 200050, Peoples R China

[3] Tongji Univ, Sch Software Engn, Shanghai 201804, Peoples R China

Source:

SENSORS | 2024 / Vol. 24 / Issue 14

Keywords:

image semantic segmentation; super-resolution semantic segmentation; multi-resolution learning; semantic edge enhancement

DOI:

10.3390/s24144522

Chinese Library Classification (CLC) number:

O65 [Analytical Chemistry]

Discipline classification codes:

070302; 081704

Abstract:

Super-resolution semantic segmentation (SRSS) aims to produce high-resolution semantic segmentation results from resolution-reduced input images. SRSS can significantly reduce computational cost and enable efficient, high-resolution semantic segmentation on resource-limited mobile devices. Some existing methods require modifying the structure of the original semantic segmentation network or adding complicated processing modules, which limits deployment flexibility. Furthermore, the lack of detail in the low-resolution input image makes existing methods prone to misdetection at semantic edges. To address these problems, we propose a simple but effective framework, multi-resolution learning and semantic edge enhancement-based super-resolution semantic segmentation (MS-SRSS), which can be applied to any existing encoder-decoder-based semantic segmentation network. Specifically, a multi-resolution learning (MRL) mechanism is proposed to improve the feature extraction ability of the network's feature encoder. In addition, we introduce a semantic edge enhancement (SEE) loss to alleviate false detection at semantic edges. We conduct extensive experiments on three challenging benchmarks, Cityscapes, Pascal Context, and Pascal VOC 2012, to verify the effectiveness of the proposed MS-SRSS method. The experimental results show that our method achieves new state-of-the-art semantic segmentation performance compared with existing methods.
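Illustrative note: the abstract does not define the SEE loss or the MRL mechanism, so the sketch below is only an assumed stand-in for the two ideas it names, not the authors' method. It shows (a) how reducing the input resolution by a factor shrinks the pixel count the network must process, and (b) one plausible way to give extra weight to errors at semantic edges. The function names, the 4-neighbour edge definition, and the `edge_weight` parameter are all hypothetical.

```python
def downsample_nearest(image, factor):
    """Keep every `factor`-th row and column (nearest-neighbour reduction)."""
    return [row[::factor] for row in image[::factor]]


def semantic_edge_mask(labels):
    """Mark pixels whose class label differs from any 4-connected neighbour.

    This is an assumed, generic definition of a 'semantic edge'.
    """
    h, w = len(labels), len(labels[0])
    mask = [[False] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            for dy, dx in ((-1, 0), (1, 0), (0, -1), (0, 1)):
                ny, nx = y + dy, x + dx
                inside = 0 <= ny < h and 0 <= nx < w
                if inside and labels[ny][nx] != labels[y][x]:
                    mask[y][x] = True
                    break
    return mask


def edge_weighted_error(pred, target, edge_weight=2.0):
    """Weighted misclassification rate.

    Pixels on a semantic edge of `target` count `edge_weight` times,
    all other pixels count once. A hypothetical proxy for an
    edge-enhancement objective, not the paper's SEE loss.
    """
    mask = semantic_edge_mask(target)
    total = 0.0
    wrong = 0.0
    for y, row in enumerate(target):
        for x, t in enumerate(row):
            weight = edge_weight if mask[y][x] else 1.0
            total += weight
            if pred[y][x] != t:
                wrong += weight
    return wrong / total


if __name__ == "__main__":
    # 4x4 label map with a vertical boundary between class 0 and class 1.
    target = [[0, 0, 1, 1] for _ in range(4)]
    low_res = downsample_nearest(target, 2)
    print(len(low_res) * len(low_res[0]), "pixels instead of", 16)

    # One mistake on the boundary costs more than one mistake far from it.
    pred_edge = [row[:] for row in target]
    pred_edge[0][1] = 1
    pred_flat = [row[:] for row in target]
    pred_flat[0][0] = 1
    print(edge_weighted_error(pred_edge, target) > edge_weighted_error(pred_flat, target))
```

With a reduction factor of 2 the input has a quarter of the original pixels, which is the source of the computational saving the abstract refers to; the weighting only illustrates why boundary errors might be singled out during training.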

Pages: 14
