Exploring New Backbone and Attention Module for Semantic Segmentation in Street Scenes

被引:20
作者
Fan, Lei [1 ]
Wang, Wei-Chien [2 ]
Zha, Fuyuan [1 ]
Yan, Jiapeng [1 ]
机构
[1] Hefei Univ Technol, Sch Elect Engn & Automat, Hefei 230009, Anhui, Peoples R China
[2] Queensland Univ Technol, Sci & Engn Fac, Brisbane, Qld 4001, Australia
关键词
Semantic segmentation; segmentation backbone; attention mechanism; street scenes; CNN;
D O I
10.1109/ACCESS.2018.2880877
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Semantic segmentation, as dense pixel-wise classification task, played an important tache in scene understanding. There are two main challenges in many state-of-the-art works: 1) most backbone of segmentation models that often were extracted from pretrained classification models generated poor performance in small categories because they were lacking in spatial information and 2) the gap of combination between high-level and low-level features in segmentation models has led to inaccurate predictions. To handle these challenges, in this paper, we proposed a new tailored backbone and attention select module for segmentation tasks. Specifically, our new backbone was modified from the original Resnet, which can yield better segmentation performance. Attention select module employed spatial and channel self-attention mechanism to reinforce the propagation of contextual features, which can aggregate semantic and spatial information simultaneously. In addition, based on our new backbone and attention select module, we further proposed our segmentation model for street scenes understanding. We conducted a series of ablation studies on two public benchmarks, including Cityscapes and CamVid dataset to demonstrate the effectiveness of our proposals. Our model achieved a mIoU score of 71.5% on the Cityscapes test set with only fine annotation data and 60.1% on the CamVid test set.
引用
收藏
页码:71566 / 71580
页数:15
相关论文
共 66 条
[1]  
[Anonymous], P IEEE INT C COMP VI
[2]  
[Anonymous], 2014, PROC 2 INT C LEARN R
[3]  
[Anonymous], 2014, Computer Science
[4]  
[Anonymous], 2015, ARXIV PREPRINT ARXIV
[5]  
[Anonymous], PROC CVPR IEEE
[6]  
[Anonymous], P 24 INT C PATT REC
[7]  
[Anonymous], 2016, Coco-stuff: Thing and stuff classes in context
[8]  
[Anonymous], 2018, PROC CVPR IEEE, DOI DOI 10.1109/CVPR.2018.00262
[9]  
[Anonymous], 2015, IEEE T IMAGE PROCESS, DOI DOI 10.1109/TIP.2015.2487833
[10]  
[Anonymous], IEEE T PATTERN ANAL