Fully convolutional network with attention modules for semantic segmentation

被引:1
作者
Yunjia Huang
Haixia Xu
机构
[1] Xiangtan University,School of Automation and Electronic Information
来源
Signal, Image and Video Processing | 2021年 / 15卷
关键词
Semantic segmentation; Fully convolutional network; Attention module;
D O I
暂无
中图分类号
学科分类号
摘要
Fully convolutional network is a powerful end-to-end model for semantic segmentation. However, it performs prediction pixel by pixel to pose weak consistency on intra-category. This paper proposes fully convolutional network with attention modules for semantic segmentation. Based on the framework of fully convolutional network, the post-processing attention module and skip-layer attention module are introduced to enhance the relevancy among pixels. Post-processing attention module is to calculate the similarity among pixels to obtain global information. Skip-layer attention module is designed to combine semantic information from a deep, coarse layer with contour information from a shallow, fine layer to produce the feature with high resolution and strong semantic information. Loss function, obtained by cross-entropy between estimated probability and label, is to optimize the network. Extensive experiments demonstrate that the proposed approach is superior to DeepLab and other models in performance of mean IoU with moderate computational complexity
引用
收藏
页码:1031 / 1039
页数:8
相关论文
共 31 条
[1]  
Rother C(2004)GrabCut-interactive foreground extraction using iterated graph cuts ACM Trans. Graph. 23 309-314
[2]  
Kolmogorov V(2016)Segmentation and recognition of cancer cells based on mathematical morphology Electron. Sci. Technol. 29 36-38
[3]  
Blake A(2016)Semantic segmentation of images exploiting DCT based features and random forest Pattern Recogn. 52 260-273
[4]  
Jian-Feng Xia(2006)Reducing the dimensionality of data with neural networks Science 313 504-507
[5]  
RavD Bober M(2017)Fully convolutional networks for semantic segmentation IEEE Trans. Patten Anal. Mach. Intell. 39 640-651
[6]  
Farinella GM(2018)Deeplab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs IEEE Trans. Pattern Anal. Mach. Intell. 40 834-848
[7]  
Guarnera M(2017)Segent: deep convolutional encoder-decoder architecture for image segmentation IEEE Trans. Pattern Anal. Mach. Intell. 39 2481-2495
[8]  
Battiato S(2010)The pascal visual object classes (VOC) challenge Int. J. Comput. Vis. 88 303-338
[9]  
Hinton GE(2018)Deeplab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs IEEE Trans. Pattern Anal. Mach. Intell. 40 834-848
[10]  
Salakhutdinov RR(undefined)undefined undefined undefined undefined-undefined