Lightweight Semantic Segmentation Network Based on Attention Coding

被引:3
作者
Chen Xiaolong [1 ]
Zhao Ji [1 ,2 ]
Chen Siyi [1 ]
机构
[1] Xiangtan Univ, Sch Automat & Elect Informat, Xiangtan 411100, Hunan, Peoples R China
[2] Tsinghua Univ, Natl CIMS Engn Technol Res Ctr, Beijing 100084, Peoples R China
关键词
image processing; semantic segmentation; self-attention module; lightweight network; encoder-decoder structures;
D O I
10.3788/LOP202158.1410012
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
To address the issues of high computational complexity and large memory footprint of the attention map of the self-attention mechanism and to improve the performance of the semantic segmentation network, we propose a lightweight network based on attention coding. The network uses an adaptive positional attention module and global attention upsampling module to encode and decode long-range dependency information, respectively. When calculating the attention map, adaptive positional attention module excludes useless basis sets and context information is obtained. A global attention upsampling module uses global context information to guide low-level features to reconstruct high-resolution images. Experimental results show that the segmentation accuracy of the network on the PASCAL VOC2012 verification set reaches a value of 84. 99o. Compared with dual attention network, which has a similar segmentation accuracy, the giga floating-point operations per second and the GPU memory of the network are reduced by 16.9% and 12.9%, respectively.
引用
收藏
页数:9
相关论文
共 28 条
[1]  
Adam H., Rethinking Atrous Convolution for Semantic Image Segmentation
[2]  
[Anonymous], 2018, OCNET OBJECT CONTEXT
[3]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[4]   Real-Time Semantic Segmentation Algorithm Based on Feature Fusion Technology [J].
Cai Yu ;
Huang Xuegong ;
Zhian, Zhang ;
Zhu Xinnian ;
Ma Xiang .
LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (02)
[5]   Mixed High-Order Attention Network for Person Re-Identification [J].
Chen, Binghui ;
Deng, Weihong ;
Hu, Jiani .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :371-381
[6]   Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].
Chen, Liang-Chieh ;
Zhu, Yukun ;
Papandreou, George ;
Schroff, Florian ;
Adam, Hartwig .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851
[7]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[8]   Real-Time Semantic Segmentation Based on Dilated Convolution Smoothing and Lightweight Up-Sampling [J].
Cheng Xiaoyue ;
Zhao Longzhang ;
Hu Qiong ;
Shi Jiapeng .
LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (02)
[9]  
Cordts M., 2015, CVPR Workshop on The Future of Datasets in Vision, V2
[10]   The Cityscapes Dataset for Semantic Urban Scene Understanding [J].
Cordts, Marius ;
Omran, Mohamed ;
Ramos, Sebastian ;
Rehfeld, Timo ;
Enzweiler, Markus ;
Benenson, Rodrigo ;
Franke, Uwe ;
Roth, Stefan ;
Schiele, Bernt .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3213-3223