Lightweight Semantic Segmentation Network Based on Attention Coding

被引：3

作者：

Chen Xiaolong ^{[1
]}

Zhao Ji ^{[1
,2
]}

Chen Siyi ^{[1
]}

机构：

[1] Xiangtan Univ, Sch Automat & Elect Informat, Xiangtan 411100, Hunan, Peoples R China

[2] Tsinghua Univ, Natl CIMS Engn Technol Res Ctr, Beijing 100084, Peoples R China

来源：

LASER & OPTOELECTRONICS PROGRESS | 2021年 / 58卷 / 14期

关键词：

image processing; semantic segmentation; self-attention module; lightweight network; encoder-decoder structures;

D O I：

10.3788/LOP202158.1410012

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

To address the issues of high computational complexity and large memory footprint of the attention map of the self-attention mechanism and to improve the performance of the semantic segmentation network, we propose a lightweight network based on attention coding. The network uses an adaptive positional attention module and global attention upsampling module to encode and decode long-range dependency information, respectively. When calculating the attention map, adaptive positional attention module excludes useless basis sets and context information is obtained. A global attention upsampling module uses global context information to guide low-level features to reconstruct high-resolution images. Experimental results show that the segmentation accuracy of the network on the PASCAL VOC2012 verification set reaches a value of 84. 99o. Compared with dual attention network, which has a similar segmentation accuracy, the giga floating-point operations per second and the GPU memory of the network are reduced by 16.9% and 12.9%, respectively.

引用

页数：9

共 28 条

[1]

Adam H., Rethinking Atrous Convolution for Semantic Image Segmentation

[2]

[Anonymous], 2018, OCNET OBJECT CONTEXT

[3] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].

Badrinarayanan, Vijay ;

Kendall, Alex ;

Cipolla, Roberto .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495

[4] Real-Time Semantic Segmentation Algorithm Based on Feature Fusion Technology [J].

Cai Yu ;

Huang Xuegong ;

Zhian, Zhang ;

Zhu Xinnian ;

Ma Xiang .

LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (02)

[5] Mixed High-Order Attention Network for Person Re-Identification [J].

Chen, Binghui ;

Deng, Weihong ;

Hu, Jiani .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :371-381

[6] Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].

Chen, Liang-Chieh ;

Zhu, Yukun ;

Papandreou, George ;

Schroff, Florian ;

Adam, Hartwig .

COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851

[7] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].

Chen, Liang-Chieh ;

Papandreou, George ;

Kokkinos, Iasonas ;

Murphy, Kevin ;

Yuille, Alan L. .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848

[8] Real-Time Semantic Segmentation Based on Dilated Convolution Smoothing and Lightweight Up-Sampling [J].

Cheng Xiaoyue ;

Zhao Longzhang ;

Hu Qiong ;

Shi Jiapeng .

LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (02)

[9]

Cordts M., 2015, CVPR Workshop on The Future of Datasets in Vision, V2

[10] The Cityscapes Dataset for Semantic Urban Scene Understanding [J].

Cordts, Marius ;

Omran, Mohamed ;

Ramos, Sebastian ;

Rehfeld, Timo ;

Enzweiler, Markus ;

Benenson, Rodrigo ;

Franke, Uwe ;

Roth, Stefan ;

Schiele, Bernt .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3213-3223

← 1 2 3 →