Contextual Attention Refinement Network for Real-Time Semantic Segmentation

Cited by: 18
Authors
Hao, Shijie [1, 2]
Zhou, Yuan [1, 2]
Zhang, Youming [3]
Guo, Yanrong [1, 2]
Affiliations
[1] Hefei Univ Technol, Minist Educ, Key Lab Knowledge Engn Big Data, Hefei 230009, Peoples R China
[2] Hefei Univ Technol, Sch Comp Sci & Informat Engn, Hefei 230009, Peoples R China
[3] Northeastern Univ, Sch Math & Stat, Qinhuangdao 066004, Hebei, Peoples R China
Keywords
Real-time semantic segmentation; contextual attention refinement module; semantic context loss
DOI
10.1109/ACCESS.2020.2981842
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Deep neural networks have recently driven significant progress in pixel-level semantic segmentation. However, current semantic segmentation methods still struggle to balance segmentation accuracy against computational cost. To address this issue, we propose the Contextual Attention Refinement Network (CARNet). We construct the Contextual Attention Refinement Module (CARModule), which learns an attention vector that guides the fusion of low-level and high-level features to improve segmentation accuracy. The CARModule is lightweight and can be integrated directly into different types of network structures. To better optimize the network, we also take semantic information into account and introduce the Semantic Context Loss (SCLoss) into the overall loss function. In the experiments, we validate the effectiveness and efficiency of our method on several public semantic segmentation datasets. The results show that our method achieves a good balance between accuracy and computational cost.
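The abstract does not give the internals of the CARModule, so the following is only a minimal sketch of the general idea it names: an attention vector guiding the fusion of low-level and high-level features. Everything concrete here is an assumption for illustration, not the authors' design: the function name `attention_guided_fusion`, the use of global average pooling, the single linear projection (`weight`, `bias`), the sigmoid gate, and the additive fusion.

```python
import numpy as np


def sigmoid(x):
    """Element-wise logistic function."""
    return 1.0 / (1.0 + np.exp(-x))


def attention_guided_fusion(low, high, weight, bias):
    """Fuse two feature maps using a per-channel attention vector.

    Assumed shapes (hypothetical, not from the paper):
      low, high: (C, H, W) feature maps at the same resolution
      weight:    (C, C) learned projection
      bias:      (C,)   learned bias
    """
    # Global average pooling: one context descriptor per high-level channel.
    context = high.mean(axis=(1, 2))                # shape (C,)
    # Project and squash to obtain an attention vector with entries in (0, 1).
    attention = sigmoid(weight @ context + bias)    # shape (C,)
    # Re-weight the low-level channels, then add the high-level features.
    fused = low * attention[:, None, None] + high   # shape (C, H, W)
    return fused, attention


rng = np.random.default_rng(0)
low = rng.standard_normal((4, 8, 8))
high = rng.standard_normal((4, 8, 8))
weight = rng.standard_normal((4, 4)) * 0.1
bias = np.zeros(4)

fused, attention = attention_guided_fusion(low, high, weight, bias)
print(fused.shape, attention.shape)
```

In this sketch the gate adds only one small matrix-vector product per fusion step, which is the sense in which such a module can stay lightweight; the actual cost of the CARModule depends on details the abstract does not state.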
Pages: 55230-55240
Number of pages: 11