Contextual Attention Refinement Network for Real-Time Semantic Segmentation

被引:18
作者
Hao, Shijie [1 ,2 ]
Zhou, Yuan [1 ,2 ]
Zhang, Youming [3 ]
Guo, Yanrong [1 ,2 ]
机构
[1] Hefei Univ Technol, Minist Educ, Key Lab Knowledge Engn Big Data, Hefei 230009, Peoples R China
[2] Hefei Univ Technol, Sch Comp Sci & Informat Engn, Hefei 230009, Peoples R China
[3] Northeastern Univ, Sch Math & Stat, Qinhuangdao 066004, Hebei, Peoples R China
关键词
Real-time semantic segmentation; contextual attention refinement module; semantic context loss;
D O I
10.1109/ACCESS.2020.2981842
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recently, significant progress has been made in pixel-level semantic segmentation using deep neural networks. However, for the current semantic segmentation methods, it is still challenging to achieve the balance between segmentation accuracy and computational cost. To address this issue, we propose the Contextual Attention Refinement Network (CARNet). In this method, we construct the Contextual Attention Refinement Module (CARModule), which learns an attention vector to guide the fusion of low-level and high-level features for obtaining higher segmentation accuracy. The CARModule is lightweight and can be directly equipped with different types of network structures. To better optimize the network, we additionally consider the semantic information, and further introduce the Semantic Context Loss (SCLoss) into the overall loss function. In the experiments, we validate the effectiveness and efficiency of our method on several public datasets for semantic segmentation. The results show that our method achieves a good balance on accuracy and computational costs.
引用
收藏
页码:55230 / 55240
页数:11
相关论文
共 39 条
[31]   High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs [J].
Wang, Ting-Chun ;
Liu, Ming-Yu ;
Zhu, Jun-Yan ;
Tao, Andrew ;
Kautz, Jan ;
Catanzaro, Bryan .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :8798-8807
[32]  
Wu Z., 2017, ARXIVABS171200213
[33]   BiSeNet: Bilateral Segmentation Network for Real-Time Semantic Segmentation [J].
Yu, Changqian ;
Wang, Jingbo ;
Peng, Chao ;
Gao, Changxin ;
Yu, Gang ;
Sang, Nong .
COMPUTER VISION - ECCV 2018, PT XIII, 2018, 11217 :334-349
[34]  
Yu F, 2015, 4 INT C LEARN REPR S, V1511, P7122
[35]   ICNet for Real-Time Semantic Segmentation on High-Resolution Images [J].
Zhao, Hengshuang ;
Qi, Xiaojuan ;
Shen, Xiaoyong ;
Shi, Jianping ;
Jia, Jiaya .
COMPUTER VISION - ECCV 2018, PT III, 2018, 11207 :418-434
[36]   Pyramid Scene Parsing Network [J].
Zhao, Hengshuang ;
Shi, Jianping ;
Qi, Xiaojuan ;
Wang, Xiaogang ;
Jia, Jiaya .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6230-6239
[37]   Conditional Random Fields as Recurrent Neural Networks [J].
Zheng, Shuai ;
Jayasumana, Sadeep ;
Romera-Paredes, Bernardino ;
Vineet, Vibhav ;
Su, Zhizhong ;
Du, Dalong ;
Huang, Chang ;
Torr, Philip H. S. .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1529-1537
[38]   Scene Parsing through ADE20K Dataset [J].
Zhou, Bolei ;
Zhao, Hang ;
Puig, Xavier ;
Fidler, Sanja ;
Barriuso, Adela ;
Torralba, Antonio .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :5122-5130
[39]   Low-Rank Graph-Regularized Structured Sparse Regression for Identifying Genetic Biomarkers [J].
Zhu, Xiaofeng ;
Suk, Heung-Il ;
Huang, Heng ;
Shen, Dinggang .
IEEE Transactions on Big Data, 2017, 3 (04) :405-414