Lightweight Semantic Segmentation Network based on Attention Feature Fusion

被引:0
作者
Kuang, Xianyan [1 ]
Liu, Ping [1 ]
Chen, Yixi [1 ]
Zhang, Jianhua [1 ]
机构
[1] Jiangxi Univ Sci & Technol, Sch Elect Engn & Automat, Intelligent Traff & Intelligent Comp Lab, Ganzhou 341000, Peoples R China
基金
中国国家自然科学基金;
关键词
lightweight network; semantic segmentation; attention mechanism; feature fusion; Cityscapes;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
With the popularization of intelligent mobile devices, the lightweight semantic segmentation networks for terminal-oriented have made great progress. However, the low and middle-level features are not be fully used in the networks, and It's often overlooked that low-level noises on high-level features are superimposed, which can cause the important information to be blurred. In this paper, a lightweight semantic segmentation of Attention Feature Fusion Network (AFFNet) is proposed. In this network, a multi-branch structure is adopted to utilize feature information of different stages. An attention feature fusion module is designed, in which the feature information in each stage is weighted with the channel attention and spatial attention to avoid noise superimposition and information coverage in the fusion stage. The loss function of Object Weighted Focus Loss and Cross Entropy Loss (OWFL+CEL) is introduced in the training process to suppress the class imbalance problem. The results show that, with the operation of a single RTX2080 GPU, the mean intersection over union (mIoU) of the method on the Cityscapes dataset is 70.8%, and it reaches 35.7 frames per second(FPS) under high resolution input, which proves that this method has better performance and practical value than similar networks in the tasks of real-time semantic segmentation.
引用
收藏
页码:1584 / 1591
页数:8
相关论文
共 40 条
[1]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[2]  
Bochkovskiy A, 2020, Arxiv, DOI [arXiv:2004.10934, DOI 10.48550/ARXIV.2004.10934]
[3]  
Chen LC, 2017, Arxiv, DOI [arXiv:1706.05587, 10.48550/arXiv.1706.05587]
[4]   Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].
Chen, Liang-Chieh ;
Zhu, Yukun ;
Papandreou, George ;
Schroff, Florian ;
Adam, Hartwig .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851
[5]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[6]   The Cityscapes Dataset for Semantic Urban Scene Understanding [J].
Cordts, Marius ;
Omran, Mohamed ;
Ramos, Sebastian ;
Rehfeld, Timo ;
Enzweiler, Markus ;
Benenson, Rodrigo ;
Franke, Uwe ;
Roth, Stefan ;
Schiele, Bernt .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3213-3223
[7]   Dual Attention Network for Scene Segmentation [J].
Fu, Jun ;
Liu, Jing ;
Tian, Haijie ;
Li, Yong ;
Bao, Yongjun ;
Fang, Zhiwei ;
Lu, Hanqing .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :3141-3149
[8]  
Han S, 2016, Arxiv, DOI arXiv:1510.00149
[9]  
Hu J, 2018, PROC CVPR IEEE, P7132, DOI [10.1109/TPAMI.2019.2913372, 10.1109/CVPR.2018.00745]
[10]   Densely Connected Convolutional Networks [J].
Huang, Gao ;
Liu, Zhuang ;
van der Maaten, Laurens ;
Weinberger, Kilian Q. .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2261-2269