Real-time Image Semantic Segmentation Based on Block Adaptive Feature Fusion

被引:0
|
作者
Huang T.-H. [1 ]
Nie Z.-Y. [1 ]
Wang Q.-G. [2 ]
Li S. [3 ]
Yan L.-C. [1 ]
Guo D.-S. [1 ]
机构
[1] College of Information Science and Engineering, National Huaqiao University, Xiamen
[2] Institute for Intelligent Systems, University of Johannesburg, Johannesburg
[3] the Hong Kong Polytechnic University, Hong Kong
来源
基金
中国国家自然科学基金;
关键词
Block adaptive feature fusion (BAFF); Deep learning; Real-time semantic segmentation network; SkipNet;
D O I
10.16383/j.aas.c180645
中图分类号
学科分类号
摘要
Recently, image semantic segmentation has made great progress with deep learning, which benefits robotics and automatic driving vehicle. This paper proposes a real-time semantic segmentation algorithm based on block adaptive feature fusion (BAFF). Under the framework of a light convolutional network, a block adaptive feature fusion algorithm is proposed in the context-embedding module, to improve the accuracy of real-time semantic segmentation. First, the problem caused by the different size of receptive field in layers is analyzed, and a feature fusion mechanism with block weight is presented on SkipNet. Then, layers' feature integration is carried on by three-dimension convolution. The feature- weights are calculated by an additional network with depthwise-separable-convolutions (DSC). Finally, the features are fused under adaptive weights. Experiments show that this method obtains excellent segmentation results with a good balance between rapidity and accuracy and owns robustness on segmentation of complex scenes. Copyright © 2021 Acta Automatica Sinica. All rights reserved.
引用
收藏
页码:1137 / 1148
页数:11
相关论文
共 30 条
  • [21] Yu F, Koltun V., Multi-scale context aggregation by dilated convolutions, (2015)
  • [22] Hu J, Shen L, Albanie S, Sun G, Wu E H., Squeeze-and- excitation networks, IEEE Transactions on Pattern Analysis and Machine Intelligence, (2017)
  • [23] Zhang Ting, Li Yu-Jian, Hu Hai-He, Zhang Ya-Hong, A gender classification model based on cross-connected convolutional neural networks, Acta Automatica Sinica, 42, 6, pp. 858-865, (2016)
  • [24] Rumelhart D E, Hinton G E, Williams R J., Learning representations by back-propagating errors, Nature, 323, 6088, pp. 533-536, (1986)
  • [25] Michael A N., Neural Networks and Deep Learning
  • [26] Cordts M, Omran M, Ramos S, Rehfeld T, Enzweiler M, Benenson R, Franke U, Roth S, Schiele B., The cityscapes dataset for semantic urban scene understanding, Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, pp. 3213-3223, (2016)
  • [27] Garcia-Garcia A, Orts-Escolano S, Oprea S, Villenamartinez V, Garciarodriguez J., A review on deep learning techniques applied to semantic segmentation, (2017)
  • [28] Paszke A, Chaurasia A, Kim S, Culurciello E., ENet: A deep neural network architecture for real-time semantic segmentation, (2016)
  • [29] Poudel R P K, Bonde U, Liwicki S, Zach C., ContextNet: Exploring context and detail for semantic segmentation in real-time, (2018)
  • [30] Romera Ealvarez J M, Bergasa L M, Arroyo R., ERFNet: Efficient residual factorized convnet for real-time semantic segmentation, IEEE Transactions on Intelligent Transportation Systems, 19, 1, pp. 263-272, (2017)