Semantic Relocation Parallel Network for Semantic Segmentation

被引:0
|
作者
Chen S. [1 ]
Xu L. [1 ]
Zou B. [2 ]
Chen J. [3 ]
机构
[1] School of Computer Science, Xiangtan University, Xiangtan
[2] School of Computer Science and Engineering, Central South University, Changsha
[3] Computer Center, Xiangtan University, Xiangtan
来源
Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics | 2022年 / 34卷 / 03期
关键词
Feature extractor; Feature fusion; Semantic relocation; Semantic segmentation;
D O I
10.3724/SP.J.1089.2022.18909
中图分类号
学科分类号
摘要
Semantic segmentation is an essential issue in the computer vision field, the difficulty of which lies in the accurate prediction of the pixel level and the edge division of similar objects. The encoder-encoder structure is widely used in many methods to capture the global information of semantic objects. However, continuous subsampling causes irreversible loss of spatial information of the feature map. A parallel semantic relocation (SRPNet) based network is proposed. Specifically, a high-resolution global spatial path is designed to extract rich spatial information in which feature maps have high resolution. In feature extraction path, a powerful feature extractor is used to expand the receptive field by fast subsampling. Besides, a semantic relocation module (SRM) is designed to compensate for the lack of context information caused by multiple subsamples. Dice loss is employed to alleviate the imbalance of positive and negative samples in the dataset and obtain better segmentation performance. Finally, the proposed network is evaluated on the Cityscapes and CamVid dataset. The results show that SRPNet can improve the previous best result by approximately 3.1% and 1.8% measured by mIoU on the CamVid dataset and the Cityscapes dataset, respectively. © 2022, Beijing China Science Journal Publishing Co. Ltd. All right reserved.
引用
收藏
页码:373 / 381
页数:8
相关论文
共 32 条
  • [1] Davis L S, Rosenfeld A, Weszka J S., Region extraction by averaging and thresholding, IEEE Transactions on Systems, Man, and Cybernetics, 3, pp. 383-388, (1975)
  • [2] Senthilkumaran N, Rajesh R., Edge detection techniques for image segmentation - a survey of soft computing approaches, Proceedings of the International Conference on Advances in Recent Technologies in Communication and Computing, pp. 844-846, (2009)
  • [3] Nowozin S, Christoph H L., Structured learning and prediction in computer vision, Foundations and Trends in Computer Graphics and Vision, 6, 3, pp. 185-365, (2010)
  • [4] Sulaiman S N, Isa N A M., Adaptive fuzzy-K-means clustering algorithm for image segmentation, IEEE Transactions on Consumer Electronics, 56, 4, pp. 2661-2668, (2010)
  • [5] Long J, Shelhamer E, Darrell T., Fully convolutional networks for semantic segmentation, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3431-3440, (2015)
  • [6] Mehta S, Rastegari M, Caspi A, Et al., ESPNet: efficient spatial pyramid of dilated convolutions for semantic segmentation, Proceedings of the European Conference on Computer Vision, pp. 561-580, (2018)
  • [7] Li H C, Xiong P F, Fan H Q, Et al., DFANet: deep feature aggregation for real-time semantic segmentation, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 9514-9523, (2019)
  • [8] Wu T Y, Tang S, Zhang R, Et al., CGNet: a light-weight context guided network for semantic segmentation, IEEE Transactions on Image Processing, 30, pp. 1169-1179, (2021)
  • [9] Badrinarayanan V, Kendall A, Cipolla R., SegNet: a deep convolutional encoder-decoder architecture for image segmentation, IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, 12, pp. 2481-2495, (2017)
  • [10] Chen L C, Papandreou G, Kokkinos I, Et al., DeepLab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs, IEEE Transactions on Pattern Analysis and Machine Intelligence, 40, 4, pp. 834-848, (2018)