A semantic segmentation algorithm for fashion images based on modified mask RCNN

被引:0
作者
Wentao He
Jing’an Wang
Lei Wang
Ruru Pan
Weidong Gao
机构
[1] Jiangnan University,Key Laboratory of Eco
来源
Multimedia Tools and Applications | 2023年 / 82卷
关键词
Clothes; Semantic segmentation; Mask RCNN; Feature pyramid network; Detection accuracy;
D O I
暂无
中图分类号
学科分类号
摘要
The semantic segmentation of human body images has huge application potential in many fields, such as autonomous driving, artificial intelligence (AI) face changing, and virtual try-on. Nowadays, many researchers use additional human body posture information to generate multi-level human body analysis images. However, the existing method has limitations when faced with multiple poses and overlapping targets. In this paper, a novel algorithm based on Mask RCNN which has pixel-level accuracy is proposed. In the feature extraction process, a multi-scale feature fusion module applying dilated convolution is proposed to obtain richer semantic information from different perceptual fields. We added a small residual module to the original residual unit structure to increase the size of the receptive field of each layer to capture details and global characteristics. Three convolution kernels with different ratios are designed to obtain receptive fields of different scales. The experimental results show that our method has better performance while considering both object positioning and target classification.
引用
收藏
页码:28427 / 28444
页数:17
相关论文
共 57 条
  • [1] Arsalan M(2019)FRED-Net: fully residual encoder–decoder network for accurate iris segmentation Expert Syst Appl 122 217-241
  • [2] Kim DS(2017)Segnet: a deep convolutional encoder-decoder architecture for image segmentation IEEE Trans Pattern Anal Mach Intell 39 2481-2495
  • [3] Lee MB(2020)Multi-layer adaptive feature fusion for semantic segmentation Neural Process Lett 51 1081-1092
  • [4] Owais M(2019)Res2net: a new multi-scale backbone architecture IEEE Trans Pattern Anal Mach Intell 70 41-65
  • [5] Park KR(2018)A survey on deep learning techniques for image and video semantic segmentation Appl Soft Comput 42 386-397
  • [6] Badrinarayanan V(2020)Mask R-CNN IEEE Trans Pattern Anal Mach Intell 13 1565-634
  • [7] Kendall A(2021)DeepLabV3-Refiner-based semantic segmentation model for dense 3D point clouds Remote Sens 34 616-265
  • [8] Cipolla R(2019)Automatic pixel-level multiple damage detection of concrete structure using fully convolutional network Comput Civ Infrastruct Eng 16 253-146
  • [9] Chen Y(2013)Fashion parsing with weak color-category labels IEEE Trans Multimed 13 131-11536
  • [10] Hu H(2020)Deep context aware recurrent neural network for semantic segmentation of large scale unstructured 3D point cloud Neural Process Lett 74 11517-99