Image Style Transfer Algorithm Based on Semantic Segmentation

被引:14
作者
Lin, Zhijie [1 ]
Wang, Zhizhong [2 ]
Chen, Haibo [2 ]
Ma, Xiaolong [3 ]
Xie, Chuan [4 ]
Xing, Wei [2 ]
Zhao, Lei [2 ]
Song, Wei [1 ]
机构
[1] Zhejiang Univ Sci & Technol, Sch Informat & Elect Engn, Hangzhou 310023, Peoples R China
[2] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310027, Peoples R China
[3] Huzhou Univ, Sch Management, Huzhou 313000, Peoples R China
[4] Hangzhou Vocat & Tech Coll, Hangzhou 310018, Peoples R China
关键词
Semantics; Feature extraction; Image segmentation; Data mining; Optimization; Kernel; Training; Image style transfer; semantic segmentation; semantic mismatching; feature extraction; fine semantic guidance; mask R-CNN;
D O I
10.1109/ACCESS.2021.3054969
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Most of the existing image style transfer algorithms transfer the whole image style as a whole. Style feature is a set of correlation matrix based on style image, namely Gram matrix. Each matrix is a global description of the style image. This kind of methods can perform well in the insensitive semantic scenes (such as the style transfer between landscape photos), but in the sensitive semantic scenes (such as the style transfer between portrait photos), the problem of semantic mismatch will be highlighted, such as transferring the background texture of the style image to the foreground of the target image. Although the existing research takes the manually annotated semantic image as an input of the algorithm, and then guides the style transfer based on the semantic information, and finally achieves good results in the style transfer between portraits. But there are still two problems: first, semantic images need to be manually annotated, which costs human resources. In practical applications, large-scale image style transfer is often needed. Second, the details of the synthesized image are fuzzy, and the definition is not enough. We propose an image style transfer algorithm based on semantic segmentation to resolve semantic mismatching in image style transfer. Our algorithm extracts the semantic information of style image and content image automatically through a semantic segmentation network and uses the semantic information to guide the style transfer. Our algorithm builds a semantic segmentation network based on mask R-CNN, introduces semantic information, and then makes style transfer on the patch level, realizes the style transfer between similar objects (consistent semantic information). Experiments on Celeba and Wikiart show that our method could automatically extract the semantic information of style image and content image. Compared with the state-of-art approaches in this field, our method can effectively avoid semantic mismatch in the process of image style transfer. That is, it can maintain semantic consistency in the process of style transfer.
引用
收藏
页码:54518 / 54529
页数:12
相关论文
共 29 条
[1]  
[Anonymous], 2018, WIKIART DB OL
[2]  
[Anonymous], 2019, P CVPR, DOI DOI 10.1080/10503307.2019.1649730
[3]  
[Anonymous], 2018, LARGE SCALE CELEBFAC
[4]  
[Anonymous], 2018, LABELME CPDK
[5]  
Armi L., 2019, Int. J. Image Process. Pattern Recognit, V2, P1, DOI DOI 10.48550/ARXIV.1904.06554
[6]   StyleBank: An Explicit Representation for Neural Image Style Transfer [J].
Chen, Dongdong ;
Yuan, Lu ;
Liao, Jing ;
Yu, Nenghai ;
Hua, Gang .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2770-2779
[7]   Creative and diverse artwork generation using adversarial networks [J].
Chen, Haibo ;
Zhao, Lei ;
Qiu, Lihong ;
Wang, Zhizhong ;
Zhang, Huiming ;
Xing, Wei ;
Lu, Dongming .
IET COMPUTER VISION, 2020, 14 (08) :650-657
[8]   Image Style Transfer Using Convolutional Neural Networks [J].
Gatys, Leon A. ;
Ecker, Alexander S. ;
Bethge, Matthias .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :2414-2423
[9]   Arbitrary Style Transfer in Real-time with Adaptive Instance Normalization [J].
Huang, Xun ;
Belongie, Serge .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :1510-1519
[10]   Stroke Controllable Fast Style Transfer with Adaptive Receptive Fields [J].
Jing, Yongcheng ;
Liu, Yang ;
Yang, Yezhou ;
Feng, Zunlei ;
Yu, Yizhou ;
Tao, Dacheng ;
Song, Mingli .
COMPUTER VISION - ECCV 2018, PT XIII, 2018, 11217 :244-260