Attention mechanism-based generative adversarial networks for image cartoonization

被引：0

作者：

Zhao, Wenqing ^{[1
]}

Zhu, Jianlin ^{[2
]}

Li, Ping ^{[3
]}

Huang, Jin ^{[1
]}

Tang, Junwei ^{[1
]}

机构：

[1] Wuhan Text Univ, Sch Comp Sci & Artificial Intelligence, Wuhan 430200, Peoples R China

[2] South Cent Minzu Univ, Dept Comp Sci, Wuhan 430070, Peoples R China

[3] Hong Kong Polytech Univ, Kowloon, Hong Kong, Peoples R China

来源：

VISUAL COMPUTER | 2024年 / 40卷 / 06期

关键词：

Cartoonization; Style transfer; Generative adversarial networks; Attention mechanism;

D O I：

10.1007/s00371-024-03404-4

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

As a common art form in daily life, cartoon images play an important role in the fields of movie production and science education. However, intelligently generating different style cartoon images from real-world photographs often has a number of problems, which mainly include: (1) The generated images do not have obvious cartoon-style textures; (2) the generated images are prone to structural confusion, color artifacts, and loss of the original image content. Therefore, style transfer and preservation of original content is a great challenge in the field of image cartoonization. In this paper, we propose an attention mechanism-based generating adversarial networks for image cartoonization to address the above problem. The method uses the attention module to perform feature correction on the deep network features extracted from the residual blocks in the generative model, so that it strengthens the cartoon features of the generated image and enhances the ability of the generative model to perceive the cartoon style. At the same time, we also use the attention module to perform feature correction on the features extracted from the convolutional block in the discriminative model, so that it can strengthen the discriminative ability between the generated cartoon image and the real cartoon image while reducing the structural confusion, color artifacts, and loss of the original image content caused by style transfer. Qualitative experiments and quantitative evaluations demonstrate the advantages of our method in terms of style transfer and content preservation, while ablation study validates the role of each module in the method. The code is https://github.com/zwq11/Image-Cartoonization.git.

引用

页码：3971 / 3984

页数：14

共 42 条

[1]

Carion Nicolas., 2020, EUR C COMPUT VIS

[2] CartoonGAN: Generative Adversarial Networks for Photo Cartoonization [J].

Chen, Yang ;

Lai, Yu-Kun ;

Liu, Yong-Jin .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :9465-9474

[3]

Chen Y, 2017, IEEE IMAGE PROC, P2010, DOI 10.1109/ICIP.2017.8296634

[4] Deep Colorization [J].

Cheng, Zezhou ;

Yang, Qingxiong ;

Sheng, Bin .

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :415-423

[5] Image-to-Image Translation via Group-wise Deep Whitening-and-Coloring Transformation [J].

Cho, Wonwoong ;

Choi, Sungha ;

Park, David Keetae ;

Shin, Inkyu ;

Choo, Jaegul .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :10631-10639

[6]

Dumoulin V, 2018, Arxiv, DOI [arXiv:1603.07285, 10.48550/arXiv.1603.07285, DOI 10.48550/ARXIV.1603.07285]

[7]

Gatys L, 2016, J VISION, V16, P326, DOI [10.1167/16.12.326, DOI 10.1167/16.12.326, 10.1167/16.12.326]

[8]

Gatys LA, 2015, ADV NEUR IN, V28

[9] Generative Adversarial Networks [J].

Goodfellow, Ian ;

Pouget-Abadie, Jean ;

Mirza, Mehdi ;

Xu, Bing ;

Warde-Farley, David ;

Ozair, Sherjil ;

Courville, Aaron ;

Bengio, Yoshua .

COMMUNICATIONS OF THE ACM, 2020, 63 (11) :139-144

[10]

Guan Q., 2018, ARXIV

← 1 2 3 4 5 →