Multi-style cartoonization: Leveraging multiple datasets with generative adversarial networks

Cited by: 1
Authors
Cai, Jianlu [1]
Li, Frederick W. B. [2]
Nan, Fangzhe [1]
Yang, Bailin [1,3]
Affiliations
[1] Zhejiang Gongshang Univ, Dept Comp Sci & Technol, Hangzhou, Peoples R China
[2] Univ Durham, Dept Comp Sci, Durham, England
[3] Zhejiang Gongshang Univ, Dept Comp Sci, Hangzhou, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
generative adversarial network; multi-style transfer; photo cartoonization
DOI
10.1002/cav.2269
Chinese Library Classification number
TP31 [Computer software]
Discipline classification code
081202; 0835
Abstract
Scene cartoonization aims to convert photos into stylized cartoons. While generative adversarial networks (GANs) can generate high-quality images, previous methods focus on individual images or single styles, ignoring relationships between datasets. We propose a novel multi-style scene cartoonization GAN that leverages multiple cartoon datasets jointly. Our main technical contribution is a multi-branch style encoder that disentangles representations to model styles as distributions over entire datasets rather than over individual images. Combined with a multi-task discriminator and perceptual losses optimized across collections, our model achieves state-of-the-art diverse stylization while preserving semantics. Experiments demonstrate that by learning from inter-dataset relationships, our method translates photos into cartoon images with improved realism and abstraction fidelity compared to prior art, without iterative re-training for new styles.

We introduce a multi-style scene cartoonization GAN aiming to enhance the technique of photo-to-cartoon conversion. By amalgamating multiple cartoon datasets and employing innovative encoding methods, our model achieves more realistic and abstract cartoon effects, surpassing previous approaches. By capturing relationships between datasets, we can provide high-quality cartoon images without the need for tedious iterative retraining, marking a subtle but significant advancement in the field.
Pages: 15