Self-attention Handwriting Generative Model

被引：0

作者：

Wang, Yu-Chiao ^{[1
]}

Hsieh, Tung-Ju ^{[1
]}

Chiang, Pei-Ying ^{[1
]}

机构：

[1] Natl Taipei Univ Technol, Comp Sci & Informat Engn, Taipei, Taiwan

来源：

PROCEEDINGS SIGGRAPH ASIA 2024 POSTERS | 2024年

关键词：

Handwriting; Font Generation; GAN; Self-attention;

D O I：

10.1145/3681756.3697883

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recent advancements in font generation models include GANs, diffusion models, and transformers. This study introduces a GAN-based model, zi2zi self-attention, which enhances the zi2zi model by incorporating Residual Blocks (ResNet Block) and Self-Attention Layers in the encoder and decoder. These enhancements improve the ability to capture complex font details, mimicking the writer's style while preserving the source image's content. The study also introduces Perceptual Loss, using high-level features from the VGG-19 network to improve visual consistency and quality. The Adam optimizer with appropriate learning rates and Beta parameters is employed to enhance convergence speed and stability. For comparison, four other mainstream models are evaluated: MX-Font, CF-Font, SDT, and Font-Diffuser. These models differ from zi2zi in their focus on extracting content characters and writing style features rather than solely using adversarial networks. Experimental results show the potential and effectiveness of zi2zi-Self-Attention in generating high-quality, visually appealing handwritten

引用

页数：2

共 8 条

[1] Disentangling Writer and Character Styles for Handwriting Generation [J].

Dai, Gang ;

Zhang, Yifan ;

Wang, Qingfeng ;

Du, Qing ;

Yu, Zhuliang ;

Liu, Zhuoman ;

Huang, Shuangping .

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, :5977-5986

[2] Deep Residual Learning for Image Recognition [J].

He, Kaiming ;

Zhang, Xiangyu ;

Ren, Shaoqing ;

Sun, Jian .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778

[3]

He Tianchang, 2018, zi2zi: Master Chinese Calligraphy with Conditional Adversarial Networks

[4] Perceptual Losses for Real-Time Style Transfer and Super-Resolution [J].

Johnson, Justin ;

Alahi, Alexandre ;

Li Fei-Fei .

COMPUTER VISION - ECCV 2016, PT II, 2016, 9906 :694-711

[5]

Park Song, 2021, ICCV 2021

[6]

Vaswani A, 2017, Advances in neural information processing systems, P5998, DOI [10.48550/arXiv.1706.03762, DOI 10.48550/ARXIV.1706.03762]

[7] CF-Font: Content Fusion for Few-shot Font Generation [J].

Wang, Chi ;

Zhou, Min ;

Ge, Tiezheng ;

Jiang, Yuning ;

Bao, Hujun ;

Xu, Weiwei .

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, :1858-1867

[8]

Yang ZH, 2024, AAAI CONF ARTIF INTE, P6603

← 1 →