Disentangling Writer and Character Styles for Handwriting Generation

被引:28
作者
Dai, Gang [1 ]
Zhang, Yifan [2 ]
Wang, Qingfeng [1 ]
Du, Qing [1 ]
Yu, Zhuliang [1 ]
Liu, Zhuoman [3 ]
Huang, Shuangping [1 ,4 ]
机构
[1] South China Univ Technol, Guangzhou, Peoples R China
[2] Natl Univ Singapore, Singapore, Singapore
[3] Hong Kong Polytech Univ, Hong Kong, Peoples R China
[4] Pazhou Lab, Guangzhou, Peoples R China
来源
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR | 2023年
关键词
ONLINE;
D O I
10.1109/CVPR52729.2023.00579
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Training machines to synthesize diverse handwritings is an intriguing task. Recently, RNN-based methods have been proposed to generate stylized online Chinese characters. However, these methods mainly focus on capturing a person's overall writing style, neglecting subtle style inconsistencies between characters written by the same person. For example, while a person's handwriting typically exhibits general uniformity (e.g., glyph slant and aspect ratios), there are still small style variations in finer details (e.g., stroke length and curvature) of characters. In light of this, we propose to disentangle the style representations at both writer and character levels from individual handwritings to synthesize realistic stylized online handwritten characters. Specifically, we present the style-disentangled Transformer (SDT), which employs two complementary contrastive objectives to extract the style commonalities of reference samples and capture the detailed style patterns of each sample, respectively. Extensive experiments on various language scripts demonstrate the effectiveness of SDT. Notably, our empirical findings reveal that the two learned style representations provide information at different frequency magnitudes, underscoring the importance of separate style extraction. Our source code is public at: https://github.com/dailenson/SDT.
引用
收藏
页码:5977 / 5986
页数:10
相关论文
共 45 条
[1]   DeepWriting: Making Digital Ink Editable via Deep Generative Modeling [J].
Aksan, Emre ;
Pece, Fabrizio ;
Hilliges, Otmar .
PROCEEDINGS OF THE 2018 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS (CHI 2018), 2018,
[2]  
[Anonymous], 2017, Zi2zi: Master Chinese calligraphy with conditional adversarial networks
[3]  
[Anonymous], 2016, 25 INT JOINT C ARTIF
[4]  
Berndt Donald J, 1994, ACM SIGKDD INT C KNO
[5]   Handwriting Transformers [J].
Bhunia, Ankan Kumar ;
Khan, Salman ;
Cholakkal, Hisham ;
Anwer, Rao Muhammad ;
Khan, Fahad Shahbaz ;
Shah, Mubarak .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :1066-1074
[6]   Generating Handwritten Chinese Characters using CycleGAN [J].
Chang, Bo ;
Zhang, Qiong ;
Pan, Shenyi ;
Meng, Lili .
2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, :199-207
[7]  
Chen Zhounan, 2022, ACCV, P1060
[8]   FAST FOURIER TRANSFORM AND ITS APPLICATIONS [J].
COOLEY, JW ;
LEWIS, PAW ;
WELCH, PD .
IEEE TRANSACTIONS ON EDUCATION, 1969, E 12 (01) :27-&
[9]  
Gan J, 2021, AAAI CONF ARTIF INTE, V35, P7484
[10]  
Gao TY, 2021, 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), P6894