CLIP-FONT: SEMENTIC SELF-SUPERVISED FEW-SHOT FONT GENERATION WITH CLIP

被引:1
作者
Xiong, Jialu [1 ]
Wang, Yefei [2 ]
Zeng, Jinshan [2 ]
机构
[1] Jiangxi Normal Univ, Sch Digital Ind, Shangrao, Peoples R China
[2] Jiangxi Normal Univ, Sch Comp & Informat Engn, Nanchang, Jiangxi, Peoples R China
来源
2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024 | 2024年
基金
中国国家自然科学基金;
关键词
Font generation; few-shot; self-supervision;
D O I
10.1109/ICASSP48485.2024.10447490
中图分类号
学科分类号
摘要
Font design is a very resource-intensive endeavor, especially for intricate fonts. The task of few-shot font generation (FFG) has attracted great interest recently. This method captures style from a limited set of reference glyphs and then transfers it to other characters to generate diverse style fonts. Existing FFG methods mainly revolve around learning font content or style. However, these methods often only learn content or style or lack the ability to represent style and content, resulting in poor font quality. To address these issues, we introduce CLIP-Font-a novel few-shot font generation model. CLIP-Font uses font text semantics for self-supervision to guide font generation at the content level, and uses attention-based contrast learning at the style level to capture the representation capabilities of the font fine-grained style enhancement model. Experimental results on various datasets demonstrate the effectiveness of our method, surpassing the performance of existing FFG techniques.
引用
收藏
页码:3620 / 3624
页数:5
相关论文
共 24 条
  • [21] Few-Shot Steel Surface Defect Recognition via Self-Supervised Teacher–Student Model With Min–Max Instances Similarity
    Wang, Tianlei
    Li, Zeliang
    Xu, Ying
    Chen, Jiacong
    Genovese, Angelo
    Piuri, Vincenzo
    Scotti, Fabio
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [22] SgVA-CLIP: Semantic-Guided Visual Adapting of Vision-Language Models for Few-Shot Image Classification
    Peng, Fang
    Yang, Xiaoshan
    Xiao, Linhui
    Wang, Yaowei
    Xu, Changsheng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 3469 - 3480
  • [23] LOGEN: Few-Shot Logical Knowledge-Conditioned Text Generation With Self-Training
    Deng, Shumin
    Yang, Jiacheng
    Ye, Hongbin
    Tan, Chuanqi
    Chen, Mosha
    Huang, Songfang
    Huang, Fei
    Chen, Huajun
    Zhang, Ningyu
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 2124 - 2133
  • [24] Self-taught cross-domain few-shot learning with weakly supervised object localization and task-decomposition
    Liu, Xiyao
    Ji, Zhong
    Pang, Yanwei
    Han, Zhi
    KNOWLEDGE-BASED SYSTEMS, 2023, 265