CLIP-FONT: SEMENTIC SELF-SUPERVISED FEW-SHOT FONT GENERATION WITH CLIP

被引：1

作者：

Xiong, Jialu ^{[1
]}

Wang, Yefei ^{[2
]}

Zeng, Jinshan ^{[2
]}

机构：

[1] Jiangxi Normal Univ, Sch Digital Ind, Shangrao, Peoples R China

[2] Jiangxi Normal Univ, Sch Comp & Informat Engn, Nanchang, Jiangxi, Peoples R China

来源：

2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024 | 2024年

基金：

中国国家自然科学基金;

关键词：

Font generation; few-shot; self-supervision;

D O I：

10.1109/ICASSP48485.2024.10447490

中图分类号：

学科分类号：

摘要：

Font design is a very resource-intensive endeavor, especially for intricate fonts. The task of few-shot font generation (FFG) has attracted great interest recently. This method captures style from a limited set of reference glyphs and then transfers it to other characters to generate diverse style fonts. Existing FFG methods mainly revolve around learning font content or style. However, these methods often only learn content or style or lack the ability to represent style and content, resulting in poor font quality. To address these issues, we introduce CLIP-Font-a novel few-shot font generation model. CLIP-Font uses font text semantics for self-supervision to guide font generation at the content level, and uses attention-based contrast learning at the style level to capture the representation capabilities of the font fine-grained style enhancement model. Experimental results on various datasets demonstrate the effectiveness of our method, surpassing the performance of existing FFG techniques.

引用

页码：3620 / 3624

页数：5

共 24 条

[21] Few-Shot Steel Surface Defect Recognition via Self-Supervised Teacher–Student Model With Min–Max Instances Similarity
Wang, Tianlei
Li, Zeliang
Xu, Ying
Chen, Jiacong
Genovese, Angelo
Piuri, Vincenzo
Scotti, Fabio
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
[22] SgVA-CLIP: Semantic-Guided Visual Adapting of Vision-Language Models for Few-Shot Image Classification
Peng, Fang
Yang, Xiaoshan
Xiao, Linhui
Wang, Yaowei
Xu, Changsheng
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 3469 - 3480
[23] LOGEN: Few-Shot Logical Knowledge-Conditioned Text Generation With Self-Training
Deng, Shumin
Yang, Jiacheng
Ye, Hongbin
Tan, Chuanqi
Chen, Mosha
Huang, Songfang
Huang, Fei
Chen, Huajun
Zhang, Ningyu
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 2124 - 2133
[24] Self-taught cross-domain few-shot learning with weakly supervised object localization and task-decomposition
Liu, Xiyao
Ji, Zhong
Pang, Yanwei
Han, Zhi
KNOWLEDGE-BASED SYSTEMS, 2023, 265

← 1 2 3 →