ArtAdapter: Text-to-Image Style Transfer using Multi-Level Style Encoder and Explicit Adaptation

被引：2

作者：

Chen, Dar-Yen ^{[1
]}

Tennent, Hamish ^{[1
]}

Hsu, Ching-Wen ^{[1
]}

机构：

[1] PicCollage, Taipei, Taiwan

来源：

2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024 | 2024年

关键词：

D O I：

10.1109/CVPR52733.2024.00823

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This work introduces ArtAdapter, a transformative text-to-image (T2I) style transfer framework that transcends traditional limitations of color, brushstrokes, and object shape, capturing high-level style elements such as composition and distinctive artistic expression. The integration of a multi-level style encoder with our proposed explicit adaptation mechanism enables ArtAdapter to achieve unprecedented fidelity in style transfer, ensuring close alignment with textual descriptions. Additionally, the incorporation of an Auxiliary Content Adapter (ACA) effectively separates content from style, alleviating the borrowing of content from style references. Moreover, our novel fast finetuning approach could further enhance zero-shot style representation while mitigating the risk of overfitting. Comprehensive evaluations confirm that ArtAdapter surpasses current state-of-the-art methods.

引用

页码：8619 / 8628

页数：10

共 56 条

[1] An Jie, 2021, P IEEE CVF C COMP VI
[2] [Anonymous], 2022, christophschuhmann/improved-aesthetic-predictor: CLIP+MLP aesthetic score predictor
[3] [Anonymous], 2023, ChatGPT
[4] [Anonymous], 2017, BMVC
[5] Betker James, 2023, IMPROVING IMAGE GENE
[6] Chen Haibo, 2021, ADV NEURAL INFORM PR
[7] Chen Hao, 2021, ADV NEUR IN
[8] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
Cheng, Kun
Cun, Xiaodong
Zhang, Yong
Xia, Menghan
Yin, Fei
Zhu, Mingrui
Wang, Xuan
Wang, Jue
Wang, Nannan
[J]. PROCEEDINGS SIGGRAPH ASIA 2022, 2022,
[9] Weakly Supervised Region-Level Contrastive Learning for Efficient Object Detection
Deng, Yuang
Zhang, Yuhang
Dai, Wenrui
Zhang, Xiaopeng
Xiong, Hongkai
[J]. 2022 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2022,
[10] Dhariwal P, 2021, ADV NEUR IN, V34

← 1 2 3 4 5 6 →