ArtAdapter: Text-to-Image Style Transfer using Multi-Level Style Encoder and Explicit Adaptation

被引:2
作者
Chen, Dar-Yen [1 ]
Tennent, Hamish [1 ]
Hsu, Ching-Wen [1 ]
机构
[1] PicCollage, Taipei, Taiwan
来源
2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024 | 2024年
关键词
D O I
10.1109/CVPR52733.2024.00823
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This work introduces ArtAdapter, a transformative text-to-image (T2I) style transfer framework that transcends traditional limitations of color, brushstrokes, and object shape, capturing high-level style elements such as composition and distinctive artistic expression. The integration of a multi-level style encoder with our proposed explicit adaptation mechanism enables ArtAdapter to achieve unprecedented fidelity in style transfer, ensuring close alignment with textual descriptions. Additionally, the incorporation of an Auxiliary Content Adapter (ACA) effectively separates content from style, alleviating the borrowing of content from style references. Moreover, our novel fast finetuning approach could further enhance zero-shot style representation while mitigating the risk of overfitting. Comprehensive evaluations confirm that ArtAdapter surpasses current state-of-the-art methods.
引用
收藏
页码:8619 / 8628
页数:10
相关论文
共 56 条
  • [1] An Jie, 2021, P IEEE CVF C COMP VI
  • [2] [Anonymous], 2022, christophschuhmann/improved-aesthetic-predictor: CLIP+MLP aesthetic score predictor
  • [3] [Anonymous], 2023, ChatGPT
  • [4] [Anonymous], 2017, BMVC
  • [5] Betker James, 2023, IMPROVING IMAGE GENE
  • [6] Chen Haibo, 2021, ADV NEURAL INFORM PR
  • [7] Chen Hao, 2021, ADV NEUR IN
  • [8] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
    Cheng, Kun
    Cun, Xiaodong
    Zhang, Yong
    Xia, Menghan
    Yin, Fei
    Zhu, Mingrui
    Wang, Xuan
    Wang, Jue
    Wang, Nannan
    [J]. PROCEEDINGS SIGGRAPH ASIA 2022, 2022,
  • [9] Weakly Supervised Region-Level Contrastive Learning for Efficient Object Detection
    Deng, Yuang
    Zhang, Yuhang
    Dai, Wenrui
    Zhang, Xiaopeng
    Xiong, Hongkai
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2022,
  • [10] Dhariwal P, 2021, ADV NEUR IN, V34