Query-Selected Global Attention for Text guided Image Style Transfer using Diffusion Model

被引:0
|
作者
Hwang, Jungmin [1 ]
Lee, Won-Sook [1 ]
机构
[1] Univ Ottawa, Fac Engn, Sch EECS, Ottawa, ON, Canada
关键词
Diffusion; Style Transfer; Query Selection; Global Attention;
D O I
10.1109/CAI59869.2024.00207
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Diffusion models have gained tremendous interest in image generation. Additionally, guided text methods for manipulating source images have shown successful progress. However, research on style transfer using diffusion models is still ongoing to address the trade-off between style transfer and content preservation. One representative solution to the issue is contrastive learning in a self-supervised manner, which is useful for extracting specific features from the same location on source and generated images for every pixel. However, there are instances where it is necessary to preserve certain areas, which contain more information from the source image compared to other areas in the image. Therefore, we propose anchoring the areas for preservation and intentionally selecting features at the anchor points through a query-selected global attention method. This enables our method to generate an image that preserves the content of the source while transferring the style without the need for additional fine-tuning or auxiliary network. Our diffusion model follows a simple architecture to enhance image quality and speed up inference time, in comparison to other diffusion methods. Our experimental results also demonstrate superior performance.
引用
收藏
页码:1162 / 1166
页数:5
相关论文
共 50 条
  • [21] GLAD: A Global-Attention-Based Diffusion Model for Infrared and Visible Image Fusion
    Guo, Haozhe
    Chen, Mengjie
    Li, Kaijiang
    Su, Hao
    Lv, Pei
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT VII, ICIC 2024, 2024, 14868 : 345 - 356
  • [22] TIC: text-guided image colorization using conditional generative model
    Ghosh, Subhankar
    Roy, Prasun
    Bhattacharya, Saumik
    Pal, Umapada
    Blumenstein, Michael
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (14) : 41121 - 41136
  • [23] TIC: text-guided image colorization using conditional generative model
    Subhankar Ghosh
    Prasun Roy
    Saumik Bhattacharya
    Umapada Pal
    Michael Blumenstein
    Multimedia Tools and Applications, 2024, 83 : 41121 - 41136
  • [24] RSDiff: remote sensing image generation from text using diffusion model
    Ahmad Sebaq
    Mohamed ElHelw
    Neural Computing and Applications, 2024, 36 (36) : 23103 - 23111
  • [25] Illustrating Classic Brazilian Books using a Text-To-Image Diffusion Model
    Mahlow, Felipe Rodrigues Perche
    Castaneda, William Alberto Cruz
    Zanella, Andre Felipe
    Sarzi-Ribeiro, Regilene Aparecida
    IEEE LATIN AMERICA TRANSACTIONS, 2024, 22 (12) : 1000 - 1008
  • [26] Aerial Diffusion: Text Guided Ground-to-Aerial View Synthesis from a Single Image using Diffusion Models
    Kothandaraman, Divya
    Zhou, Tianyi
    Lin, Ming
    Manocha, Dinesh
    PROCEEDINGS SIGGRAPH ASIA 2023 TECHNICAL COMMUNICATIONS, SA TECHNICAL COMMUNICATIONS 2023, 2023,
  • [27] Text-to-Audio Generation using Instruction-Guided Latent Diffusion Model
    Ghosal, Deepanway
    Majumder, Navonil
    Mehrish, Ambuj
    Poria, Soujanya
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 3590 - 3598
  • [28] Dementia Prediction Support Model Using Regression Analysis and Image Style Transfer
    Baek, Ji-Won
    Chung, Kyungyong
    APPLIED SCIENCES-BASEL, 2022, 12 (07):
  • [29] Non-parallel text style transfer with domain adaptation and an attention model (vol 51, pg 4609, 2021)
    Hu, Mingxuan
    He, Min
    APPLIED INTELLIGENCE, 2021, 51 (11) : 8564 - 8564
  • [30] Deep color calibration for UAV imagery in crop monitoring using semantic style transfer with local to global attention
    Huang, Huasheng
    Yang, Aqing
    Tang, Yu
    Zhuang, Jiajun
    Hou, Chaojun
    Tan, Zhiping
    Dananjayan, Sathian
    He, Yong
    Guo, Qiwei
    Luo, Shaoming
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2021, 104