A Diffusion Model Translator for Efficient Image-to-Image Translation

被引:3
|
作者
Xia, Mengfei [1 ]
Zhou, Yu [1 ]
Yi, Ran [2 ]
Liu, Yong-Jin [1 ]
Wang, Wenping [3 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, MOE Key Lab Pervas Comp, Beijing 100084, Peoples R China
[2] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai 200240, Peoples R China
[3] Texas A&M Univ, Dept Comp Sci & Comp Engn, College Stn, TX 77840 USA
基金
中国国家自然科学基金; 北京市自然科学基金;
关键词
Task analysis; Noise reduction; Diffusion models; Diffusion processes; Training; Computer science; Trajectory; image translation; deep learning; generative models;
D O I
10.1109/TPAMI.2024.3435448
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Applying diffusion models to image-to-image translation (I2I) has recently received increasing attention due to its practical applications. Previous attempts inject information from the source image into each denoising step for an iterative refinement, thus resulting in a time-consuming implementation. We propose an efficient method that equips a diffusion model with a lightweight translator, dubbed a Diffusion Model Translator (DMT), to accomplish I2I. Specifically, we first offer theoretical justification that in employing the pioneering DDPM work for the I2I task, it is both feasible and sufficient to transfer the distribution from one domain to another only at some intermediate step. We further observe that the translation performance highly depends on the chosen timestep for domain transfer, and therefore propose a practical strategy to automatically select an appropriate timestep for a given task. We evaluate our approach on a range of I2I applications, including image stylization, image colorization, segmentation to image, and sketch to image, to validate its efficacy and general utility. The comparisons show that our DMT surpasses existing methods in both quality and efficiency. Code is available at https://github.com/THU-LYJ-Lab/dmt.
引用
收藏
页码:10272 / 10283
页数:12
相关论文
共 50 条
  • [31] DMDIT: Diverse multi-domain image-to-image translation
    Shao, Mingwen
    Zhang, Youcai
    Liu, Huan
    Wang, Chao
    Li, Le
    Shao, Xun
    KNOWLEDGE-BASED SYSTEMS, 2021, 229
  • [32] GEN: Generative Equivariant Networks for Diverse Image-to-Image Translation
    Shamsolmoali, Pourya
    Zareapoor, Masoumeh
    Das, Swagatam
    Garcia, Salvador
    Granger, Eric
    Yang, Jie
    IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (02) : 874 - 886
  • [33] InvolutionGAN: lightweight GAN with involution for unsupervised image-to-image translation
    Deng, Haipeng
    Wu, Qiuxia
    Huang, Han
    Yang, Xiaowei
    Wang, Zhiyong
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (22) : 16593 - 16605
  • [34] Deep Generative Adversarial Networks for Image-to-Image Translation: A Review
    Alotaibi, Aziz
    SYMMETRY-BASEL, 2020, 12 (10): : 1 - 26
  • [35] InvolutionGAN: lightweight GAN with involution for unsupervised image-to-image translation
    Haipeng Deng
    Qiuxia Wu
    Han Huang
    Xiaowei Yang
    Zhiyong Wang
    Neural Computing and Applications, 2023, 35 : 16593 - 16605
  • [36] Underwater dam crack image generation based on unsupervised image-to-image translation
    Huang, Ben
    Kang, Fei
    Li, Xinyu
    Zhu, Sisi
    AUTOMATION IN CONSTRUCTION, 2024, 163
  • [37] Semantically Consistent Image-to-Image Translation for Unsupervised Domain Adaptation
    Brehm, Stephan
    Scherer, Sebastian
    Lienhart, Rainer
    ICAART: PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 2, 2022, : 131 - 141
  • [38] Deep learning for thermal-RGB image-to-image translation
    Wadsworth, Emma
    Mahajan, Advait
    Prasad, Raksha
    Menon, Rajesh
    INFRARED PHYSICS & TECHNOLOGY, 2024, 141
  • [39] Deep Networks for Image-to-Image Translation with Mux and Demux Layers
    Liu, Hanwen
    Michelini, Pablo Navarrete
    Zhu, Dan
    COMPUTER VISION - ECCV 2018 WORKSHOPS, PT V, 2019, 11133 : 150 - 165
  • [40] Joint Image-to-Image Translation for Traffic Monitoring Driver Face Image Enhancement
    Hu, Chang-Hui
    Liu, Yu
    Xu, Lin-Tao
    Jing, Xiao-Yuan
    Lu, Xiao-Bo
    Yang, Wan-Kou
    Liu, Pan
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (08) : 7961 - 7973