A Diffusion Model Translator for Efficient Image-to-Image Translation

被引：3

作者：

Xia, Mengfei ^{[1
]}

Zhou, Yu ^{[1
]}

Yi, Ran ^{[2
]}

Liu, Yong-Jin ^{[1
]}

Wang, Wenping ^{[3
]}

机构：

[1] Tsinghua Univ, Dept Comp Sci & Technol, MOE Key Lab Pervas Comp, Beijing 100084, Peoples R China

[2] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai 200240, Peoples R China

[3] Texas A&M Univ, Dept Comp Sci & Comp Engn, College Stn, TX 77840 USA

来源：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2024年 / 46卷 / 12期

基金：

中国国家自然科学基金; 北京市自然科学基金;

关键词：

Task analysis; Noise reduction; Diffusion models; Diffusion processes; Training; Computer science; Trajectory; image translation; deep learning; generative models;

D O I：

10.1109/TPAMI.2024.3435448

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Applying diffusion models to image-to-image translation (I2I) has recently received increasing attention due to its practical applications. Previous attempts inject information from the source image into each denoising step for an iterative refinement, thus resulting in a time-consuming implementation. We propose an efficient method that equips a diffusion model with a lightweight translator, dubbed a Diffusion Model Translator (DMT), to accomplish I2I. Specifically, we first offer theoretical justification that in employing the pioneering DDPM work for the I2I task, it is both feasible and sufficient to transfer the distribution from one domain to another only at some intermediate step. We further observe that the translation performance highly depends on the chosen timestep for domain transfer, and therefore propose a practical strategy to automatically select an appropriate timestep for a given task. We evaluate our approach on a range of I2I applications, including image stylization, image colorization, segmentation to image, and sketch to image, to validate its efficacy and general utility. The comparisons show that our DMT surpasses existing methods in both quality and efficiency. Code is available at https://github.com/THU-LYJ-Lab/dmt.

引用

页码：10272 / 10283

页数：12

共 50 条

[1] DiffI2I: Efficient Diffusion Model for Image-to-Image Translation
Xia, Bin
Zhang, Yulun
Wang, Shiyin
Wang, Yitong
Wu, Xinglong
Tian, Yapeng
Yang, Wenming
Timotfe, Radu
Van Gool, Luc
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2025, 47 (03) : 1578 - 1593
[2] Unpaired Image-to-Image Translation with Diffusion Adversarial Network
Tu, Hangyao
Wang, Zheng
Zhao, Yanwei
MATHEMATICS, 2024, 12 (20)
[3] Dissecting and Mitigating Semantic Discrepancy in Stable Diffusion for Image-to-Image Translation
Yuan, Yifan
Yang, Guanqun
Wang, James Z.
Zhang, Hui
Shan, Hongming
Wang, Fei-Yue
Zhang, Junping
IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2025, 12 (04) : 705 - 718
[4] Allowing Supervision in Unsupervised Deformable- Instances Image-to-Image Translation
Liu, Yu
Su, Sitong
Zhu, Junchen
Zheng, Feng
Gao, Lianli
Song, Jingkuan
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (07) : 5335 - 5349
[5] Vector Quantized Image-to-Image Translation
Chen, Yu-Jie
Cheng, Shin-I
Chiu, Wei-Chen
Tseng, Hung-Yu
Lee, Hsin-Ying
COMPUTER VISION - ECCV 2022, PT XVI, 2022, 13676 : 440 - 456
[6] Unsupervised Image-to-Image Translation: A Review
Hoyez, Henri
Schockaert, Cedric
Rambach, Jason
Mirbach, Bruno
Stricker, Didier
SENSORS, 2022, 22 (21)
[7] Image-to-Image Translation: Methods and Applications
Pang, Yingxue
Lin, Jianxin
Qin, Tao
Chen, Zhibo
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 3859 - 3881
[8] Complementary, Heterogeneous and Adversarial Networks for Image-to-Image Translation
Gao, Fei
Xu, Xingxin
Yu, Jun
Shang, Meimei
Li, Xiang
Tao, Dacheng
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 3487 - 3498
[9] Literature Review of Generative models for Image-to-Image translation problems
Kamil, Anwar
Shaikh, Talal
PROCEEDINGS OF 2019 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND KNOWLEDGE ECONOMY (ICCIKE' 2019), 2019, : 341 - 346
[10] Zero-shot Image-to-Image Translation
Parmar, Gaurav
Singh, Krishna Kumar
Zhang, Richard
Li, Yijun
Lu, Jingwan
Zhu, Jun-Yan
PROCEEDINGS OF SIGGRAPH 2023 CONFERENCE PAPERS, SIGGRAPH 2023, 2023,

← 1 2 3 4 5 →