Uni-paint: A Unified Framework for Multimodal Image Inpainting with Pretrained Diffusion Model

被引：18

作者：

Yang, Shiyuan ^{[1
]}

Chen, Xiaodong ^{[2
]}

Liao, Jing ^{[1
]}

机构：

[1] City Univ Hong Kong, Hong Kong, Peoples R China

[2] Tianjin Univ, Tianjin, Peoples R China

来源：

PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023 | 2023年

关键词：

Image Inpainting; Diffusion Model; Multimodal;

D O I：

10.1145/3581783.3612200

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recently, text-to-image denoising diffusion probabilistic models (DDPMs) have demonstrated impressive image generation capabilities and have also been successfully applied to image inpainting. However, in practice, users often require more control over the inpainting process beyond textual guidance, especially when they want to composite objects with customized appearance, color, shape, and layout. Unfortunately, existing diffusion-based inpainting methods are limited to single-modal guidance and require task-specific training, hindering their cross-modal scalability. To address these limitations, we propose Uni-paint, a unified framework for multi-modal inpainting that offers various modes of guidance, including unconditional, text-driven, stroke-driven, exemplar-driven inpainting, as well as a combination of these modes. Furthermore, our Uni-paint is based on pretrained Stable Diffusion and does not require task-specific training on specific datasets, enabling few-shot generalizability to customized images. We have conducted extensive qualitative and quantitative evaluations that show our approach achieves comparable results to existing single-modal methods while offering multimodal inpainting capabilities not available in other methods. Code is available at https://github.com/ysy31415/unipaint.

引用

页码：3190 / 3199

页数：10

共 16 条

[1] Image Inpainting Based on Improved Tensor Diffusion Model
Cui Xuehong
Pan Zhenkuan
Wei Weibo
ICCSE 2008: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION: ADVANCED COMPUTER TECHNOLOGY, NEW EDUCATION, 2008, : 833 - 837
[2] Nonlocal Curvature-Driven Diffusion Model for Image Inpainting
Li, Li
Yu, Han
FIFTH INTERNATIONAL CONFERENCE ON INFORMATION ASSURANCE AND SECURITY, VOL 2, PROCEEDINGS, 2009, : 513 - 516
[3] Research of Diffusion Coefficient in The Total Variation Image Inpainting Model
He Jing
Zhao Feng-qun
Zhou Qian
Zhang Pei-ru
PROCEEDINGS OF FIRST INTERNATIONAL CONFERENCE OF MODELLING AND SIMULATION, VOL III: MODELLING AND SIMULATION IN ELECTRONICS, COMPUTING, AND BIO-MEDICINE, 2008, : 382 - 386
[4] A New Oriented-Diffusion Image Inpainting Framework for Striped Texture Images
Zhu Yong
Wang Gui
Han Zhike
2009 INTERNATIONAL FORUM ON INFORMATION TECHNOLOGY AND APPLICATIONS, VOL 3, PROCEEDINGS, 2009, : 79 - 84
[5] MULTIGRID METHOD FOR A MODIFIED CURVATURE DRIVEN DIFFUSION MODEL FOR IMAGE INPAINTING
Brito-Loeza, Carlos
Chen, Ke
JOURNAL OF COMPUTATIONAL MATHEMATICS, 2008, 26 (06) : 856 - 875
[6] ORTHOGONAL-DIRECTIONAL FORWARD DIFFUSION IMAGE INPAINTING AND DENOISING MODEL
Wu Jiying Ruan Qiuqi An Gaoyun(Institute of Information Science
Journal of Electronics(China), 2008, (05) : 622 - 628
[7] A Novel Diffusion-Model-Based OCT Image Inpainting Algorithm for Wide Saturation Artifacts
Ji, Bangning
He, Gang
Chen, Zhengguo
Zhao, Ling
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT XIII, 2024, 14437 : 284 - 295
[8] MULTIGRID METHOD FOR A MODIFIED CURVATURE DRIVEN DIFFUSION MODEL FOR IMAGE INPAINTING
Carlos Brito-Loeza
Journal of Computational Mathematics, 2008, 26 (06) : 856 - 875
[9] Adaptive prompt guided unified image restoration with latent diffusion model
Lv, Xiang
Shao, Mingwen
Wan, Yecong
Qiao, Yuanjian
Wang, Changzhong
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 146
[10] BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion
Ju, Xuan
Liu, Xian
Wang, Xintao
Bian, Yuxuan
Shan, Ying
Xu, Qiang
COMPUTER VISION - ECCV 2024, PT XX, 2025, 15078 : 150 - 168

← 1 2 →