Uni-paint: A Unified Framework for Multimodal Image Inpainting with Pretrained Diffusion Model

Cited by: 18
Authors
Yang, Shiyuan [1]
Chen, Xiaodong [2]
Liao, Jing [1]
Affiliations
[1] City Univ Hong Kong, Hong Kong, Peoples R China
[2] Tianjin Univ, Tianjin, Peoples R China
Source
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023 | 2023
Keywords
Image Inpainting; Diffusion Model; Multimodal
DOI
10.1145/3581783.3612200
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Recently, text-to-image denoising diffusion probabilistic models (DDPMs) have demonstrated impressive image generation capabilities and have also been successfully applied to image inpainting. However, in practice, users often require more control over the inpainting process beyond textual guidance, especially when they want to composite objects with customized appearance, color, shape, and layout. Unfortunately, existing diffusion-based inpainting methods are limited to single-modal guidance and require task-specific training, hindering their cross-modal scalability. To address these limitations, we propose Uni-paint, a unified framework for multi-modal inpainting that offers various modes of guidance, including unconditional, text-driven, stroke-driven, exemplar-driven inpainting, as well as a combination of these modes. Furthermore, our Uni-paint is based on pretrained Stable Diffusion and does not require task-specific training on specific datasets, enabling few-shot generalizability to customized images. We have conducted extensive qualitative and quantitative evaluations that show our approach achieves comparable results to existing single-modal methods while offering multimodal inpainting capabilities not available in other methods. Code is available at https://github.com/ysy31415/unipaint.
Pages: 3190-3199
Page count: 10