Adaptive prompt guided unified image restoration with latent diffusion model

被引:0
|
作者
Lv, Xiang [1 ]
Shao, Mingwen [1 ]
Wan, Yecong [1 ]
Qiao, Yuanjian [1 ]
Wang, Changzhong [2 ]
机构
[1] China Univ Petr East China, Qingdao Inst Software, Coll Comp Sci & Technol, State Key Lab Chem Safety, Qingdao 266580, Peoples R China
[2] Bohai Univ, Coll Math, Jinzhou 121013, Peoples R China
基金
中国国家自然科学基金;
关键词
Diffusion model; Image restoration; Degradation prior; Prompt learning; NETWORK; REMOVAL;
D O I
10.1016/j.engappai.2025.110267
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recently, Diffusion Models (DMs) have witnessed the remarkable success in image restoration tasks. However, DMs are not flexible and adaptive in dealing with uncertain multiple forms of image degradation (e.g., noise, blur and soon) due to the lack of degradation prior, resulting in undesirable boundary artifacts. In addition, DMs require a large number of inference iterations to restore clean image, which consumes massive computational resources. To address the forementioned limitations, we propose an adaptive unified two-stage restoration method based on latent diffusion model, termed APDiff that can effectively and adaptively handle real-world images with various degradation types. Specifically, in Stage I, we pre-train a Degradation Adaptive Prompt Learning Network (DAPLNet-S1) to obtain degradation prompt by exploring differences between low quality (LQ) and ground truth (GT) images adaptively. Then, we encode it into the latent space as key discriminant information for different degraded images. In Stage II, we propose a latent diffusion model to directly estimate a degradation prompt similar in pre-train DAPLNet-S1 only using LQ images. Meanwhile, to restore different degradation images effectively, we design a Prompt Guided Fourier Transformer Restorer to integrate the extracted prompt, which enhances characterization ability of model for global frequency feature and local spatial information. Since the generated prompts are low-dimensional latent vector representations, this can significantly reduce computational complexity of diffusion model. Thus, during the inference process, our method takes only 0.09 s to restore an image of SPA+. Extensive experiments demonstrate that APDiff achieves state-of-the-art performance for multi-degradation tasks.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Universal Image Restoration with Text Prompt Diffusion
    Yu, Bing
    Fan, Zhenghui
    Xiang, Xue
    Chen, Jiahui
    Huang, Dongjin
    SENSORS, 2024, 24 (12)
  • [2] AdaptBIR: Adaptive Blind Image Restoration with latent diffusion prior for higher fidelity
    Liu, Yingqi
    He, Jingwen
    Liu, Yihao
    Lin, Xinqi
    Yu, Fanghua
    Hu, Jinfan
    Qiao, Yu
    Dong, Chao
    PATTERN RECOGNITION, 2024, 155
  • [3] Seeing the Unseen: A Frequency Prompt Guided Transformer for Image Restoration
    Zhou, Shihao
    Pan, Jinshan
    Shi, Jinglei
    Chen, Duosheng
    Qu, Lishen
    Yang, Jufeng
    COMPUTER VISION - ECCV 2024, PT XVI, 2025, 15074 : 246 - 264
  • [4] Adaptive bidirectional diffusion for image restoration
    Fu ShuJun
    Zhang CaiMing
    SCIENCE CHINA-INFORMATION SCIENCES, 2010, 53 (12) : 2452 - 2460
  • [5] Adaptive bidirectional diffusion for image restoration
    FU ShuJun 1
    2 School of Mathematics
    ScienceChina(InformationSciences), 2010, 53 (12) : 2452 - 2460
  • [6] UniFRD: A Unified Method for Facial Image Restoration Based on Diffusion Probabilistic Model
    Jian, Muwei
    Wang, Rui
    Yu, Xiaoyang
    Xu, Feng
    Yu, Hui
    Lam, Kin-Man
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (12) : 13494 - 13506
  • [7] Adaptive bidirectional diffusion for image restoration
    ShuJun Fu
    CaiMing Zhang
    Science China Information Sciences, 2010, 53 : 2452 - 2460
  • [8] GuidePaint: lossless image-guided diffusion model for ancient mural image restoration
    Jialv Hu
    Ying Yu
    Qixue Zhou
    npj Heritage Science, 13 (1):
  • [9] Generative Diffusion Prior for Unified Image Restoration and Enhancement
    Fei, Ben
    Lyu, Zhaoyang
    Pan, Liang
    Zhang, Junzhe
    Yang, Weidong
    Luo, Tianyue
    Zhang, Bo
    Dai, Bo
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 9935 - 9946
  • [10] DLDiff: Image Detail-Guided Latent Diffusion Model for Low-Light Image Enhancement
    Xue, Minglong
    He, Yanyi
    He, Jinhong
    Zhong, Senming
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 2255 - 2259