Adaptive prompt guided unified image restoration with latent diffusion model

Cited by: 0

Authors:

Lv, Xiang [1]

Shao, Mingwen [1]

Wan, Yecong [1]

Qiao, Yuanjian [1]

Wang, Changzhong [2]

Affiliations:

[1] China Univ Petr East China, Qingdao Inst Software, Coll Comp Sci & Technol, State Key Lab Chem Safety, Qingdao 266580, Peoples R China

[2] Bohai Univ, Coll Math, Jinzhou 121013, Peoples R China

Source:

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE | 2025 / Vol. 146

Funding:

National Natural Science Foundation of China;

Keywords:

Diffusion model; Image restoration; Degradation prior; Prompt learning; NETWORK; REMOVAL;

DOI:

10.1016/j.engappai.2025.110267

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

Recently, Diffusion Models (DMs) have achieved remarkable success in image restoration tasks. However, because they lack a degradation prior, DMs are neither flexible nor adaptive when dealing with multiple uncertain forms of image degradation (e.g., noise, blur, and so on), which results in undesirable boundary artifacts. In addition, DMs require a large number of inference iterations to restore a clean image, which consumes massive computational resources. To address the aforementioned limitations, we propose an adaptive unified two-stage restoration method based on a latent diffusion model, termed APDiff, that can effectively and adaptively handle real-world images with various degradation types. Specifically, in Stage I, we pre-train a Degradation Adaptive Prompt Learning Network (DAPLNet-S1) to obtain a degradation prompt by adaptively exploring the differences between low-quality (LQ) and ground-truth (GT) images. We then encode this prompt into the latent space as key discriminative information for different degraded images. In Stage II, we propose a latent diffusion model that directly estimates a degradation prompt similar to that of the pre-trained DAPLNet-S1 using only LQ images. Meanwhile, to restore images with different degradations effectively, we design a Prompt Guided Fourier Transformer Restorer that integrates the extracted prompt and enhances the model's ability to characterize global frequency features and local spatial information. Since the generated prompts are low-dimensional latent vector representations, the computational complexity of the diffusion model is significantly reduced. As a result, during inference, our method takes only 0.09 s to restore an image from SPA+. Extensive experiments demonstrate that APDiff achieves state-of-the-art performance on multi-degradation tasks.
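The efficiency argument in the abstract (diffusion over a low-dimensional prompt vector rather than over image pixels) can be illustrated with a minimal sketch. The abstract does not state the sampler, noise schedule, number of steps, or prompt dimension, so the code below substitutes standard DDPM ancestral sampling with a linear beta schedule; the names `linear_beta_schedule`, `sample_prompt`, and `zero_denoiser`, and all default values, are hypothetical and not taken from the paper.

```python
import math
import random


def linear_beta_schedule(num_steps, beta_start=0.1, beta_end=0.99):
    """Linearly spaced noise variances (assumed values, not from the paper)."""
    if num_steps == 1:
        return [beta_end]
    step = (beta_end - beta_start) / (num_steps - 1)
    return [beta_start + i * step for i in range(num_steps)]


def sample_prompt(denoiser, condition, dim, num_steps=4, seed=0):
    """Standard DDPM reverse process run on a small latent vector.

    denoiser(z, t, condition) is a placeholder for a trained network that
    predicts the noise in z at step t, conditioned on LQ-image features.
    """
    rng = random.Random(seed)
    betas = linear_beta_schedule(num_steps)
    alphas = [1.0 - b for b in betas]

    # Cumulative products of alphas (alpha-bar in DDPM notation).
    alpha_bars = []
    running = 1.0
    for a in alphas:
        running *= a
        alpha_bars.append(running)

    # Start from Gaussian noise in the low-dimensional prompt space.
    z = [rng.gauss(0.0, 1.0) for _ in range(dim)]

    for t in reversed(range(num_steps)):
        eps = denoiser(z, t, condition)
        coef = (1.0 - alphas[t]) / math.sqrt(1.0 - alpha_bars[t])
        mean = [(zi - coef * ei) / math.sqrt(alphas[t]) for zi, ei in zip(z, eps)]
        if t > 0:
            # Add fresh noise on every step except the last one.
            sigma = math.sqrt(betas[t])
            z = [m + sigma * rng.gauss(0.0, 1.0) for m in mean]
        else:
            z = mean
    return z


def zero_denoiser(z, t, condition):
    """Stand-in for a trained noise predictor; always predicts zero noise."""
    return [0.0] * len(z)


prompt = sample_prompt(zero_denoiser, condition=None, dim=8)
```

Each reverse step here touches only `dim` numbers, so the per-step cost depends on the prompt dimension rather than on the image resolution. In a real system the restorer network would then consume the sampled prompt; that part is not sketched here.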

Pages: 14
