Generative Diffusion Prior for Unified Image Restoration and Enhancement

被引:83
作者
Fei, Ben [1 ,2 ]
Lyu, Zhaoyang [2 ]
Pan, Liang [3 ]
Zhang, Junzhe [3 ]
Yang, Weidong [1 ]
Luo, Tianyue [1 ]
Zhang, Bo [2 ]
Dai, Bo [2 ]
机构
[1] Fudan Univ, Shanghai, Peoples R China
[2] Shanghai AI Lab, Shanghai, Peoples R China
[3] Nanyang Technol Univ, S Lab, Singapore, Singapore
来源
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2023年
关键词
EXPOSURE-FUSION; NETWORK; RECONSTRUCTION; FIDELITY;
D O I
10.1109/CVPR52729.2023.00958
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing image restoration methods mostly leverage the posterior distribution of natural images. However, they often assume known degradation and also require supervised training, which restricts their adaptation to complex real applications. In this work, we propose the Generative Diffusion Prior (GDP) to effectively model the posterior distributions in an unsupervised sampling manner. GDP utilizes a pre-train denoising diffusion generative model (DDPM) for solving linear inverse, non-linear, or blind problems. Specifically, GDP systematically explores a protocol of conditional guidance, which is verified more practical than the commonly used guidance way. Furthermore, GDP is strength at optimizing the parameters of degradation model during the denoising process, achieving blind image restoration. Besides, we devise hierarchical guidance and patch-based methods, enabling the GDP to generate images of arbitrary resolutions. Experimentally, we demonstrate GDP 's versatility on several image datasets for linear problems, such as super-resolution, deblurring, inpainting, and colorization, as well as non-linear and blind issues, such as low-light enhancement and HDR image recovery. GDP outperforms the current leading unsupervised methods on the diverse benchmarks in reconstruction quality and perceptual quality. Moreover, GDP also generalizes well for natural images or synthesized images with arbitrary sizes from various tasks out of the distribution of the ImageNet training set. The project page is available at https://generativediffusionprior.github.io/
引用
收藏
页码:9935 / 9946
页数:12
相关论文
共 100 条
[41]  
Li Chongyi, 2021, ARXIV210300860
[42]   SwinIR: Image Restoration Using Swin Transformer [J].
Liang, Jingyun ;
Cao, Jiezhang ;
Sun, Guolei ;
Zhang, Kai ;
Van Gool, Luc ;
Timofte, Radu .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, :1833-1844
[43]   Flow-based Kernel Prior with Application to Blind Super-Resolution [J].
Liang, Jingyun ;
Zhang, Kai ;
Gu, Shuhang ;
Gool, Luc Van ;
Timofte, Radu .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :10596-10605
[44]   DSLR: Deep Stacked Laplacian Restorer for Low-Light Image Enhancement [J].
Lim, Seokjae ;
Kim, Wonjun .
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 :4272-4284
[45]  
Liu Xihui, 2021, ARXIV211205744
[46]   LLNet: A deep autoencoder approach to natural low-light image enhancement [J].
Lore, Kin Gwn ;
Akintayo, Adedotun ;
Sarkar, Soumik .
PATTERN RECOGNITION, 2017, 61 :650-662
[47]   TBEFN: A Two-Branch Exposure-Fusion Network for Low-Light Image Enhancement [J].
Lu, Kun ;
Zhang, Lihong .
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 :4093-4105
[48]  
Lv F, 2018, BMVC, V220, P4
[49]  
Meng Chenlin, 2021, ARXIV210801073
[50]   PULSE: Self-Supervised Photo Upsampling via Latent Space Exploration of Generative Models [J].
Menon, Sachit ;
Damian, Alexandru ;
Hu, Shijia ;
Ravi, Nikhil ;
Rudin, Cynthia .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :2434-2442