AdaptBIR: Adaptive Blind Image Restoration with latent diffusion prior for higher fidelity

被引：2

作者：

Liu, Yingqi ^{[1
,2
]}

He, Jingwen ^{[3
]}

Liu, Yihao ^{[4
]}

Lin, Xinqi ^{[1
,2
]}

Yu, Fanghua ^{[1
]}

Hu, Jinfan ^{[1
,2
]}

Qiao, Yu ^{[1
,4
]}

Dong, Chao ^{[1
,4
]}

机构：

[1] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen, Peoples R China

[2] Univ Chinese Acad Sci, Beijing, Peoples R China

[3] Chinese Univ Hong Kong, Hong Kong, Hong Kong, Peoples R China

[4] Shanghai Artificial Intelligence Lab, Shanghai, Peoples R China

来源：

PATTERN RECOGNITION | 2024年 / 155卷

基金：

中国国家自然科学基金;

关键词：

Image restoration; Diffusion model; Adaptive adjustment; SUPERRESOLUTION;

D O I：

10.1016/j.patcog.2024.110659

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This work aims to help diffusion models get their footing in the low-level vision field, solving the pain point of insufficient fidelity. Specifically, we propose an Adaptive Blind Image Restoration framework with latent diffusion prior - AdaptBIR, which can adaptively distinguish and address various ranges of degradations. First, we quantitatively categorize images through an Image Quality Assessment (IQA) method. Then, a dual- encoder degradation removal module is employed with the guidance of IQA scores to reach better information preservation. Lastly, we utilize a two-phase controller to handle the reconstruction process in an organized manner. Extensive experiments show that applying such an adaptive framework achieves better performance on both fidelity and perceptual metrics. In this way, AdaptBIR represents more than just a novel framework, it paves the way for a broader application of the diffusion model in blind image restoration tasks.

引用

页数：9

共 40 条

[1] GLEAN: Generative Latent Bank for Large-Factor Image Super-Resolution [J].

Chan, Kelvin C. K. ;

Wang, Xintao ;

Xu, Xiangyu ;

Gu, Jinwei ;

Loy, Chen Change .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :14240-14249

[2]

Chen JY, 2023, Arxiv, DOI arXiv:2305.10855

[3]

Dhariwal P, 2021, ADV NEUR IN, V34

[4] Image Super-Resolution Using Deep Convolutional Networks [J].

Dong, Chao ;

Loy, Chen Change ;

He, Kaiming ;

Tang, Xiaoou .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (02) :295-307

[5] DGD-cGAN: A dual generator for image dewatering and restoration [J].

Gonzalez-Sabbagh, Salma ;

Robles-Kelly, Antonio ;

Gao, Shang .

PATTERN RECOGNITION, 2024, 148

[6]

Ho Jonathan., 2020, ADV NEURAL INFORM PR, V33, P6840

[7] Blind Image Quality Index With Cross-Domain Interaction and Cross-Scale Integration [J].

Hu, Bo ;

Zhu, Guang ;

Li, Leida ;

Gan, Ji ;

Li, Weisheng ;

Gao, Xinbo .

IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 :2729-2739

[8]

Gulrajani I, 2017, ADV NEUR IN, V30

[9] MUSIQ: Multi-scale Image Quality Transformer [J].

Ke, Junjie ;

Wang, Qifei ;

Wang, Yilin ;

Milanfar, Peyman ;

Yang, Feng .

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :5128-5137

[10]

Li AB, 2024, Arxiv, DOI arXiv:2405.04167

← 1 2 3 4 →