RDSR:Reparameterized Lightweight Diffusion Model for Image Super-Resolution

被引：0

作者：

Sun, Ouyang ^{[1
]}

Long, Jun ^{[2
]}

Huang, Wenti ^{[3
]}

Yang, Zhan ^{[2
]}

Li, ChenHao ^{[1
]}

机构：

[1] Xinjiang Univ, Urumqi, Peoples R China

[2] Cent South Univ, Changsha, Peoples R China

[3] Hunan Univ Sci & Technol, Xiangtan, Peoples R China

来源：

PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT VIII | 2025年 / 15038卷

关键词：

Neural networks; super-resolution; diffusion model;

D O I：

10.1007/978-981-97-8685-5_7

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The diffusion model has achieved impressive results on low-level tasks, recent studies attempt to design efficient diffusion models for Image Super-Resolution. However, they have mainly focused on reducing the number of parameters and FLops through various network designs. Although these methods can decrease the number of parameters and floating-point operations, they may not necessarily reduce actual running time. To enable DM inference faster on limited computational resources while retaining their quality and flexibility, we propose a Reparameterized Lightweight Diffusion Model SR network (RDSR), which consists of a Latent Prior Encoder (LPE), Reparameterized Decoder (RepD), and diffusion model conditioned on degraded images. Specifically, we first pretrain a LPE, it takes paired HR and LR patches as inputs, mapping input from pixel space to latent space. RepD has a VGG-like inference-time body composed of nothing but a stack of 3x3 convolution and ReLU, while the training-time model has a multi-branch. Our diffusion model serve as a bridge between LPE and RepD: LPE employs distillation loss to supervise reverse diffusion process, the output of reverse process diffusion as a modulator to guide RepD to reconstruct high-quality results. RDSR can effectively reduce GPU memory consumption and improve inference speed. Extensive experiments on SR benchmarks demonstrate the superiority of our RDSR over state-of-the-art DM methods, e.g., RDSR-2.2M achieve 30.11 dB PSNR on DIV2K100 dataset that surpass equal-order DM-based models, while trading-off the parameter, efficiency, and accuracy well: running 55.8x up arrow faster than DiffIR on Intel(R) Xeon(R) Platinum 8255C CPU.

引用

页码：94 / 107

页数：14

共 50 条

[21] Lightweight Image Super-Resolution by Multi-Scale Aggregation
Wan, Jin
Yin, Hui
Liu, Zhihao
Chong, Aixin
Liu, Yanting
IEEE TRANSACTIONS ON BROADCASTING, 2021, 67 (02) : 372 - 382
[22] An Accurate and Lightweight Method for Human Body Image Super-Resolution
Liu, Yunan
Zhang, Shanshan
Xu, Jie
Yang, Jian
Tai, Yu-Wing
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 (30) : 2888 - 2897
[23] Lightweight adaptive enhanced attention network for image super-resolution
Li Wang
Lizhong Xu
Jianqiang Shi
Jie Shen
Fengcheng Huang
Multimedia Tools and Applications, 2022, 81 : 6513 - 6537
[24] Transformer-based image super-resolution and its lightweight
Zhang, Dongxiao
Qi, Tangyao
Gao, Juhao
MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (26) : 68625 - 68649
[25] Lightweight Progressive Residual Clique Network for Image Super-Resolution
Huang, Baojin
He, Zheng
Wang, Zhongyuan
Jiang, Kui
Wang, Guangcheng
2020 IEEE 32ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2020, : 767 - 772
[26] A lightweight generative adversarial network for single image super-resolution
Lu, Xinbiao
Xie, Xupeng
Ye, Chunlin
Xing, Hao
Liu, Zecheng
Cai, Changchun
VISUAL COMPUTER, 2024, 40 (01) : 41 - 52
[27] A lightweight network with bidirectional constraints for single image super-resolution
Chen, Liangliang
Guo, Lin
Cheng, Deqiang
Kou, Qiqi
Gao, Rui
OPTIK, 2021, 239
[28] Diffusion Models, Image Super-Resolution, and Everything: A Survey
Moser, Brian B.
Shanbhag, Arundhati S.
Raue, Federico
Frolov, Stanislav
Palacio, Sebastian
Dengel, Andreas
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
[29] Multi-Modal Prior-Guided Diffusion Model for Blind Image Super-Resolution
Huang, Detian
Song, Jiaxun
Huang, Xiaoqian
Hu, Zhenzhen
Zeng, Huanqiang
IEEE SIGNAL PROCESSING LETTERS, 2025, 32 : 316 - 320
[30] Blind Image Super-Resolution via Domain Translation and Diffusion Models
Zhang, Ying
Li, Yinghua
Zhang, Zifei
Zhang, Chaojun
He, Jinglu
2024 6TH INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING, ICNLP 2024, 2024, : 238 - 244

← 1 2 3 4 5 →