RDSR:Reparameterized Lightweight Diffusion Model for Image Super-Resolution

被引:0
|
作者
Sun, Ouyang [1 ]
Long, Jun [2 ]
Huang, Wenti [3 ]
Yang, Zhan [2 ]
Li, ChenHao [1 ]
机构
[1] Xinjiang Univ, Urumqi, Peoples R China
[2] Cent South Univ, Changsha, Peoples R China
[3] Hunan Univ Sci & Technol, Xiangtan, Peoples R China
来源
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT VIII | 2025年 / 15038卷
关键词
Neural networks; super-resolution; diffusion model;
D O I
10.1007/978-981-97-8685-5_7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The diffusion model has achieved impressive results on low-level tasks, recent studies attempt to design efficient diffusion models for Image Super-Resolution. However, they have mainly focused on reducing the number of parameters and FLops through various network designs. Although these methods can decrease the number of parameters and floating-point operations, they may not necessarily reduce actual running time. To enable DM inference faster on limited computational resources while retaining their quality and flexibility, we propose a Reparameterized Lightweight Diffusion Model SR network (RDSR), which consists of a Latent Prior Encoder (LPE), Reparameterized Decoder (RepD), and diffusion model conditioned on degraded images. Specifically, we first pretrain a LPE, it takes paired HR and LR patches as inputs, mapping input from pixel space to latent space. RepD has a VGG-like inference-time body composed of nothing but a stack of 3x3 convolution and ReLU, while the training-time model has a multi-branch. Our diffusion model serve as a bridge between LPE and RepD: LPE employs distillation loss to supervise reverse diffusion process, the output of reverse process diffusion as a modulator to guide RepD to reconstruct high-quality results. RDSR can effectively reduce GPU memory consumption and improve inference speed. Extensive experiments on SR benchmarks demonstrate the superiority of our RDSR over state-of-the-art DM methods, e.g., RDSR-2.2M achieve 30.11 dB PSNR on DIV2K100 dataset that surpass equal-order DM-based models, while trading-off the parameter, efficiency, and accuracy well: running 55.8x up arrow faster than DiffIR on Intel(R) Xeon(R) Platinum 8255C CPU.
引用
收藏
页码:94 / 107
页数:14
相关论文
共 50 条
  • [21] Lightweight Image Super-Resolution by Multi-Scale Aggregation
    Wan, Jin
    Yin, Hui
    Liu, Zhihao
    Chong, Aixin
    Liu, Yanting
    IEEE TRANSACTIONS ON BROADCASTING, 2021, 67 (02) : 372 - 382
  • [22] An Accurate and Lightweight Method for Human Body Image Super-Resolution
    Liu, Yunan
    Zhang, Shanshan
    Xu, Jie
    Yang, Jian
    Tai, Yu-Wing
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 (30) : 2888 - 2897
  • [23] Lightweight adaptive enhanced attention network for image super-resolution
    Li Wang
    Lizhong Xu
    Jianqiang Shi
    Jie Shen
    Fengcheng Huang
    Multimedia Tools and Applications, 2022, 81 : 6513 - 6537
  • [24] Transformer-based image super-resolution and its lightweight
    Zhang, Dongxiao
    Qi, Tangyao
    Gao, Juhao
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (26) : 68625 - 68649
  • [25] Lightweight Progressive Residual Clique Network for Image Super-Resolution
    Huang, Baojin
    He, Zheng
    Wang, Zhongyuan
    Jiang, Kui
    Wang, Guangcheng
    2020 IEEE 32ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2020, : 767 - 772
  • [26] A lightweight generative adversarial network for single image super-resolution
    Lu, Xinbiao
    Xie, Xupeng
    Ye, Chunlin
    Xing, Hao
    Liu, Zecheng
    Cai, Changchun
    VISUAL COMPUTER, 2024, 40 (01) : 41 - 52
  • [27] A lightweight network with bidirectional constraints for single image super-resolution
    Chen, Liangliang
    Guo, Lin
    Cheng, Deqiang
    Kou, Qiqi
    Gao, Rui
    OPTIK, 2021, 239
  • [28] Diffusion Models, Image Super-Resolution, and Everything: A Survey
    Moser, Brian B.
    Shanbhag, Arundhati S.
    Raue, Federico
    Frolov, Stanislav
    Palacio, Sebastian
    Dengel, Andreas
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [29] Multi-Modal Prior-Guided Diffusion Model for Blind Image Super-Resolution
    Huang, Detian
    Song, Jiaxun
    Huang, Xiaoqian
    Hu, Zhenzhen
    Zeng, Huanqiang
    IEEE SIGNAL PROCESSING LETTERS, 2025, 32 : 316 - 320
  • [30] Blind Image Super-Resolution via Domain Translation and Diffusion Models
    Zhang, Ying
    Li, Yinghua
    Zhang, Zifei
    Zhang, Chaojun
    He, Jinglu
    2024 6TH INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING, ICNLP 2024, 2024, : 238 - 244