UTDM: a universal transformer-based diffusion model for multi-weather-degraded images restoration

Authors
Yu, Yongbo [1]
Li, Weidong [1]
Bai, Linyan [2, 3]
Duan, Jinlong [1]
Zhang, Xuehai [1]
Affiliations
[1] Henan Univ Technol, Coll Informat Sci & Engn, Zhengzhou 450001, Peoples R China
[2] Chinese Acad Sci, Aerosp Informat Res Inst, Key Lab Digital Earth Sci, Beijing 100094, Peoples R China
[3] Int Res Ctr Big Data Sustainable Dev Goals, Beijing 100094, Peoples R China
Keywords
Diffusion models; Attention mechanism; Weather-degraded image restoration; Image restoration; Vision transformer; Raindrop removal; Network
DOI
10.1007/s00371-024-03659-x
Chinese Library Classification (CLC) number
TP31 [Computer software]
Discipline classification codes
081202; 0835
Abstract
Restoring multi-weather-degraded images is important for downstream high-level computer vision tasks. However, most existing image restoration algorithms target only a single type of weather degradation, and few general models exist for multi-weather-degraded image restoration. In this paper, we propose a universal transformer-based diffusion model (UTDM) for multi-weather-degraded image restoration, which combines the denoising diffusion probabilistic model with the Vision Transformer (ViT). First, UTDM uses weather-degraded images as conditions to guide the diffusion model to generate clean background images through reverse sampling. Second, we propose a Cascaded Fusion Noise Estimation Transformer (CFNET) based on ViT, which uses the degraded and noisy images for noise estimation. By computing contextual fusion attention for different heads in a cascaded manner, CFNET explores the commonalities and characteristics of multi-weather-degraded images and captures both global and local feature information, improving the model's generalization across various weather-degraded images. UTDM outperforms existing algorithms by 0.14-4.55 dB on the Raindrop-A test set and improves on TransWeather by 0.99 dB and 1.24 dB on the Snow100K-L and Test1 test sets, respectively. Experimental results show that our method outperforms both general and task-specific restoration algorithms on synthetic and real-world degraded image datasets. Code and dataset are available at: https://github.com/RHEPI/UTDM.
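The conditional reverse sampling described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation (see the repository linked above for that). It assumes the standard DDPM ancestral sampling update with a linear noise schedule, neither of which the abstract specifies, and it works on a flat list of pixel values using only the Python standard library. The names `make_schedule`, `toy_noise_estimator`, and `reverse_sample` are hypothetical. In particular, `toy_noise_estimator` is only a placeholder for a learned network such as CFNET: it pretends the degraded image is already the clean one, so the sketch shows how the condition enters the sampling loop and nothing about restoration quality.

```python
import math
import random


def make_schedule(num_steps, beta_start=1e-4, beta_end=0.02):
    """Linear beta schedule (an assumption; requires num_steps >= 2)."""
    betas = [beta_start + (beta_end - beta_start) * t / (num_steps - 1)
             for t in range(num_steps)]
    alphas = [1.0 - b for b in betas]
    alpha_bars, prod = [], 1.0
    for a in alphas:
        prod *= a  # cumulative product of alphas up to step t
        alpha_bars.append(prod)
    return betas, alphas, alpha_bars


def toy_noise_estimator(x_t, degraded, t, alpha_bars):
    """Hypothetical placeholder for a learned noise estimator.

    It takes the noisy sample and the degraded image (the condition),
    treats the degraded image as the clean-image guess, and solves
    x_t = sqrt(abar_t) * x0 + sqrt(1 - abar_t) * eps for eps.
    """
    sqrt_ab = math.sqrt(alpha_bars[t])
    sqrt_one_minus_ab = math.sqrt(1.0 - alpha_bars[t])
    return [(x - sqrt_ab * d) / sqrt_one_minus_ab
            for x, d in zip(x_t, degraded)]


def reverse_sample(degraded, num_steps=50, seed=0):
    """Run DDPM ancestral sampling conditioned on `degraded`."""
    rng = random.Random(seed)
    betas, alphas, alpha_bars = make_schedule(num_steps)
    # Start from pure Gaussian noise with the same size as the condition.
    x = [rng.gauss(0.0, 1.0) for _ in degraded]
    for t in reversed(range(num_steps)):
        eps = toy_noise_estimator(x, degraded, t, alpha_bars)
        coef = betas[t] / math.sqrt(1.0 - alpha_bars[t])
        # Posterior mean of x_{t-1} given x_t and the predicted noise.
        mean = [(xi - coef * ei) / math.sqrt(alphas[t])
                for xi, ei in zip(x, eps)]
        if t > 0:
            sigma = math.sqrt(betas[t])  # common variance choice
            x = [m + sigma * rng.gauss(0.0, 1.0) for m in mean]
        else:
            x = mean  # no noise is added at the final step
    return x


if __name__ == "__main__":
    degraded_pixels = [0.2, 0.5, 0.9]
    print(reverse_sample(degraded_pixels, num_steps=20, seed=1))
```

Because the placeholder estimator uses the condition as its clean-image guess, the final step returns the conditioning values up to floating-point error. With a trained estimator in its place, the same loop would instead produce the network's restored image.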
Pages: 4269-4285
Page count: 17