UTDM: a universal transformer-based diffusion model for multi-weather-degraded images restoration

被引:0
|
作者
Yu, Yongbo [1 ]
Li, Weidong [1 ]
Bai, Linyan [2 ,3 ]
Duan, Jinlong [1 ]
Zhang, Xuehai [1 ]
机构
[1] Henan Univ Technol, Coll Informat Sci & Engn, Zhengzhou 450001, Peoples R China
[2] Chinese Acad Sci, Aerosp Informat Res Inst, Key Lab Digital Earth Sci, Beijing 100094, Peoples R China
[3] Int Res Ctr Big Data Sustainable Dev Goals, Beijing 100094, Peoples R China
来源
关键词
Diffusion models; Attention mechanism; Weather-degraded image restoration; Image restoration; Vision transformer; RAINDROP REMOVAL; NETWORK;
D O I
10.1007/s00371-024-03659-x
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Restoring multi-weather-degraded images is significant for subsequent high-level computer vision tasks. However, most existing image restoration algorithms only target single-weather-degraded images, and there are few general models for multi-weather-degraded image restoration. In this paper, we propose a diffusion model for multi-weather-degraded image restoration, namely a universal transformer-based diffusion model (UTDM) for multi-weather-degraded images restoration, by combining the denoising diffusion probability model and Vision Transformer (ViT). First, UTDM uses weather-degraded images as conditions to guide the diffusion model to generate clean background images through reverse sampling. Secondly, we propose a Cascaded Fusion Noise Estimation Transformer (CFNET) based on ViT, which utilizes degraded and noisy images for noise estimation. By introducing cascaded contextual fusion attention in a cascaded manner to compute contextual fusion attention mechanisms for different heads, CFNET explores the commonalities and characteristics of multi-weather-degraded images, fully capturing global and local feature information to improve the model's generalization ability on various weather-degraded images. UTDM outperformed the existing algorithm by 0.14-4.55,dB on the Raindrop-A test set, and improved by 0.99 dB and 1.24 dB compared with Transweather on the Snow100K-L and Test1 test sets. Experimental results show that our method outperforms general and specific restoration task algorithms on synthetic and real-world degraded image datasets. Code and dataset are available at: https://github.com/RHEPI/UTDM.
引用
收藏
页码:4269 / 4285
页数:17
相关论文
共 50 条
  • [41] DiffSurf: A Transformer-Based Diffusion Model for Generating and Reconstructing 3D Surfaces in Pose
    Yoshiyasu, Yusuke
    Sun, Leyuan
    COMPUTER VISION-ECCV 2024, PT LXXXII, 2025, 15140 : 246 - 264
  • [42] Explainable Transformer-Based Deep Learning Model for the Detection of Malaria Parasites from Blood Cell Images
    Islam, Md. Robiul
    Nahiduzzaman, Md.
    Goni, Md. Omaer Faruq
    Sayeed, Abu
    Anower, Md. Shamim
    Ahsan, Mominul
    Haider, Julfikar
    SENSORS, 2022, 22 (12)
  • [43] TransSurv: Transformer-Based Survival Analysis Model Integrating Histopathological Images and Genomic Data for Colorectal Cancer
    Lv, Zhilong
    Lin, Yuexiao
    Yan, Rui
    Wang, Ying
    Zhang, Fa
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (06) : 3411 - 3420
  • [44] OSASformer: A transformer-based model for OSAS screening via multi-source representation fusion
    Hou, Yuanyuan
    Wang, Bin
    Zhang, Chengxi
    Wang, Qiang
    Li, Jiang
    Meng, Pingping
    Zhang, Yongxiang
    Han, Chao
    Hong, Feng
    Zhang, Tong
    KNOWLEDGE-BASED SYSTEMS, 2025, 316
  • [45] VPCFormer: A transformer-based multi-view finger vein recognition model and a new benchmark
    Zhao, Pengyang
    Song, Yizhuo
    Wang, Siqi
    Xue, Jing-Hao
    Zhao, Shuping
    Liao, Qingmin
    Yang, Wenming
    PATTERN RECOGNITION, 2024, 148
  • [46] Underwater Image Enhancement by Transformer-based Diffusion Model with Non-uniform Sampling for Skip Strategy
    Tang, Yi
    Kawasaki, Hiroshi
    Iwaguchi, Takafumi
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 5419 - 5427
  • [47] 2D medical image synthesis using transformer-based denoising diffusion probabilistic model
    Pan, Shaoyan
    Wang, Tonghe
    Qiu, Richard L. J.
    Axente, Marian
    Chang, Chih-Wei
    Peng, Junbo
    Patel, Ashish B.
    Shelton, Joseph
    Patel, Sagar A.
    Roper, Justin
    Yang, Xiaofeng
    PHYSICS IN MEDICINE AND BIOLOGY, 2023, 68 (10):
  • [48] TransFusion: A Practical and Effective Transformer-Based Diffusion Model for 3D Human Motion Prediction
    Tian, Sibo
    Zheng, Minghui
    Liang, Xiao
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (07): : 6232 - 6239
  • [49] A Transformer-based Multi-modal Joint Attention Fusion Model for Molecular Property Prediction
    Wang, Ke
    Zhang, Wei
    Liu, Yong
    Proceedings - 2023 2023 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2023, 2023, : 4972 - 4974
  • [50] A transformer-based multi-features fusion model for prediction of conversion in mild cognitive impairment
    Zheng, Guowei
    Zhang, Yu
    Zhao, Ziyang
    Wang, Yin
    Liu, Xia
    Shang, Yingying
    Cong, Zhaoyang
    Dimitriadis, Stavros I.
    Yao, Zhijun
    Hu, Bin
    METHODS, 2022, 204 : 241 - 248