UTDM: a universal transformer-based diffusion model for multi-weather-degraded images restoration

被引：0

作者：

Yu, Yongbo ^{[1
]}

Li, Weidong ^{[1
]}

Bai, Linyan ^{[2
,3
]}

Duan, Jinlong ^{[1
]}

Zhang, Xuehai ^{[1
]}

机构：

[1] Henan Univ Technol, Coll Informat Sci & Engn, Zhengzhou 450001, Peoples R China

[2] Chinese Acad Sci, Aerosp Informat Res Inst, Key Lab Digital Earth Sci, Beijing 100094, Peoples R China

[3] Int Res Ctr Big Data Sustainable Dev Goals, Beijing 100094, Peoples R China

来源：

VISUAL COMPUTER | 2024年

关键词：

Diffusion models; Attention mechanism; Weather-degraded image restoration; Image restoration; Vision transformer; RAINDROP REMOVAL; NETWORK;

D O I：

10.1007/s00371-024-03659-x

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Restoring multi-weather-degraded images is significant for subsequent high-level computer vision tasks. However, most existing image restoration algorithms only target single-weather-degraded images, and there are few general models for multi-weather-degraded image restoration. In this paper, we propose a diffusion model for multi-weather-degraded image restoration, namely a universal transformer-based diffusion model (UTDM) for multi-weather-degraded images restoration, by combining the denoising diffusion probability model and Vision Transformer (ViT). First, UTDM uses weather-degraded images as conditions to guide the diffusion model to generate clean background images through reverse sampling. Secondly, we propose a Cascaded Fusion Noise Estimation Transformer (CFNET) based on ViT, which utilizes degraded and noisy images for noise estimation. By introducing cascaded contextual fusion attention in a cascaded manner to compute contextual fusion attention mechanisms for different heads, CFNET explores the commonalities and characteristics of multi-weather-degraded images, fully capturing global and local feature information to improve the model's generalization ability on various weather-degraded images. UTDM outperformed the existing algorithm by 0.14-4.55,dB on the Raindrop-A test set, and improved by 0.99 dB and 1.24 dB compared with Transweather on the Snow100K-L and Test1 test sets. Experimental results show that our method outperforms general and specific restoration task algorithms on synthetic and real-world degraded image datasets. Code and dataset are available at: https://github.com/RHEPI/UTDM.

引用

页码：4269 / 4285

页数：17

共 50 条

[41] DiffSurf: A Transformer-Based Diffusion Model for Generating and Reconstructing 3D Surfaces in Pose
Yoshiyasu, Yusuke
Sun, Leyuan
COMPUTER VISION-ECCV 2024, PT LXXXII, 2025, 15140 : 246 - 264
[42] Explainable Transformer-Based Deep Learning Model for the Detection of Malaria Parasites from Blood Cell Images
Islam, Md. Robiul
Nahiduzzaman, Md.
Goni, Md. Omaer Faruq
Sayeed, Abu
Anower, Md. Shamim
Ahsan, Mominul
Haider, Julfikar
SENSORS, 2022, 22 (12)
[43] TransSurv: Transformer-Based Survival Analysis Model Integrating Histopathological Images and Genomic Data for Colorectal Cancer
Lv, Zhilong
Lin, Yuexiao
Yan, Rui
Wang, Ying
Zhang, Fa
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (06) : 3411 - 3420
[44] OSASformer: A transformer-based model for OSAS screening via multi-source representation fusion
Hou, Yuanyuan
Wang, Bin
Zhang, Chengxi
Wang, Qiang
Li, Jiang
Meng, Pingping
Zhang, Yongxiang
Han, Chao
Hong, Feng
Zhang, Tong
KNOWLEDGE-BASED SYSTEMS, 2025, 316
[45] VPCFormer: A transformer-based multi-view finger vein recognition model and a new benchmark
Zhao, Pengyang
Song, Yizhuo
Wang, Siqi
Xue, Jing-Hao
Zhao, Shuping
Liao, Qingmin
Yang, Wenming
PATTERN RECOGNITION, 2024, 148
[46] Underwater Image Enhancement by Transformer-based Diffusion Model with Non-uniform Sampling for Skip Strategy
Tang, Yi
Kawasaki, Hiroshi
Iwaguchi, Takafumi
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 5419 - 5427
[47] 2D medical image synthesis using transformer-based denoising diffusion probabilistic model
Pan, Shaoyan
Wang, Tonghe
Qiu, Richard L. J.
Axente, Marian
Chang, Chih-Wei
Peng, Junbo
Patel, Ashish B.
Shelton, Joseph
Patel, Sagar A.
Roper, Justin
Yang, Xiaofeng
PHYSICS IN MEDICINE AND BIOLOGY, 2023, 68 (10):
[48] TransFusion: A Practical and Effective Transformer-Based Diffusion Model for 3D Human Motion Prediction
Tian, Sibo
Zheng, Minghui
Liang, Xiao
IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (07): : 6232 - 6239
[49] A Transformer-based Multi-modal Joint Attention Fusion Model for Molecular Property Prediction
Wang, Ke
Zhang, Wei
Liu, Yong
Proceedings - 2023 2023 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2023, 2023, : 4972 - 4974
[50] A transformer-based multi-features fusion model for prediction of conversion in mild cognitive impairment
Zheng, Guowei
Zhang, Yu
Zhao, Ziyang
Wang, Yin
Liu, Xia
Shang, Yingying
Cong, Zhaoyang
Dimitriadis, Stavros I.
Yao, Zhijun
Hu, Bin
METHODS, 2022, 204 : 241 - 248

← 1 2 3 4 5 →