A Prior Guided Wavelet-Spatial Dual Attention Transformer Framework for Heavy Rain Image Restoration

被引:7
作者
Zhang, Ronghui [1 ]
Yu, Jiongze [1 ]
Chen, Junzhou [1 ]
Li, Guofa [2 ]
Lin, Liang [3 ]
Wang, Danwei [4 ]
机构
[1] Sun Yat Sen Univ, Sch Intelligent Syst Engn, Guangdong Prov Key Lab Intelligent Transport Syst, Guangzhou 510275, Peoples R China
[2] Chongqing Univ, Coll Mech & Vehicle Engn, Chongqing 400044, Peoples R China
[3] Sun Yat Sen Univ, Sch Comp Sci & Engn, Guangzhou 510275, Peoples R China
[4] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
基金
中国国家自然科学基金;
关键词
Rain; Transformers; Wavelet transforms; Task analysis; Feature extraction; Image restoration; Visualization; Heavy rain; image deraining; transformer; wavelet attention; spatial attention; REMOVAL; MODEL;
D O I
10.1109/TMM.2024.3359480
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Heavy rain significantly reduces image visibility, hindering tasks like autonomous driving and video surveillance. Many existing rain removal methods, while effective in light rain, falter under heavy rain due to their reliance on purely spatial features. Recognizing this challenge, we introduce the Wavelet-Spatial Dual Attention Transformer Framework (WSDformer). This innovative architecture adeptly captures both frequency and spatial characteristics, anchored by the wavelet-spatial dual attention (WSDA) mechanism. While the spatial attention zeroes in on intricate local details, the wavelet attention leverages wavelet decomposition to encompass diverse frequency information, augmenting the spatial representations. Furthermore, addressing the persistent issue of incomplete structural detail restoration, we integrate the PriorFormer Block (PFB). This unique module, underpinned by the Prior Fusion Attention (PFA), synergizes residual channel prior features with input features, thereby enhancing background structures and guiding precise rain feature extraction. To navigate the intrinsic constraints of U-shaped transformers, such as semantic discontinuities and subdued multi-scale interactions from skip connections, our Cross Interaction U-Shaped Transformer Network is introduced. This design empowers superior semantic layers to streamline the extraction of their lower-tier counterparts, optimizing network learning. Empirical analysis reveals our method's leading prowess across rainy image datasets and achieves state-of-the-art performance, with notable supremacy in heavy rainfall conditions. This superiority extends to diverse visual challenges and real-world rainy scenarios, affirming its broad applicability and robustness.
引用
收藏
页码:7043 / 7057
页数:15
相关论文
共 44 条
[1]   Restoring Snow-Degraded Single Images With Wavelet in Vision Transformer [J].
Agbodike, Obinna ;
Chen, Jenhui .
IEEE ACCESS, 2023, 11 :99470-99480
[2]   Beyond Deep Residual Learning for Image Restoration: Persistent Homology-Guided Manifold Simplification [J].
Bae, Woong ;
Yoo, Jaejun ;
Ye, Jong Chul .
2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, :1141-1149
[3]   Pre-Trained Image Processing Transformer [J].
Chen, Hanting ;
Wang, Yunhe ;
Guo, Tianyu ;
Xu, Chang ;
Deng, Yiping ;
Liu, Zhenhua ;
Ma, Siwei ;
Xu, Chunjing ;
Xu, Chao ;
Gao, Wen .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :12294-12305
[4]   Learning A Sparse Transformer Network for Effective Image Deraining [J].
Chen, Xiang ;
Li, Hao ;
Li, Mingqiang ;
Pan, Jinshan .
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, :5896-5905
[5]   A Generalized Low-Rank Appearance Model for Spatio-Temporally Correlated Rain Streaks [J].
Chen, Yi-Lei ;
Hsu, Chiou-Ting .
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, :1968-1975
[6]  
Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929
[7]   QSAM-Net: Rain Streak Removal by Quaternion Neural Network With Self-Attention Module [J].
Frants, Vladimir ;
Agaian, Sos ;
Panetta, Karen .
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 :789-798
[8]   Removing rain from single images via a deep detail network [J].
Fu, Xueyang ;
Huang, Jiabin ;
Zeng, Delu ;
Huang, Yue ;
Ding, Xinghao ;
Paisley, John .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1715-1723
[9]   Joint Convolutional Analysis and Synthesis Sparse Representation for Single Image Layer Separation [J].
Gu, Shuhang ;
Meng, Deyu ;
Zuo, Wangmeng ;
Zhang, Lei .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :1717-1725
[10]   Deep Wavelet Prediction for Image Super-resolution [J].
Guo, Tiantong ;
Mousavi, Hojjat Seyed ;
Vu, Tiep Huu ;
Monga, Vishal .
2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, :1100-1109