RainFormer: a pyramid transformer for single image deraining

被引：4

作者：

Yang, Hao ^{[1
]}

Zhou, Dongming ^{[1
]}

Cao, Jinde ^{[2
,3
]}

Zhao, Qian ^{[1
]}

Li, Miao ^{[1
]}

机构：

[1] Yunnan Univ, Sch Informat Sci & Engn, Kunming 650504, Yunnan, Peoples R China

[2] Southeast Univ, Sch Math, Nanjing 210096, Peoples R China

[3] Yonsei Univ, Yonsei Frontier Lab, Seoul 03722, South Korea

来源：

JOURNAL OF SUPERCOMPUTING | 2023年 / 79卷 / 06期

基金：

中国国家自然科学基金;

关键词：

Single image deraining; Vision transformer; Image restoration; Neural networks; Deep learning; RAIN;

D O I：

10.1007/s11227-022-04895-5

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Rain impairs the performance of outdoor vision systems, such as automated driving systems and outdoor surveillance systems. Therefore, as an image preprocessing technique, image deraining has great potential for application. Defects of convolutional neural networks (small receptive field and non-adaptive to input content) limit the further improvement of deraining model performance. Recently, a novel neural network, transformer, has demonstrated impressive performance on natural language processing and vision tasks. However, using transformer for image deraining still has some issues: Although transformers have powerful long-range computing capabilities, it lacks the ability to model local features, which is critical for image deraining. In addition, transformer uses fixed-size patches to process images, which leads to pixels at the edges of the patches that cannot use the local features of neighboring pixels to restore rain-free images. In this paper, we propose a novel pyramid transformer for image deraining. To address the first issue, we design a residual-Dconv feed-forward network (RDFN), where depth-wise convolution improves the capability of modeling local features. To address the second issue, we introduce multi-resolution features into the transformer, which allows the transformer to obtain patches with different scales, thus enabling the boundary pixels to utilize local features. Furthermore, we propose a novel multi-scale fusion bridge (MSFB) to effectively integrate the extracted multi-scale features and capture the correlation between different scales. Extensive experiments on synthetic and real-world images demonstrate that the proposed deraining model achieves superior performance, especially the PSNR value achieves 47.55 dB on the SPA-Data dataset. We also further validate the effectiveness of the proposed model on subsequent high-level computer vision tasks.

引用

页码：6115 / 6140

页数：26

共 50 条

[41] Single Image Deraining: A Comprehensive Benchmark Analysis
Li, Siyuan
Araujo, Iago Breno
Ren, Wenqi
Wang, Zhangyang
Tokuda, Eric K.
Hirata Junior, Roberto
Cesar-Junior, Roberto
Zhang, Jiawan
Guo, Xiaojie
Cao, Xiaochun
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3833 - 3842
[42] A FAST AND EFFICIENT NETWORK FOR SINGLE IMAGE DERAINING
Yang, Youzhao
Lu, Hong
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 2030 - 2034
[43] SPT: Spatial Pyramid Transformer for Image Captioning
Zhang, Haonan
Zeng, Pengpeng
Gao, Lianli
Lyu, Xinyu
Song, Jingkuan
Shen, Heng Tao
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (06) : 4829 - 4842
[44] BILATERAL RECURRENT NETWORK FOR SINGLE IMAGE DERAINING
Shang, Wei
Zhu, Pengfei
Ren, Dongwei
Shi, Hong
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 2503 - 2507
[45] Multi-level wavelet network based on CNN-Transformer hybrid attention for single image deraining
Bin Liu
Siyan Fang
Neural Computing and Applications, 2023, 35 : 22387 - 22404
[46] Multi-level wavelet network based on CNN-Transformer hybrid attention for single image deraining
Liu, Bin
Fang, Siyan
NEURAL COMPUTING & APPLICATIONS, 2023, 35 (30): : 22387 - 22404
[47] Single Image Deraining With Continuous Rain Density Estimation
Yu, Lei
Wang, Bishan
He, Jingwei
Xia, Gui-Song
Yang, Wen
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 443 - 456
[48] Dual Heterogeneous Complementary Networks for Single Image Deraining
Nanba, Yuuto
Miyata, Hikaru
Han, Xian-Hua
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 567 - 576
[49] An unsupervised generative adversarial network for single image deraining
Song, Zhiying
Guo, Yuting
Ma, Zifan
Tang, Ruocong
Liu, Linfeng
IET IMAGE PROCESSING, 2021, 15 (13) : 3105 - 3117
[50] Bijective Multi-mode Deraining on Single Image
Qiu, Song
Xu, Wei
Sun, Li
Zhai, Gaojie
Chen, Xinyuan
Li, Qingli
2021 14TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2021), 2021,

← 1 2 3 4 5 →