RainFormer: a pyramid transformer for single image deraining

被引:4
|
作者
Yang, Hao [1 ]
Zhou, Dongming [1 ]
Cao, Jinde [2 ,3 ]
Zhao, Qian [1 ]
Li, Miao [1 ]
机构
[1] Yunnan Univ, Sch Informat Sci & Engn, Kunming 650504, Yunnan, Peoples R China
[2] Southeast Univ, Sch Math, Nanjing 210096, Peoples R China
[3] Yonsei Univ, Yonsei Frontier Lab, Seoul 03722, South Korea
来源
JOURNAL OF SUPERCOMPUTING | 2023年 / 79卷 / 06期
基金
中国国家自然科学基金;
关键词
Single image deraining; Vision transformer; Image restoration; Neural networks; Deep learning; RAIN;
D O I
10.1007/s11227-022-04895-5
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Rain impairs the performance of outdoor vision systems, such as automated driving systems and outdoor surveillance systems. Therefore, as an image preprocessing technique, image deraining has great potential for application. Defects of convolutional neural networks (small receptive field and non-adaptive to input content) limit the further improvement of deraining model performance. Recently, a novel neural network, transformer, has demonstrated impressive performance on natural language processing and vision tasks. However, using transformer for image deraining still has some issues: Although transformers have powerful long-range computing capabilities, it lacks the ability to model local features, which is critical for image deraining. In addition, transformer uses fixed-size patches to process images, which leads to pixels at the edges of the patches that cannot use the local features of neighboring pixels to restore rain-free images. In this paper, we propose a novel pyramid transformer for image deraining. To address the first issue, we design a residual-Dconv feed-forward network (RDFN), where depth-wise convolution improves the capability of modeling local features. To address the second issue, we introduce multi-resolution features into the transformer, which allows the transformer to obtain patches with different scales, thus enabling the boundary pixels to utilize local features. Furthermore, we propose a novel multi-scale fusion bridge (MSFB) to effectively integrate the extracted multi-scale features and capture the correlation between different scales. Extensive experiments on synthetic and real-world images demonstrate that the proposed deraining model achieves superior performance, especially the PSNR value achieves 47.55 dB on the SPA-Data dataset. We also further validate the effectiveness of the proposed model on subsequent high-level computer vision tasks.
引用
收藏
页码:6115 / 6140
页数:26
相关论文
共 50 条
  • [41] Single Image Deraining: A Comprehensive Benchmark Analysis
    Li, Siyuan
    Araujo, Iago Breno
    Ren, Wenqi
    Wang, Zhangyang
    Tokuda, Eric K.
    Hirata Junior, Roberto
    Cesar-Junior, Roberto
    Zhang, Jiawan
    Guo, Xiaojie
    Cao, Xiaochun
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3833 - 3842
  • [42] A FAST AND EFFICIENT NETWORK FOR SINGLE IMAGE DERAINING
    Yang, Youzhao
    Lu, Hong
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 2030 - 2034
  • [43] SPT: Spatial Pyramid Transformer for Image Captioning
    Zhang, Haonan
    Zeng, Pengpeng
    Gao, Lianli
    Lyu, Xinyu
    Song, Jingkuan
    Shen, Heng Tao
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (06) : 4829 - 4842
  • [44] BILATERAL RECURRENT NETWORK FOR SINGLE IMAGE DERAINING
    Shang, Wei
    Zhu, Pengfei
    Ren, Dongwei
    Shi, Hong
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 2503 - 2507
  • [45] Multi-level wavelet network based on CNN-Transformer hybrid attention for single image deraining
    Bin Liu
    Siyan Fang
    Neural Computing and Applications, 2023, 35 : 22387 - 22404
  • [46] Multi-level wavelet network based on CNN-Transformer hybrid attention for single image deraining
    Liu, Bin
    Fang, Siyan
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (30): : 22387 - 22404
  • [47] Single Image Deraining With Continuous Rain Density Estimation
    Yu, Lei
    Wang, Bishan
    He, Jingwei
    Xia, Gui-Song
    Yang, Wen
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 443 - 456
  • [48] Dual Heterogeneous Complementary Networks for Single Image Deraining
    Nanba, Yuuto
    Miyata, Hikaru
    Han, Xian-Hua
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 567 - 576
  • [49] An unsupervised generative adversarial network for single image deraining
    Song, Zhiying
    Guo, Yuting
    Ma, Zifan
    Tang, Ruocong
    Liu, Linfeng
    IET IMAGE PROCESSING, 2021, 15 (13) : 3105 - 3117
  • [50] Bijective Multi-mode Deraining on Single Image
    Qiu, Song
    Xu, Wei
    Sun, Li
    Zhai, Gaojie
    Chen, Xinyuan
    Li, Qingli
    2021 14TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2021), 2021,