RainFormer: a pyramid transformer for single image deraining

被引:4
|
作者
Yang, Hao [1 ]
Zhou, Dongming [1 ]
Cao, Jinde [2 ,3 ]
Zhao, Qian [1 ]
Li, Miao [1 ]
机构
[1] Yunnan Univ, Sch Informat Sci & Engn, Kunming 650504, Yunnan, Peoples R China
[2] Southeast Univ, Sch Math, Nanjing 210096, Peoples R China
[3] Yonsei Univ, Yonsei Frontier Lab, Seoul 03722, South Korea
来源
JOURNAL OF SUPERCOMPUTING | 2023年 / 79卷 / 06期
基金
中国国家自然科学基金;
关键词
Single image deraining; Vision transformer; Image restoration; Neural networks; Deep learning; RAIN;
D O I
10.1007/s11227-022-04895-5
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Rain impairs the performance of outdoor vision systems, such as automated driving systems and outdoor surveillance systems. Therefore, as an image preprocessing technique, image deraining has great potential for application. Defects of convolutional neural networks (small receptive field and non-adaptive to input content) limit the further improvement of deraining model performance. Recently, a novel neural network, transformer, has demonstrated impressive performance on natural language processing and vision tasks. However, using transformer for image deraining still has some issues: Although transformers have powerful long-range computing capabilities, it lacks the ability to model local features, which is critical for image deraining. In addition, transformer uses fixed-size patches to process images, which leads to pixels at the edges of the patches that cannot use the local features of neighboring pixels to restore rain-free images. In this paper, we propose a novel pyramid transformer for image deraining. To address the first issue, we design a residual-Dconv feed-forward network (RDFN), where depth-wise convolution improves the capability of modeling local features. To address the second issue, we introduce multi-resolution features into the transformer, which allows the transformer to obtain patches with different scales, thus enabling the boundary pixels to utilize local features. Furthermore, we propose a novel multi-scale fusion bridge (MSFB) to effectively integrate the extracted multi-scale features and capture the correlation between different scales. Extensive experiments on synthetic and real-world images demonstrate that the proposed deraining model achieves superior performance, especially the PSNR value achieves 47.55 dB on the SPA-Data dataset. We also further validate the effectiveness of the proposed model on subsequent high-level computer vision tasks.
引用
收藏
页码:6115 / 6140
页数:26
相关论文
共 50 条
  • [31] Spatial-guided informative semantic joint transformer for single-image deraining
    Haiyan Li
    Shaolin Peng
    Xun Lang
    Shuhua Ye
    Hongsong Li
    The Journal of Supercomputing, 2024, 80 : 6522 - 6551
  • [32] Spatial-guided informative semantic joint transformer for single-image deraining
    Li, Haiyan
    Peng, Shaolin
    Lang, Xun
    Ye, Shuhua
    Li, Hongsong
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (05): : 6522 - 6551
  • [33] DeTformer: A Novel Efficient Transformer Framework for Image Deraining
    Ragini, Thatikonda
    Prakash, Kodali
    Cheruku, Ramalingaswamy
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2024, 43 (02) : 1030 - 1052
  • [34] DeTformer: A Novel Efficient Transformer Framework for Image Deraining
    Thatikonda Ragini
    Kodali Prakash
    Ramalingaswamy Cheruku
    Circuits, Systems, and Signal Processing, 2024, 43 : 1030 - 1052
  • [35] Learning A Sparse Transformer Network for Effective Image Deraining
    Chen, Xiang
    Li, Hao
    Li, Mingqiang
    Pan, Jinshan
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 5896 - 5905
  • [36] Combined with Pyramid Split Attention and Multi-Scale Feature Learning Network for Single Image Deraining
    Chen, Hui
    Zhu, Songhao
    Liang, Zhiwei
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 7064 - 7069
  • [37] LPN-IDD: A Lightweight Pyramid Network for Image Deraining and Detection
    Babar, Kainat
    Yaseen, Muhammad Usman
    Al-Shamayleh, Ahmad Sami
    Imran, Muhammad
    Al-Ghushami, Abdullah Hussein
    Akhunzada, Adnan
    IEEE ACCESS, 2024, 12 : 37103 - 37119
  • [38] Multi-scale transformer with conditioned prompt for image deraining
    Wu, Xianhao
    Chen, Hongming
    Chen, Xiang
    Xu, Guili
    DIGITAL SIGNAL PROCESSING, 2025, 156
  • [39] Magic ELF: Image Deraining Meets Association Learning and Transformer
    Jiang, Kui
    Wang, Zhongyuan
    Chen, Chen
    Wang, Zheng
    Cui, Laizhong
    Lin, Chia-Wen
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022,
  • [40] Deep Learning Algorithms for Single Image Deraining
    Grigoras, Radu
    Ciocoiu, Iulian B.
    PROCEEDINGS OF THE 2018 10TH INTERNATIONAL CONFERENCE ON ELECTRONICS, COMPUTERS AND ARTIFICIAL INTELLIGENCE (ECAI), 2018,