RainFormer: a pyramid transformer for single image deraining

Cited by: 4
Authors
Yang, Hao [1]
Zhou, Dongming [1]
Cao, Jinde [2, 3]
Zhao, Qian [1]
Li, Miao [1]
Affiliations
[1] Yunnan Univ, Sch Informat Sci & Engn, Kunming 650504, Yunnan, Peoples R China
[2] Southeast Univ, Sch Math, Nanjing 210096, Peoples R China
[3] Yonsei Univ, Yonsei Frontier Lab, Seoul 03722, South Korea
Source
JOURNAL OF SUPERCOMPUTING | 2023 / Vol. 79 / No. 06
Funding
National Natural Science Foundation of China;
Keywords
Single image deraining; Vision transformer; Image restoration; Neural networks; Deep learning; RAIN;
DOI
10.1007/s11227-022-04895-5
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812 ;
Abstract
Rain impairs the performance of outdoor vision systems, such as automated driving and outdoor surveillance systems, so image deraining has great potential as an image preprocessing technique. Two defects of convolutional neural networks, a small receptive field and a lack of adaptivity to input content, limit further improvement of deraining models. Recently, a novel neural network, the transformer, has demonstrated impressive performance on natural language processing and vision tasks. However, using transformers for image deraining still raises two issues. First, although transformers have powerful long-range modeling capabilities, they lack the ability to model local features, which is critical for image deraining. Second, a transformer processes images in fixed-size patches, so pixels at the edges of a patch cannot use the local features of neighboring pixels to restore the rain-free image. In this paper, we propose a novel pyramid transformer for image deraining. To address the first issue, we design a residual-Dconv feed-forward network (RDFN), in which depth-wise convolution improves the capability of modeling local features. To address the second issue, we introduce multi-resolution features into the transformer, which allows it to obtain patches at different scales and thus enables boundary pixels to utilize local features. Furthermore, we propose a novel multi-scale fusion bridge (MSFB) to effectively integrate the extracted multi-scale features and capture the correlation between different scales. Extensive experiments on synthetic and real-world images demonstrate that the proposed deraining model achieves superior performance, notably a PSNR of 47.55 dB on the SPA-Data dataset. We further validate the effectiveness of the proposed model on subsequent high-level computer vision tasks.
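The abstract reports its headline result as a PSNR value in dB. As a reading aid, the following is a minimal sketch of the standard PSNR definition, 10 · log10(MAX² / MSE). It is not the authors' evaluation code: the function name `psnr`, the nested-list image representation, and the 8-bit peak value of 255 are assumptions made for illustration, and the record does not say which color channel or value range the paper evaluates on.

```python
import math


def psnr(reference, restored, max_value=255.0):
    """Peak signal-to-noise ratio in dB between two equally sized images.

    Both images are nested lists (rows of pixel intensities).
    max_value is the assumed peak intensity (255 for 8-bit images).
    """
    ref_flat = [p for row in reference for p in row]
    out_flat = [p for row in restored for p in row]
    if not ref_flat or len(ref_flat) != len(out_flat):
        raise ValueError("images must be non-empty and have the same size")

    # Mean squared error over all pixels.
    mse = sum((a - b) ** 2 for a, b in zip(ref_flat, out_flat)) / len(ref_flat)
    if mse == 0:
        return math.inf  # identical images: PSNR is unbounded
    return 10.0 * math.log10(max_value ** 2 / mse)


if __name__ == "__main__":
    clean = [[10, 20], [30, 40]]
    derained = [[11, 19], [31, 39]]  # every pixel off by one intensity level
    print(f"{psnr(clean, derained):.2f} dB")
```

Under the same 8-bit assumption, a PSNR of 47.55 dB would correspond to a mean squared error of roughly 1.14, that is, an average error of about one intensity level per pixel.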
Pages: 6115 / 6140
Page count: 26
Related papers
50 in total
  • [21] Multi-scale Dilated Convolution Transformer for Single Image Deraining
    Wu, Xianhao
    Lu, Jiyang
    Wu, Jindi
    Li, Yufeng
    2023 IEEE 25TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, MMSP, 2023,
  • [22] Multi-scale Channel Transformer Network for Single Image Deraining
    Namba, Yuto
    Han, Xian-Hua
    PROCEEDINGS OF THE 4TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA IN ASIA, MMASIA 2022, 2022,
  • [23] Single image deraining via deep pyramid network with spatial contextual information aggregation
    Wang, Cong
    Wu, Yutong
    Cai, Yu
    Yao, Guangle
    Su, Zhixun
    Wang, Hongyan
    APPLIED INTELLIGENCE, 2020, 50 (05) : 1437 - 1447
  • [24] Single image deraining via deep pyramid network with spatial contextual information aggregation
    Cong Wang
    Yutong Wu
    Yu Cai
    Guangle Yao
    Zhixun Su
    Hongyan Wang
    Applied Intelligence, 2020, 50 : 1437 - 1447
  • [25] CTFCD: Channel transformer based on full convolutional decoder for single image deraining
    Tan, Shaohan
    Chen, Hui
    Zhu, Songhao
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 98
  • [26] MLTDNet: an efficient multi-level transformer network for single image deraining
    Gao, Feng
    Mu, Xiangyu
    Ouyang, Chao
    Yang, Kai
    Ji, Shengchang
    Guo, Jie
    Wei, Haokun
    Wang, Nan
    Ma, Lei
    Yang, Biao
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (16) : 14013 - 14027
  • [27] Image Deraining Based on Adaptive Perceptual Pyramid Network
    Yang, Ai-Ping
    Wang, Chao-Chen
    Wang, Jian
    Zhang, Teng-Fei
    Dongbei Daxue Xuebao/Journal of Northeastern University, 2022, 43 (04) : 470 - 479
  • [28] MLTDNet: an efficient multi-level transformer network for single image deraining
    Feng Gao
    Xiangyu Mu
    Chao Ouyang
    Kai Yang
    Shengchang Ji
    Jie Guo
    Haokun Wei
    Nan Wang
    Lei Ma
    Biao Yang
    Neural Computing and Applications, 2022, 34 : 14013 - 14027
  • [29] Image Deraining Transformer with Sparsity and Frequency Guidance
    Song, Tianyu
    Li, Pengpeng
    Jin, Guiyue
    Jin, Jiyu
    Fan, Shumin
    Chen, Xiang
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 1889 - 1894
  • [30] Denseformer for Single Image Deraining
    Wang, Tianming
    Wang, Kaige
    Li, Qing
    INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2023, 33 (04) : 651 - 661