Multi-scale Dilated Convolution Transformer for Single Image Deraining

被引:0
|
作者
Wu, Xianhao [1 ]
JiyangLu [1 ]
Wu, Jindi [1 ]
Li, Yufeng [1 ]
机构
[1] Shenyang Aerosp Univ, Coll Elect Informat Engn, Shenyang, Peoples R China
关键词
Single image deraining; Transformers; Dilated-convolution; QUALITY ASSESSMENT;
D O I
10.1109/MMSP59012.2023.10337643
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Recently, Transformer-based methods have achieved significant improvements over convolutional neural networks (CNNs) in single image deraining, due to the powerful ability of modeling non-local information. In fact, rich local-global information representations are equally important for better satisfying rain removal. In this paper, we propose an effective image deraining method by integrating a CNN model into the Transformer backbone to accelerate network convergence, called Multi-scale Dilated-convolution Transformer (MDT), which fully leverages the learning capabilities of Transformers on non-local features, seamlessly integrating local detail extraction and global structural representation. The fundamental building unit of our framework is the Multi-scale Dilated-convolution Transformer Block (MDTB) with different dilation rates, which consists of the Dilconv Self-Attention (DSA) and the Dilconv Feed-Forward Network (DFN). Specifically, the former processes the contextual information via dilated convolutions and enables the model to emphasize spatially-varying rain distribution features, while the latter integrates the dual-branch information to facilitate the local feature learning for better feature aggregation. Extensive evaluations demonstrate that our model reaches superior performance, significantly improving the image deraining quality.
引用
收藏
页数:6
相关论文
共 50 条
  • [31] Bidirectional Multi-Scale Implicit Neural Representations for Image Deraining
    Chen, Xiang
    Pan, Jinshan
    Dong, Jiangxin
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 25627 - 25636
  • [32] DeepFake detection with multi-scale convolution and vision transformer
    Lin, Hao
    Huang, Wenmin
    Luo, Weiqi
    Lu, Wei
    DIGITAL SIGNAL PROCESSING, 2023, 134
  • [33] DilateFormer: Multi-Scale Dilated Transformer for Visual Recognition
    Jiao, Jiayu
    Tang, Yu-Ming
    Lin, Kun-Yu
    Gao, Yipeng
    Ma, Andy J.
    Wang, Yaowei
    Zheng, Wei-Shi
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 8906 - 8919
  • [34] Lightweight Object Detection Combined with Multi-Scale Dilated-Convolution and Multi-Scale Deconvolution
    Yi, Qingming
    Lü, Renyi
    Shi, Min
    Luo, Aiwen
    Huanan Ligong Daxue Xuebao/Journal of South China University of Technology (Natural Science), 2022, 50 (12): : 41 - 48
  • [35] Multi-scale representation for image deraining with state space model
    Li, Yufeng
    Xie, Chuanlong
    Chen, Hongming
    SIGNAL IMAGE AND VIDEO PROCESSING, 2025, 19 (01)
  • [36] Symmetric Enhancement of Visual Clarity through a Multi-Scale Dilated Residual Recurrent Network Approach for Image Deraining
    Bhutto, Jameel Ahmed
    Zhang, Ruihong
    Rahman, Ziaur
    SYMMETRY-BASEL, 2023, 15 (08):
  • [37] Recursive Multi-Scale Image Deraining With Sub-Pixel Convolution Based Feature Fusion and Context Aggregation
    Sharma, Suvash
    Tang, Bo
    Ball, John E.
    Carruth, Daniel W.
    Dabbiru, Lalitha
    IEEE ACCESS, 2020, 8 (08): : 177495 - 177505
  • [38] MPCT: A medical image fusion method based on multi-scale pyramid convolution and Transformer
    Xu, Yi
    Wang, Zijie
    Wu, Shoucai
    Zhan, Xiongfei
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 101
  • [39] Combined with Pyramid Split Attention and Multi-Scale Feature Learning Network for Single Image Deraining
    Chen, Hui
    Zhu, Songhao
    Liang, Zhiwei
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 7064 - 7069
  • [40] UAV image detection based on multi-scale spatial attention mechanism with hybrid dilated convolution
    Zhou, Yue
    Jiang, Yutong
    Yang, Zhonglin
    Li, Xingxin
    Sun, Wenchen
    Zhen, Han
    Wang, Ying
    2024 3RD INTERNATIONAL CONFERENCE ON IMAGE PROCESSING AND MEDIA COMPUTING, ICIPMC 2024, 2024, : 279 - 284