Multi-scale Channel Transformer Network for Single Image Deraining

被引:0
作者
Namba, Yuto [1 ]
Han, Xian-Hua [2 ]
机构
[1] Yamaguchi Univ, Fac Sci, Yamaguchi, Japan
[2] Yamaguchi Univ, Fac Sci, Grad Sch Sci & Technol Innovat, Yamaguchi, Japan
来源
PROCEEDINGS OF THE 4TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA IN ASIA, MMASIA 2022 | 2022年
关键词
Transformer; Single image deraining; Low-level vision task; Computer vision; RAIN STREAKS REMOVAL;
D O I
10.1145/3551626.3564946
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Single image deraining is a very challenging task, as it requires not only restoring the spatial details and high contextual structures of the images, but also removing multiple layers of rain with varying degrees of blurring and resolutions. Recently, due to the powerful modeling capability of long-dependency, transformer-based models have manifested superior performance for high-level vision tasks, and have begun to be applied for low-level vision tasks such as various image restoration applications. However, its computational complexity increases quadratically with spatial resolutions, making it impossible to apply it to high-resolution images. In this study, we propose a novel Channel Transformer, which performs self-attention in the channel direction instead of the spatial direction. Specifically, we first incorporate multiple channel transformer blocks into a multi-scale architecture to extract multi-scale contexts and exploit channel long-dependence, and then learn a coarse estimation of the rain-free image. Finally, an original-resolution CNN-based module is employed to refine the coarse estimation via leveraging the previously learned multi-scale contexts. Experiments on several benchmark datasets demonstrate its superiority over the state-of-the-art methods.
引用
收藏
页数:7
相关论文
共 60 条
  • [1] Brown TB, 2020, Arxiv, DOI arXiv:2005.14165
  • [2] Carion N., 2020, EUROPEAN C COMPUTER
  • [3] Visual Depth Guided Color Image Rain Streaks Removal Using Sparse Coding
    Chen, Duan-Yu
    Chen, Chien-Cheng
    Kang, Li-Wei
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2014, 24 (08) : 1430 - 1455
  • [4] Pre-Trained Image Processing Transformer
    Chen, Hanting
    Wang, Yunhe
    Guo, Tianyu
    Xu, Chang
    Deng, Yiping
    Liu, Zhenhua
    Ma, Siwei
    Xu, Chunjing
    Xu, Chao
    Gao, Wen
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 12294 - 12305
  • [5] Dosovitskiy A., 2021, INT C LEARNING REPRE
  • [6] Fan Z.W., 2018, ACMMM
  • [7] Fedus W, 2022, Arxiv, DOI arXiv:2101.03961
  • [8] Fu X.Y., 2021, AAAI
  • [9] Lightweight Pyramid Networks for Image Deraining
    Fu, Xueyang
    Liang, Borong
    Huang, Yue
    Ding, Xinghao
    Paisley, John
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (06) : 1794 - 1807
  • [10] Removing rain from single images via a deep detail network
    Fu, Xueyang
    Huang, Jiabin
    Zeng, Delu
    Huang, Yue
    Ding, Xinghao
    Paisley, John
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1715 - 1723