UDAformer: Underwater image enhancement based on dual attention transformer

被引:50
作者
Shen, Zhen [1 ]
Xu, Haiyong [1 ]
Luo, Ting [2 ]
Song, Yang [2 ]
He, Zhouyan [2 ]
机构
[1] Ningbo Univ, Sch Math & Stat, Ningbo 315211, Peoples R China
[2] Ningbo Univ, Coll Sci & Technol, Ningbo 315211, Peoples R China
来源
COMPUTERS & GRAPHICS-UK | 2023年 / 111卷
关键词
Underwater image enhancement; Self -attention mechanism; Transformer;
D O I
10.1016/j.cag.2023.01.009
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Underwater images suffer from color casts and low contrast degraded due to wavelength-dependent light scatter and abortion of the underwater environment, which impacts the application of high-level computer vision tasks. Considering the characteristics of uneven degradation and loss of color channel of underwater images, a novel dual attention transformer-based underwater image en-hancement method, called UDAformer, is proposed. Specifically, Dual Attention Transformer Block (DATB) combining Channel Self-Attention Transformer (CSAT) with Pixel Self-Attention Transformer is proposed for efficient encoding and decoding of underwater image features. Then, the shifted window method for the pixel self-attention (SW-PSAT) is proposed to improve computational efficiency. Finally, the underwater images are recovered through the design of residual connections based on the underwater imaging model. Experimental results demonstrate the proposed UDAformer surpasses previous state-of-the-art methods, both qualitatively and quantitatively. The code is publicly available at: https://github.com/ShenZhen0502/UDAformer.(c) 2023 Elsevier Ltd. All rights reserved.
引用
收藏
页码:77 / 88
页数:12
相关论文
共 49 条
[1]   RIECNN: real-time image enhanced CNN for traffic sign recognition [J].
Abdel-Salam, Reem ;
Mostafa, Rana ;
Abdel-Gawad, Ahmed H. .
NEURAL COMPUTING & APPLICATIONS, 2022, 34 (08) :6085-6096
[2]   Underwater Single Image Color Restoration Using Haze-Lines and a New Quantitative Dataset [J].
Berman, Dana ;
Levy, Deborah ;
Avidan, Shai ;
Treibitz, Tali .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (08) :2822-2837
[3]   End-to-End Object Detection with Transformers [J].
Carion, Nicolas ;
Massa, Francisco ;
Synnaeve, Gabriel ;
Usunier, Nicolas ;
Kirillov, Alexander ;
Zagoruyko, Sergey .
COMPUTER VISION - ECCV 2020, PT I, 2020, 12346 :213-229
[4]   CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification [J].
Chen, Chun-Fu ;
Fan, Quanfu ;
Panda, Rameswar .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :347-356
[5]   Color Channel Compensation (3C): A Fundamental Pre-Processing Step for Image Enhancement [J].
Codruta, Ancuti O. ;
Ancuti, Cosmin ;
De Vleeschouwer, Christophe ;
Sbert, Mateu .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 :2653-2665
[6]  
Dai X, 2021, PROC IEEE INT C COMP, P2998
[7]  
Devlin J, 2019, Arxiv, DOI [arXiv:1810.04805, 10.48550/arXiv.1810.04805]
[8]  
Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929
[9]   Transmission Estimation in Underwater Single Images [J].
Drews-, P., Jr. ;
do Nascimento, E. ;
Moraes, F. ;
Botelho, S. ;
Campos, M. .
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2013, :825-830
[10]  
Feifan Lv JW, 2018, BRIT MACHINE VISION