Underwater Image Enhancement Based on Transformer, Attention, and Multi-Color-Space Inputs

被引:0
作者
Lu, Liqiong [1 ]
Wu, Dong [1 ]
Wang, Liuyin [1 ]
Zhang, Wanzhen [2 ]
Liu, Tonglai [2 ]
机构
[1] Lingnan Normal Univ, Sch Comp Sci & Intelligence Educ, Zhanjiang 524048, Peoples R China
[2] Zhongkai Univ Agr & Engn, Coll Informat Sci & Technol, Guangzhou 510225, Peoples R China
关键词
Image enhancement; Attention mechanisms; Image color analysis; Transformers; Convolutional neural networks; Visualization; Circuit faults; Object detection; Social sciences; Labeling; Underwater image enhancement; transformer block; CNN; ResNet-50; multi-dimensional attention mechanism; multi-color-space inputs; MODEL;
D O I
10.1109/ACCESS.2025.3577005
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Underwater images suffer from color distortion, low contrast and blurring caused by the attenuation, refraction, and scattering of light. For many maritime operations, underwater image enhancement is essential. This paper proposes an underwater image enhancement method based on transformer, attention and multi-color-space inputs. First, transformer block is embedded into ResNet-50 as the backbone network. Combining this backbone network and multi-scale feature fusion forms the fundamental framework of our method. This framework contributes to mine the rich features of different scene underwater image and outputs a feature map of the same size as the input image. Then, multi-dimensional attention mechanism is added to the feature maps of different scales to mine key areas that require enhancement in underwater images and calculate the enhancement value of each pixel. Finally, multi-color-space inputs including RGB, HSV and LAB as the inputs for CNN to enhancement images from various aspects such as color deviation, brightness and saturation. The comparison of image quality evaluation metrics, visual enhancement performance evaluation and impact on the performance of underwater object detection with other underwater enhancement methods on datasets including SUIM-E, UIEB, EUVP, RUIE-RIQS, RUIE-UCCS and URPC-2018 prove the good performance of our underwater image enhancement method.
引用
收藏
页码:103682 / 103696
页数:15
相关论文
共 40 条
[21]  
Mousavi M., 2023, Electronics, V12, P29
[22]   Human-Visual-System-Inspired Underwater Image Quality Measures [J].
Panetta, Karen ;
Gao, Chen ;
Agaian, Sos .
IEEE JOURNAL OF OCEANIC ENGINEERING, 2016, 41 (03) :541-551
[23]   U-Shape Transformer for Underwater Image Enhancement [J].
Peng, Lintao ;
Zhu, Chunli ;
Bian, Liheng .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 :3066-3079
[24]   Generalization of the Dark Channel Prior for Single Image Restoration [J].
Peng, Yan-Tsung ;
Cao, Keming ;
Cosman, Pamela C. .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (06) :2856-2868
[25]   Underwater Image Restoration Based on Image Blurriness and Light [J].
Peng, Yan-Tsung ;
Cosman, Pamela C. .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (04) :1579-1594
[26]   SGUIE-Net: Semantic Attention Guided Underwater Image Enhancement With Multi-Scale Perception [J].
Qi, Qi ;
Li, Kunqian ;
Zheng, Haiyong ;
Gao, Xiang ;
Hou, Guojia ;
Sun, Kun .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 :6816-6830
[27]   Underwater Image Co-Enhancement With Correlation Feature Matching and Joint Learning [J].
Qi, Qi ;
Zhang, Yongchang ;
Tian, Fei ;
Wu, Q. M. Jonathan ;
Li, Kunqian ;
Luan, Xin ;
Song, Dalei .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (03) :1133-1147
[28]   UDAformer: Underwater image enhancement based on dual attention transformer [J].
Shen, Zhen ;
Xu, Haiyong ;
Luo, Ting ;
Song, Yang ;
He, Zhouyan .
COMPUTERS & GRAPHICS-UK, 2023, 111 :77-88
[29]   CPDM: Content-preserving diffusion model for underwater image enhancement [J].
Shi, Xiaowen ;
Wang, Yuan-Gen .
SCIENTIFIC REPORTS, 2024, 14 (01)
[30]   A Rapid Scene Depth Estimation Model Based on Underwater Light Attenuation Prior for Underwater Image Restoration [J].
Song, Wei ;
Wang, Yan ;
Huang, Dongmei ;
Tjondronegoro, Dian .
ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT I, 2018, 11164 :678-688