Underwater Image Enhancement via Adaptive Group Attention-Based Multiscale Cascade Transformer

被引:90
作者
Huang, Zhixiong [1 ]
Li, Jinjiang [2 ]
Hua, Zhen [1 ]
Fan, Linwei [3 ]
机构
[1] Shandong Technol & Business Univ, Sch Informat & Elect Engn, Yantai 264005, Shandong, Peoples R China
[2] Shandong Technol & Business Univ, Sch Comp Sci & Technol, Yantai 264005, Shandong, Peoples R China
[3] Shandong Univ Finance & Econ, Sch Comp Sci & Technol, Jinan 250014, Shandong, Peoples R China
基金
中国国家自然科学基金;
关键词
Image color analysis; Image restoration; Imaging; Attenuation; Transformers; Image enhancement; Estimation; Adaptive group attention (AGA); multiscale cascade; Swin Transformer; underwater image enhancement; COLOR CORRECTION; RESTORATION; MAP;
D O I
10.1109/TIM.2022.3189630
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The absorption and scattering caused by the underwater medium degrade the quality of underwater optical imaging, which limits the further development of underwater tasks. Recently, transformer-based methods have shown the same excellent performance as convolutional neural networks (CNNs) in various vision tasks, but the huge parameters of such networks hinder their application deployment. In this article, we propose novel adaptive group attention (AGA), which can dynamically select visually complementary channels based on the dependencies, reducing the number of further attention parameters. The AGA is applied in the Swin Transformer module and used to design an end-to-end underwater image enhancement network. The network also introduces the multiscale cascade module and the channel attention mechanism. This article conducted ablation study and qualitative and quantitative comparisons on public datasets, and the results show that the application of AGA significantly compresses the model size while ensuring performance, and other application components have the significant gain on the network. Compared with other advanced methods, the network in this article has outstanding performance.
引用
收藏
页数:18
相关论文
共 68 条
[21]  
Han K., 2021, P NIPS 21 P 35 INT C
[22]   Single Image Haze Removal Using Dark Channel Prior [J].
He, Kaiming ;
Sun, Jian ;
Tang, Xiaoou .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (12) :2341-2353
[23]   Shallow-Water Image Enhancement Using Relative Global Histogram Stretching Based on Adaptive Parameter Acquisition [J].
Huang, Dongmei ;
Wang, Yan ;
Song, Wei ;
Sequeira, Jean ;
Mavromatis, Sebastien .
MULTIMEDIA MODELING, MMM 2018, PT I, 2018, 10704 :453-465
[24]   A review on underwater autonomous environmental perception and target grasp, the challenge of robotic organism capture [J].
Huang, Hai ;
Tang, Qirong ;
Li, Jiyong ;
Zhang, Wanli ;
Bao, Xuan ;
Zhu, Haitao ;
Wang, Gang .
OCEAN ENGINEERING, 2020, 195
[25]  
Islam MJ, 2020, ROBOTICS: SCIENCE AND SYSTEMS XVI
[26]   Fast Underwater Image Enhancement for Improved Visual Perception [J].
Islam, Md Jahidul ;
Xia, Youya ;
Sattar, Junaed .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (02) :3227-3234
[27]   Zero-shot Single Image Restoration through Controlled Perturbation of Koschmieder's Model [J].
Kar, Aupendu ;
Dhara, Sobhan Kanti ;
Sen, Debashis ;
Biswas, Prabir Kumar .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :16200-16210
[28]  
Kitaev N., 2020, arXiv, DOI DOI 10.48550/ARXIV.2001.04451
[29]  
Lample G, 2019, Arxiv, DOI arXiv:1907.05242
[30]   Contrast Enhancement Based on Layered Difference Representation of 2D Histograms [J].
Lee, Chulwoo ;
Lee, Chul ;
Kim, Chang-Su .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2013, 22 (12) :5372-5384