A transformer-based network for perceptual contrastive underwater image enhancement

被引:5
|
作者
Cheng, Na [1 ]
Sun, Zhixuan [1 ]
Zhu, Xuanbing [1 ]
Wang, Hongyu [1 ]
机构
[1] Dalian Univ Technol, Sch Informat & Commun Engn, Dalian 116024, Peoples R China
基金
中国国家自然科学基金;
关键词
Underwater image enhancement; Transformer; Multi-loss function; Contrastive learning; MODEL;
D O I
10.1016/j.image.2023.117032
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Vision-based underwater image enhancement methods have received much attention for application in the fields of marine engineering and marine science. The absorption and scattering of light in real underwater scenes leads to severe information degradation in the acquired underwater images, thus limiting further development of underwater tasks. To solve these problems, a novel transformer-based perceptual contrastive network for underwater image enhancement methods (TPC-UIE) is proposed to achieve visually friendly and high-quality images, where contrastive learning is applied to the underwater image enhancement (UIE) task for the first time. Specifically, to address the limitations of the pure convolution-based network, we embed the transformer into the UIE network to improve its ability to capture global dependencies. Then, the limits of the transformer are then taken into account as convolution is reintroduced to better capture local attention. At the same time, the dual-attention module strengthens the network's focus on the spatial and color channels that are more severely attenuated. Finally, a perceptual contrastive regularization method is proposed, where a multi-loss function made up of reconstruction loss, perceptual loss, and contrastive loss jointly optimizes the model to simultaneously ensure texture detail, contrast, and color consistency. Experimental results on several existing datasets show that the TPC-UIE obtains excellent performance in both subjective and objective evaluations compared to other methods. In addition, the visual quality of the underwater images is significantly improved by the enhancement of the method and effectively facilitates further development of the underwater task.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] TIPFNet: a transformer-based infrared polarization image fusion network
    Li, Kunyuan
    Qi, Meibin
    Zhuang, Shuo
    Yang, Yanfang
    Gao, Jun
    OPTICS LETTERS, 2022, 47 (16) : 4255 - 4258
  • [32] Image captioning using transformer-based double attention network
    Parvin, Hashem
    Naghsh-Nilchi, Ahmad Reza
    Mohammadi, Hossein Mahvash
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 125
  • [33] Transformer-based progressive residual network for single image dehazing
    Yang, Zhe
    Li, Xiaoling
    Li, Jinjiang
    FRONTIERS IN NEUROROBOTICS, 2022, 16
  • [34] UDAformer: Underwater image enhancement based on dual attention transformer
    Shen, Zhen
    Xu, Haiyong
    Luo, Ting
    Song, Yang
    He, Zhouyan
    COMPUTERS & GRAPHICS-UK, 2023, 111 : 77 - 88
  • [35] Underwater Image Enhancement Based on Parallel Guidance of Transformer and CNN
    Chang, Jian
    Chen, Hongfu
    Wang, Bingbing
    Computer Engineering and Applications, 2024, 60 (04) : 280 - 288
  • [36] Transformer-Based Seismic Image Enhancement: A Novel Approach for Improved Resolution
    Park, Jin-Yeong
    Saad, Omar M.
    Oh, Ju-Won
    Alkhalifah, Tariq
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63
  • [37] A Transformer-based invertible neural network for robust image watermarking
    He, Zhouyan
    Hu, Renzhi
    Wu, Jun
    Luo, Ting
    Xu, Haiyong
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 104
  • [38] A Two-Stage Network Based on Transformer and Physical Model for Single Underwater Image Enhancement
    Zhang, Yuhao
    Chen, Dujing
    Zhang, Yanyan
    Shen, Meiling
    Zhao, Weiyu
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2023, 11 (04)
  • [39] A content-style control network with style contrastive learning for underwater image enhancement
    Wang, Zhenguang
    Tao, Huanjie
    Zhou, Hui
    Deng, Yishi
    Zhou, Ping
    MULTIMEDIA SYSTEMS, 2025, 31 (01)
  • [40] TMCIH: Perceptual Robust Image Hashing with Transformer-based Multi-layer Constraints
    Fang, Yaodong
    Zhou, Yuanding
    Li, Xinran
    Kong, Ping
    Qin, Chuan
    PROCEEDINGS OF THE 2023 ACM WORKSHOP ON INFORMATION HIDING AND MULTIMEDIA SECURITY, IH&MMSEC 2023, 2023, : 7 - 12