A Multi-Stage Transformer Network for Image Dehazing Based on Contrastive Learning

被引:1
作者
Gao F. [1 ,2 ]
Ji S. [1 ]
Guo J. [1 ]
Hou J. [3 ]
Ouyang C. [3 ]
Yang B. [4 ]
机构
[1] School of Electrical Engineering, Xi'an Jiaotong University, Xi'an
[2] Electric Power Research Institute, State Grid Shaanxi Electric Power Company Limited, Xi'an
[3] Harbin Institute of Technology(Shenzhen), School of Computer Science, Guangdong, Shenzhen
[4] Harbin Institute of Technology(Shenzhen), School of Architecture, Guangdong, Shenzhen
来源
Hsi-An Chiao Tung Ta Hsueh/Journal of Xi'an Jiaotong University | 2023年 / 57卷 / 01期
关键词
contrastive learning; image dehazing; multi-patch structure; Transformer;
D O I
10.7652/xjtuxb202301019
中图分类号
学科分类号
摘要
A multi-stage Transformer network for image dehazing based on contrastive learning is proposed to solve the problem that existing image dehazing methods fail to achieve the desired results in local image dehazing and detail restoration, and the non-homogeneous haze cannot be removed completely all the way. First, the channel-wise Transformer block is utilized as the primary feature extraction block to adequately capture the mutual long-range dependencies among channels. Second, the multi-modality supervised contrastive learning is introduced to maximize the capturing efficiency of information from the contrastive samples, so that the restored image is closer to the clear image in the embedding space while staying as far away from the hazy image as possible. Finally, a hierarchical multi-patch structure and deformable Transformer blocks are employed to effectively integrate the local and global structural information of the hazy image. Moreover, a large number of tests have been conducted on the proposed method by using two synthetic data sets and the three real data sets. The results show that the proposed MSTCNet achieves a higher peak signal-to-noise ratio(PSNR)gain of 1.49, 1.45, 0.11, 1.45 and 0.22 dB on five datasets, respectively. It outperforms existing methods in both general and non-data sets, shows the best visual effect of dehazing in removing the dense, non-homogeneous and uniform haze, and achieves the highest objective evaluation index value. © 2023 Xi'an Jiaotong University. All rights reserved.
引用
收藏
页码:195 / 210
页数:15
相关论文
共 34 条
  • [1] HASSAN H, MISHRA P, AHMAD M, Et al., Effects of haze and dehazing on deep learning-based vision models, Applied Intelligence, (2022)
  • [2] NARASIMHAN S G, NAYAR S K., Vision and the atmosphere, International Journal of Computer Vision, 48, 3, pp. 233-254, (2002)
  • [3] HE Kaiming, SUN Jian, TANG Xiaoou, Single image haze removal using dark channel prior, IEEE Transactions on Pattern Analysis and Machine Intelligence, 33, 12, pp. 2341-2353, (2011)
  • [4] BERMAN D, TREIBITZ T, AVIDAN S., Non-local image dehazing, 2016 IEEE Conference on Computer Vision and Pattern Recognition(CVPR), pp. 1674-1682, (2016)
  • [5] HUANG Wenjun, LI Jie, QI Chun, A defogging algorithm for dense fog images via low-rank and dictionary expression decomposition, Journal of Xi'an Jiaotong University, 54, 4, pp. 118-125, (2020)
  • [6] GAO Tao, LIU Mengni, CHEN Ting, Et al., A far and near scene fusion defogging algorithm based on the prior of Dark-light channel, Journal of Xi'an Jiaotong University, 55, 10, pp. 78-86, (2021)
  • [7] YANG Yan, LU Xinxuan, An image dehazing method combining adaptive brightness transformation inequality to estimate transmittance, Journal of Xi'an Jiaotong University, 55, 6, pp. 69-76, (2021)
  • [8] HUANG He, HU Kaiyi, SONG Jing, Et al., A twice optimization method for solving transmittance with haze-lines, Journal of Xi'an Jiaotong University, 55, 8, pp. 130-138, (2021)
  • [9] QU Yanyun, CHEN Yizi, HUANG Jingying, Et al., Enhanced Pix2pix dehazing network, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition(CVPR), pp. 8152-8160, (2019)
  • [10] QIN Xu, WANG Zhilin, BAI Yuanchao, Et al., FFA-net: feature fusion attention network for single image dehazing, Proceedings of the AAAI Conference on Artificial Intelligence, pp. 11908-11915, (2020)