Dehazeformer: Nonhomogeneous Image Dehazing With Collaborative Global-local Network

被引:0
作者
Luo, Xiao-Tong [1 ]
Yang, Wen-Jin [1 ]
Qu, Yan-Yun [1 ]
Xie, Yuan [2 ]
机构
[1] Department of Computer Science and Technology, School of Informatics, Xiamen University, Xiamen
[2] School of Computer Science and Technology, East China Normal University, Shanghai
来源
Zidonghua Xuebao/Acta Automatica Sinica | 2024年 / 50卷 / 07期
基金
中国国家自然科学基金;
关键词
convolutional neural network (CNN); feature fusion; Image dehazing; sparse self-attention; Transformer;
D O I
10.16383/j.aas.c230567
中图分类号
学科分类号
摘要
In recent years, image dehazing methods based on convolutional neural network (CNN) have made remarkable progress in synthetic datasets, but the local receptive field of convolution operation is difficult to effectively capture contextual guidance information due to the uneven distribution of haze in the real scene, resulting in the loss of global structure information. Therefore, the image dehazing task in the real scene still faces great challenges. Considering that Transformer has the advantage of capturing long-range semantic information dependency relationships, it can facilitate global structure information reconstruction. However, the high computational complexity of the standard Transformer structure hinders its application in image restoration. To solve the problems mentioned above, this paper proposes a double-branch collaborative nonhomogeneous image dehazing network, which is called Dehazeformer and composed of Transformer and convolutional neural network. The Transformer branch is used to extract global structure information, and sparse self-attention modules (SSM) are designed to reduce computational complexity. Besides, the convolutional neural network branch is used to obtain local information to recover texture details. Extensive experiments in the real nonhomogeneous haze scene show that the proposed method achieves excellent performance in both objective evaluation and subjective visual effects. © 2024 Science Press. All rights reserved.
引用
收藏
页码:1333 / 1344
页数:11
相关论文
共 36 条
  • [1] Nayar S K, Narasimhan S G., Vision in bad weather, Proceedings of the Seventh IEEE/CVF International Conference on Computer Vision (ICCV), pp. 820-827, (1999)
  • [2] Dong H, Pan J S, Xiang L, Hu Z, Zhang X Y, Wang F, Et al., Multi-scale boosted dehazing network with dense feature fusion, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2154-2164, (2020)
  • [3] Hong M, Xie Y, Li C H, Qu Y Y., Distilling image dehazing with heterogeneous task imitation, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3459-3468, (2020)
  • [4] Qin X, Wang Z L, Bai Y C, Xie X D, Jia H Z., FFA-Net: Feature fusion attention network for single image dehazing, Proceedings of the Thirty-fourth AAAI Conference on Artificial Intelligence, pp. 11908-11915, (2020)
  • [5] Wu H Y, Qu Y Y, Lin S H, Zhou J, Qiao R Z, Zhang Z Z, Et al., Contrastive learning for compact single image dehazing, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 10546-10555, (2021)
  • [6] Liu J, Wu H Y, Xie Y, Qu Y Y, Ma L Z., Trident dehazing network, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pp. 1732-1741, (2020)
  • [7] Wu H Y, Liu J, Xie Y, Qu Y Y, Ma L Z., Knowledge transfer dehazing network for nonhomogeneous dehazing, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pp. 1975-1983, (2020)
  • [8] Yu Y K, Liu H, Fu M H, Chen J, Wang X Y, Wang K Y., A two-branch neural network for non-homogeneous dehazing via ensemble learning, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pp. 193-202, (2021)
  • [9] Fu M H, Liu H, Yu Y K, Chen J, Wang K Y., DW-GAN: A discrete wavelet transform GAN for nonhomogeneous dehazing, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pp. 203-212, (2021)
  • [10] Guo C L, Yan Q X, Anwar S, Cong R M, Ren W Q, Li C Y., Image dehazing Transformer with transmission-aware 3D position embedding, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pp. 5802-5810, (2022)