SDBAD-Net: A Spatial Dual-Branch Attention Dehazing Network Based on Meta-Former Paradigm

Cited by: 28
Authors
Zhang, Guoqing [1, 2]
Fang, Wenxuan [1]
Zheng, Yuhui [1]
Wang, Ruili [2]
Affiliations
[1] Nanjing Univ Informat Sci & Technol, Sch Comp Sci, Nanjing 210044, Peoples R China
[2] Massey Univ, Sch Math & Computat Sci, Auckland 0632, New Zealand
Funding
National Natural Science Foundation of China
Keywords
Image dehazing; lightweight; dual-branch structure; structural features supplementary; NEURAL-NETWORK; IMAGE; VISIBILITY;
DOI
10.1109/TCSVT.2023.3274366
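Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract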
Image dehazing is a representative low-level vision task that aims to restore haze-free images from hazy images. Recently, some methods have adopted deep learning techniques to reconstruct haze-free images. However, in real-world scenarios, the complex degradation of captured images and the non-uniform spatial distribution of haze significantly weaken the generalization ability of these models. Accordingly, we propose a novel Spatial Dual-Branch Attention Dehazing network (SDBAD-Net) based on the Meta-Former paradigm for end-to-end dehazing. Specifically, we first design a robust Spatial Dual-Branch Attention (SDBA) module to filter haze distribution features of different densities, which makes it suitable for both uniform and non-uniform haze. Second, we introduce a Structural Features Supplementary (SFS) module that dynamically fuses contextual structural features in a nonlinear manner, so as to correct the image distortion caused by the lack of structural details. Finally, quantitative and qualitative experiments are carried out on two challenging datasets, and the results show that our method outperforms most state-of-the-art algorithms with fewer parameters and faster speed; in particular, it surpasses FFA-Net with only 50% of the parameters and 7% of the computational cost. In addition, we further explore the performance of our model on object detection in foggy weather using the challenging Real-world Task-driven Testing Set (RTTS), and the results further demonstrate the robustness and wide applicability of our method.
Pages: 60-70
Number of pages: 11