Spatial-Frequency Adaptive Remote Sensing Image Dehazing With Mixture of Experts

Cited by: 3
Authors
Shen, Hao [1]
Ding, Henghui [2]
Zhang, Yulun [3]
Cong, Xiaofeng [4]
Zhao, Zhong-Qiu [1]
Jiang, Xudong [5]
Affiliations
[1] Hefei Univ Technol, Sch Comp Sci & Informat Engn, Hefei 230009, Peoples R China
[2] Fudan Univ, Inst Big Data, Shanghai 200433, Peoples R China
[3] Shanghai Jiao Tong Univ, AI Inst, MoE Key Lab Artificial Intelligence, Shanghai 200240, Peoples R China
[4] Southeast Univ, Sch Cyber Sci Engn, Nanjing 210096, Peoples R China
[5] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
Source
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024 / Vol. 62
Keywords
Remote sensing; Transformers; Atmospheric modeling; Feature extraction; Frequency modulation; Convolutional neural networks; Frequency-domain analysis; Decoupled frequency learning; image dehazing; mixture of modulation experts (MoME); REMOVAL; HAZE;
DOI
10.1109/TGRS.2024.3458986
Chinese Library Classification
P3 [Geophysics]; P59 [Geochemistry];
Discipline classification code
0708 ; 070902 ;
Abstract
The feature modulation mechanism has been demonstrated to be particularly well-suited for efficient network design, yet it is rarely explored in remote sensing dehazing tasks. Moreover, we observe distinct patterns in haze distribution across the low-frequency (LF) and high-frequency (HF) components of hazy images from various datasets. However, existing research has rarely investigated potential solutions in the frequency domain. In response, we propose a novel spatial-frequency adaptive network (SFAN), which is mainly built from the proposed mixture of modulation experts (MoME) and decoupled frequency learning block (DFLB). Different from the fixed feature modulation design used in other tasks, the MoME adopts the mixture-of-experts mechanism to dynamically learn diverse contextual features of various granularities and scales in a sample-adaptive manner and then utilizes them to perform elementwise local feature modulation. This pure convolutional architecture enables our network to achieve a superior tradeoff between performance and efficiency. Furthermore, the DFLB is devised to facilitate LF global haze removal and the reconstruction of HF local texture information. At the micro level, we first utilize a mask extractor (ME) to generate the frequency mask from the input hazy image, then employ a dual-branch decoupled learning unit to boost frequency learning, and finally develop a mixture of fusion experts (MoFE) to achieve HF and LF feature interaction. Extensive experiments on publicly available dehazing datasets demonstrate that our network achieves superior performance while incurring lower computational costs. Compared to the state-of-the-art approach (DEA-Net), SFAN achieves, on average, a 0.83-dB PSNR improvement on five remote sensing datasets but consumes only 51% of the FLOPs. The code will be available at https://github.com/it-hao/SFAN.
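The two ideas named in the abstract, LF/HF decoupling and sample-adaptive mixing of multi-scale context for elementwise modulation, can be illustrated with a toy sketch. This is not the authors' SFAN code: the box blur stands in for a learned low-pass branch, the gate is a hand-written softmax over the image mean, and every function name and parameter here (`box_blur`, `decouple`, `mixture_modulate`, `radii`, `gate_weights`) is a hypothetical choice made only for illustration.

```python
import math

def box_blur(img, radius=1):
    """Low-pass filter: mean over a (2r+1)x(2r+1) window, clamped at borders."""
    h, w = len(img), len(img[0])
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            vals = [img[j][i]
                    for j in range(max(0, y - radius), min(h, y + radius + 1))
                    for i in range(max(0, x - radius), min(w, x + radius + 1))]
            out[y][x] = sum(vals) / len(vals)
    return out

def decouple(img, radius=1):
    """Split an image into a low-frequency part and a high-frequency residual."""
    lf = box_blur(img, radius)
    hf = [[p - l for p, l in zip(row, lrow)] for row, lrow in zip(img, lf)]
    return lf, hf

def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def mixture_modulate(img, radii=(1, 2), gate_weights=(1.0, -1.0)):
    """Toy mixture of experts: each 'expert' is a blur at a different scale.

    The gate depends on the input (its global mean), so the mix is
    sample-adaptive; the mixed context then modulates the input elementwise.
    The gate weights are fixed here purely for illustration (assumption);
    a trained network would learn them.
    """
    h, w = len(img), len(img[0])
    mean = sum(map(sum, img)) / (h * w)
    gates = softmax([gw * mean for gw in gate_weights])
    contexts = [box_blur(img, r) for r in radii]
    return [[img[y][x] * sum(g * c[y][x] for g, c in zip(gates, contexts))
             for x in range(w)] for y in range(h)]
```

By construction, `decouple` returns two parts that sum back to the input, so the LF branch can target smooth global haze while the HF residual carries local texture; `mixture_modulate` shows only the gating-then-multiply pattern, not the convolutional experts described in the paper.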
Pages: 14