Prompt-Guided Sparse Transformer for Remote Sensing Image Dehazing

被引:0
作者
Dong, Haobo [1 ]
Song, Tianyu [1 ]
Qi, Xuanyu [1 ]
Jin, Guiyue [1 ]
Jin, Jiyu [1 ]
Ma, Ling [2 ]
机构
[1] Dalian Polytech Univ, Sch Informat Sci & Engn, Dalian 116034, Peoples R China
[2] Wuchang Shouyi Univ, Coll Informat Sci & Engn, Wuhan 430064, Peoples R China
基金
中国国家自然科学基金;
关键词
Convolution; Correlation; Transformers; Task analysis; Interference; Image restoration; Frequency-domain analysis; Frequency; prompt; remote sensing (RS) image dehazing; top-k selection operator (TSO); Transformer;
D O I
10.1109/LGRS.2024.3450181
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Transformer-based methods have gradually shown excellent performance in remote sensing (RS) image dehazing tasks. The self-attention can effectively explore nonlocal features, which are crucial for restoring images obscured by haze. However, when the tokens from the query differ from those of the key, these low-correlation self-attention values will still be included in the calculations indiscriminately, leading to further interference in the reconstruction of clear images. To better aggregate features, we propose a prompt-guided sparse Transformer (PGSformer). Specifically, adaptive top-k guided attention (ATGA) utilizes the top-k selection operator (TSO) to preserve the most important attention scores from the keys for each query, preventing interference from low-correlation query-key pairs in self-attention calculation. Meanwhile, we design the learnable prompt block (LPB) within ATGA to further enhance the accuracy of sparse selection for attention enhancement. Here, LPB guides the TSO dynamically optimizing sparse rate and adaptively learning mask thresholds to further distill the selected features. In addition, the frequency selection feedforward network (FSFN) is designed to adaptively obtain frequency information, so that the overall pipeline can improve the learning ability of dual frequency features. Extensive experimental results on several benchmarks show that our PGSformer outperforms the other competitive dehazing approach (RSDformer) by 0.92 dB on average PSNR.
引用
收藏
页数:5
相关论文
共 18 条
  • [1] Learning A Sparse Transformer Network for Effective Image Deraining
    Chen, Xiang
    Li, Hao
    Li, Mingqiang
    Pan, Jinshan
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 5896 - 5905
  • [2] Multi-Scale Boosted Dehazing Network with Dense Feature Fusion
    Dong, Hang
    Pan, Jinshan
    Xiang, Lei
    Hu, Zhe
    Zhang, Xinyi
    Wang, Fei
    Yang, Ming-Hsuan
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 2154 - 2164
  • [3] End-to-End Detail-Enhanced Dehazing Network for Remote Sensing Images
    Dong, Weida
    Wang, Chunyan
    Sun, Hao
    Teng, Yunjie
    Liu, Huan
    Zhang, Yue
    Zhang, Kailin
    Li, Xiaoyan
    Xu, Xiping
    [J]. REMOTE SENSING, 2024, 16 (02)
  • [4] Guo Y., 2023, P IEEE CVF C COMP VI, P1884
  • [5] Single Image Haze Removal Using Dark Channel Prior
    He, Kaiming
    Sun, Jian
    Tang, Xiaoou
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (12) : 2341 - 2353
  • [6] Huang BH, 2020, IEEE WINT CONF APPL, P1795, DOI [10.1109/wacv45572.2020.9093471, 10.1109/WACV45572.2020.9093471]
  • [7] Kingma D. P., 2014, P INT C LEARN REPR, P3423
  • [8] Korhonen J, 2012, INT WORK QUAL MULTIM, P37, DOI 10.1109/QoMEX.2012.6263880
  • [9] AOD-Net: All-in-One Dehazing Network
    Li, Boyi
    Peng, Xiulian
    Wang, Zhangyang
    Xu, Jizheng
    Feng, Dan
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 4780 - 4788
  • [10] Lin D., 2019, Remote Sens., V12, P1366