Prompt-Guided Sparse Transformer for Remote Sensing Image Dehazing

被引：0

作者：

Dong, Haobo ^{[1
]}

Song, Tianyu ^{[1
]}

Qi, Xuanyu ^{[1
]}

Jin, Guiyue ^{[1
]}

Jin, Jiyu ^{[1
]}

Ma, Ling ^{[2
]}

机构：

[1] Dalian Polytech Univ, Sch Informat Sci & Engn, Dalian 116034, Peoples R China

[2] Wuchang Shouyi Univ, Coll Informat Sci & Engn, Wuhan 430064, Peoples R China

来源：

IEEE GEOSCIENCE AND REMOTE SENSING LETTERS | 2024年 / 21卷

基金：

中国国家自然科学基金;

关键词：

Convolution; Correlation; Transformers; Task analysis; Interference; Image restoration; Frequency-domain analysis; Frequency; prompt; remote sensing (RS) image dehazing; top-k selection operator (TSO); Transformer;

D O I：

10.1109/LGRS.2024.3450181

中图分类号：

P3 [地球物理学]; P59 [地球化学];

学科分类号：

0708 ; 070902 ;

摘要：

Transformer-based methods have gradually shown excellent performance in remote sensing (RS) image dehazing tasks. The self-attention can effectively explore nonlocal features, which are crucial for restoring images obscured by haze. However, when the tokens from the query differ from those of the key, these low-correlation self-attention values will still be included in the calculations indiscriminately, leading to further interference in the reconstruction of clear images. To better aggregate features, we propose a prompt-guided sparse Transformer (PGSformer). Specifically, adaptive top-k guided attention (ATGA) utilizes the top-k selection operator (TSO) to preserve the most important attention scores from the keys for each query, preventing interference from low-correlation query-key pairs in self-attention calculation. Meanwhile, we design the learnable prompt block (LPB) within ATGA to further enhance the accuracy of sparse selection for attention enhancement. Here, LPB guides the TSO dynamically optimizing sparse rate and adaptively learning mask thresholds to further distill the selected features. In addition, the frequency selection feedforward network (FSFN) is designed to adaptively obtain frequency information, so that the overall pipeline can improve the learning ability of dual frequency features. Extensive experimental results on several benchmarks show that our PGSformer outperforms the other competitive dehazing approach (RSDformer) by 0.92 dB on average PSNR.

引用

页数：5

共 18 条

[1] Learning A Sparse Transformer Network for Effective Image Deraining
Chen, Xiang
Li, Hao
Li, Mingqiang
Pan, Jinshan
[J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 5896 - 5905
[2] Multi-Scale Boosted Dehazing Network with Dense Feature Fusion
Dong, Hang
Pan, Jinshan
Xiang, Lei
Hu, Zhe
Zhang, Xinyi
Wang, Fei
Yang, Ming-Hsuan
[J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 2154 - 2164
[3] End-to-End Detail-Enhanced Dehazing Network for Remote Sensing Images
Dong, Weida
Wang, Chunyan
Sun, Hao
Teng, Yunjie
Liu, Huan
Zhang, Yue
Zhang, Kailin
Li, Xiaoyan
Xu, Xiping
[J]. REMOTE SENSING, 2024, 16 (02)
[4] Guo Y., 2023, P IEEE CVF C COMP VI, P1884
[5] Single Image Haze Removal Using Dark Channel Prior
He, Kaiming
Sun, Jian
Tang, Xiaoou
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (12) : 2341 - 2353
[6] Huang BH, 2020, IEEE WINT CONF APPL, P1795, DOI [10.1109/wacv45572.2020.9093471, 10.1109/WACV45572.2020.9093471]
[7] Kingma D. P., 2014, P INT C LEARN REPR, P3423
[8] Korhonen J, 2012, INT WORK QUAL MULTIM, P37, DOI 10.1109/QoMEX.2012.6263880
[9] AOD-Net: All-in-One Dehazing Network
Li, Boyi
Peng, Xiulian
Wang, Zhangyang
Xu, Jizheng
Feng, Dan
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 4780 - 4788
[10] Lin D., 2019, Remote Sens., V12, P1366

← 1 2 →