PBT: Progressive Background-Aware Transformer for Infrared Small Target Detection

被引:14
作者
Yang, Huoren [1 ]
Mu, Tingkui [1 ]
Dong, Ziyue [2 ]
Zhang, Zicheng [2 ]
Wang, Bin [1 ]
Ke, Wei [2 ]
Yang, Qiujie [3 ]
He, Zhiping [3 ]
机构
[1] Xran Jiaotong Univ, Sch Sci, Inst Space Opt, Xian 710049, Shaanxi, Peoples R China
[2] Xi An Jiao Tong Univ, Fac Elect & Informat Engn, Xian 710049, Shaanxi, Peoples R China
[3] Chinese Acad Sci, Shanghai Inst Tech Phys, Shanghai 200083, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷
基金
中国国家自然科学基金;
关键词
Decoding; Feature extraction; Object detection; Task analysis; Transformers; Encoding; Semantics; Background context; cross-attention; infrared small target detection (IRSTD); vision transformer; LOCAL CONTRAST METHOD; CONTEXT; MODEL; DIM;
D O I
10.1109/TGRS.2024.3415080
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
In the domain of infrared small target detection (IRSTD), the challenges revolve around detecting small and faint targets from infrared images. These targets lack distinct textures and morphology exist in complex backgrounds with numerous distractions. Current deep-learning methods typically prioritize preserving target features while neglecting the crucial background context, ultimately resulting in false alarms and miss detection. To tackle this issue, we propose a novel approach involving separately focusing on candidate target responses and background context during the encoding stage and aligning them during the decoding stage. Specifically, we introduce the progressive background-aware transformer (PBT) which adopts an asymmetric encoder-decoder architecture. The encoder with task-specific frequency domain priors extracts candidate target responses and background context features separately from shallow and deep blocks, respectively. The following hierarchical decoder progressively refines the candidate target responses under the guidance of rich background context stage by stage, leading to more accurate results. Our experiments demonstrate that PBT surpasses state-of-the-art IRSTD methods across various datasets. The code and dataset are available at https://github.com/Heron0625/PBT.
引用
收藏
页数:13
相关论文
共 76 条
[1]  
Bochkovskiy A, 2020, Arxiv, DOI arXiv:2004.10934
[2]   A Local Contrast Method for Small Infrared Target Detection [J].
Chen, C. L. Philip ;
Li, Hong ;
Wei, Yantao ;
Xia, Tian ;
Tang, Yuan Yan .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2014, 52 (01) :574-581
[3]   IRSTFormer: A Hierarchical Vision Transformer for Infrared Small Target Detection [J].
Chen, Gao ;
Wang, Weihua ;
Tan, Sirui .
REMOTE SENSING, 2022, 14 (14)
[4]   Hybrid Task Cascade for Instance Segmentation [J].
Chen, Kai ;
Pang, Jiangmiao ;
Wang, Jiaqi ;
Xiong, Yu ;
Li, Xiaoxiao ;
Sun, Shuyang ;
Feng, Wansen ;
Liu, Ziwei ;
Shi, Jianping ;
Ouyang, Wanli ;
Loy, Chen Change ;
Lin, Dahua .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :4969-4978
[5]   Activating More Pixels in Image Super-Resolution Transformer [J].
Chen, Xiangyu ;
Wang, Xintao ;
Zhou, Jiantao ;
Qiao, Yu ;
Dong, Chao .
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, :22367-22377
[6]   Towards Large-Scale Small Object Detection: Survey and Benchmarks [J].
Cheng, Gong ;
Yuan, Xiang ;
Yao, Xiwen ;
Yan, Kebing ;
Zeng, Qinghua ;
Xie, Xingxing ;
Han, Junwei .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (11) :13467-13488
[7]   Attentional Local Contrast Networks for Infrared Small Target Detection [J].
Dai, Yimian ;
Wu, Yiquan ;
Zhou, Fei ;
Barnard, Kobus .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (11) :9813-9824
[8]   Asymmetric Contextual Modulation for Infrared Small Target Detection [J].
Dai, Yimian ;
Wu, Yiquan ;
Zhou, Fei ;
Barnard, Kobus .
2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, :949-958
[9]   Reweighted Infrared Patch-Tensor Model With Both Nonlocal and Local Priors for Single-Frame Small Target Detection [J].
Dai, Yimian ;
Wu, Yiquan .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2017, 10 (08) :3752-3767
[10]   Non-negative infrared patch-image model: Robust target-background separation via partial sum minimization of singular values [J].
Dai, Yimian ;
Wu, Yiquan ;
Song, Yu ;
Guo, Jun .
INFRARED PHYSICS & TECHNOLOGY, 2017, 81 :182-194