LMAFormer: Local Motion Aware Transformer for Small Moving Infrared Target Detection

被引:2
作者
Huang, Yuanxin [1 ]
Zhi, Xiyang [1 ]
Hu, Jianming [1 ]
Yu, Lijian [1 ]
Han, Qichao [1 ]
Chen, Wenbin [1 ]
Zhang, Wei [1 ]
机构
[1] Harbin Inst Technol, Res Ctr Space Opt Engn, Harbin 150001, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Feature extraction; Object detection; Transformers; Decoding; Three-dimensional displays; Computational modeling; Deep learning; Annotations; Visualization; Urban areas; Infrared small moving target detection; local motion aware; multiframe joint query; multiscale transformer encoder;
D O I
10.1109/TGRS.2024.3502663
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
In temporal infrared small target detection, it is crucial to leverage the disparities in spatiotemporal characteristics between the target and the background to distinguish the former. However, remote imaging and the relative motion between the detection platform and the background cause significant coupling of spatiotemporal characteristics, making target detection highly challenging. To address these challenges, we propose a network named LMAFormer. First, we introduce a local motion-aware spatiotemporal attention mechanism that aligns and enhances multiframe features to extract local spatiotemporal salient features of targets while avoiding interference from moving backgrounds. Second, we employ a multiscale fusion transformer encoder that computes self-attention weights across and within scales during encoding, to establish multiscale correlations among different regions of temporal images, enabling motion background modeling. Last, we propose a multiframe joint query decoder. The shallowest feature map after multiscale feature propagation is mapped to initial query weights, which are refined through grouped convolutions to generate grouped query vectors. These are jointly optimized to encapsulate rich multiframe details, strengthening motion background modeling and target feature representation, improving prediction accuracy. Experimental results on the NUDT-MIRSDT, IRDST, and the established TSIRMT datasets demonstrate that our network outperforms state-of-the-art (SOTA) methods. Our code and dataset will be available at https://github.com/lifier/LMAFormer.
引用
收藏
页数:17
相关论文
共 67 条
[1]   An Efficient Rep-Style Gaussian-Wasserstein Network: Improved UAV Infrared Small Object Detection for Urban Road Surveillance and Safety [J].
Aibibu, Tuerniyazi ;
Lan, Jinhui ;
Zeng, Yiliang ;
Lu, Weijian ;
Gu, Naiwei .
REMOTE SENSING, 2024, 16 (01)
[2]   Analysis of new top-hat transformation and the application for infrared dim small target detection [J].
Bai, Xiangzhi ;
Zhou, Fugen .
PATTERN RECOGNITION, 2010, 43 (06) :2145-2156
[3]   Toward Dense Moving Infrared Small Target Detection: New Datasets and Baseline [J].
Chen, Shengjia ;
Ji, Luping ;
Zhu, Sicheng ;
Ye, Mao ;
Ren, Haohao ;
Sang, Yongsheng .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
[4]   SSTNet: Sliced Spatio-Temporal Network With Cross-Slice ConvLSTM for Moving Infrared Dim-Small Target Detection [J].
Chen, Shengjia ;
Ji, Luping ;
Zhu, Jiewen ;
Ye, Mao ;
Yao, Xiaoyong .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 :1-12
[5]   A Multi-Task Framework for Infrared Small Target Detection and Segmentation [J].
Chen, Yuhang ;
Li, Liyuan ;
Liu, Xin ;
Su, Xiaofeng .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[6]   Masked-attention Mask Transformer for Universal Image Segmentation [J].
Cheng, Bowen ;
Misra, Ishan ;
Schwing, Alexander G. ;
Kirillov, Alexander ;
Girdhar, Rohit .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :1280-1289
[7]   XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model [J].
Cheng, Ho Kei ;
Schwing, Alexander G. .
COMPUTER VISION - ECCV 2022, PT XXVIII, 2022, 13688 :640-658
[8]   One-Stage Cascade Refinement Networks for Infrared Small Target Detection [J].
Dai, Yimian ;
Li, Xiang ;
Zhou, Fei ;
Qian, Yulei ;
Chen, Yaohong ;
Yang, Jian .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
[9]   Reweighted Infrared Patch-Tensor Model With Both Nonlocal and Local Priors for Single-Frame Small Target Detection [J].
Dai, Yimian ;
Wu, Yiquan .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2017, 10 (08) :3752-3767
[10]   Infrared Small Marine Target Detection Based on Spatiotemporal Dynamics Analysis [J].
Dang, Chujia ;
Li, Zhengzhou ;
Hao, Congyu ;
Xiao, Qin .
REMOTE SENSING, 2023, 15 (05)