Toward Accurate Infrared Small Target Detection via Edge-Aware Gated Transformer

被引:17
作者
Zhu, Yiming [1 ]
Ma, Yong [1 ]
Fan, Fan [1 ]
Huang, Jun [1 ]
Wu, Kangle [1 ]
Wang, Ge [1 ]
机构
[1] Wuhan Univ, Elect Informat Sch, Wuhan 430072, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Shape; Transformers; Image edge detection; Object detection; Logic gates; Task analysis; Gated-shaped stream; infrared small target; Swin Transformer; MODEL; DIM;
D O I
10.1109/JSTARS.2024.3386899
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Extracting small targets from complex backgrounds is the eventual goal of single-frame infrared small target detection, which has many potential applications in defense security and marine rescue. Recently, methods utilizing deep learning have shown their superiority over traditional theoretical approaches. However, they do not consider both the global semantics and specific shape information, thereby limiting their performance. To overcome this proplem, we propose a gated-shaped TransUnet (GSTUnet), designed to fully utilize shape information while detecting small target detection. Specifically, we have proposed a multiscale encoder branch to extract global features of small targets at different scales. Then, the extracted global features are passed through a gated-shaped stream branch that focuses on the shape information of small targets through gate convolutions. Finally, we fuse their features to obtain the final result. Our GSTUnet learns both global and shape information through the aforementioned two branches, establishing global relationships between different feature scales. The GSTUnet demonstrates excellent evaluation metrics on various datasets, outperforming current state-of-the-art methods.
引用
收藏
页码:8779 / 8793
页数:15
相关论文
共 56 条
[1]   Analysis of new top-hat transformation and the application for infrared dim small target detection [J].
Bai, Xiangzhi ;
Zhou, Fugen .
PATTERN RECOGNITION, 2010, 43 (06) :2145-2156
[2]  
Cao Hu, 2023, Computer Vision - ECCV 2022 Workshops: Proceedings. Lecture Notes in Computer Science (13803), P205, DOI 10.1007/978-3-031-25066-8_9
[3]   End-to-End Object Detection with Transformers [J].
Carion, Nicolas ;
Massa, Francisco ;
Synnaeve, Gabriel ;
Usunier, Nicolas ;
Kirillov, Alexander ;
Zagoruyko, Sergey .
COMPUTER VISION - ECCV 2020, PT I, 2020, 12346 :213-229
[4]  
Castleman K.R., 1996, DIGITAL IMAGE PROCES
[5]   A Local Contrast Method for Small Infrared Target Detection [J].
Chen, C. L. Philip ;
Li, Hong ;
Wei, Yantao ;
Xia, Tian ;
Tang, Yuan Yan .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2014, 52 (01) :574-581
[6]  
Chen J., 2021, arXiv
[7]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[8]   One-Stage Cascade Refinement Networks for Infrared Small Target Detection [J].
Dai, Yimian ;
Li, Xiang ;
Zhou, Fei ;
Qian, Yulei ;
Chen, Yaohong ;
Yang, Jian .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
[9]   Attentional Local Contrast Networks for Infrared Small Target Detection [J].
Dai, Yimian ;
Wu, Yiquan ;
Zhou, Fei ;
Barnard, Kobus .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (11) :9813-9824
[10]   Asymmetric Contextual Modulation for Infrared Small Target Detection [J].
Dai, Yimian ;
Wu, Yiquan ;
Zhou, Fei ;
Barnard, Kobus .
2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, :949-958