IRSTD-YOLO: An Improved YOLO Framework for Infrared Small Target Detection

被引:0
作者
Tang, Yuan [1 ]
Xu, Tingfa [1 ,2 ]
Qin, Haolin [1 ]
Li, Jianan [1 ]
机构
[1] Beijing Inst Technol, Beijing 100081, Peoples R China
[2] Beijing Inst Technol, Chongqing Innovat Ctr, Chongqing 401135, Peoples R China
关键词
Feature extraction; Object detection; Image edge detection; Training; Convolution; Kernel; Data mining; YOLO; Head; Geoscience and remote sensing; Infrared small target detection; target enhancement;
D O I
10.1109/LGRS.2025.3562096
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Detecting small targets in infrared images, especially in low-contrast and complex backgrounds, remains challenging. To tackle this, we propose infrared small target detection YOLO (IRSTD-YOLO), a novel detection network. The edge and feature extraction (EFE) module enhances feature representation by integrating a SobelConv branch and a 2DConv branch. The SobelConv branch applies Sobel operators to extract gradient information, enhancing edge contrast and making small targets more distinguishable from the background. Unlike standard convolutions, which process all features uniformly, this edge-aware operation emphasizes structural information crucial for detecting small infrared targets. The 2DConv branch captures spatial context, complementing the edge features to create a more comprehensive representation. To further refine detection, we introduce the infrared small target enhancement (IRSTE) module, addressing the limitations of conventional feature pyramid networks. Instead of merely adding a shallow detection head, IRSTE processes and enhances shallow-layer features, which are rich in small target information, and fuses them with deeper features. By leveraging a multibranch strategy that integrates local, global, and large-scale contexts, IRSTE enhances small target representation and detection robustness, particularly in low-contrast environments where traditional networks often fail. Experimental results show that IRSTD-YOLO achieves an mAP@0.5:0.95 of 36.7% on the InfraredUAV dataset and 51.6% on the AntiUAV310 dataset, outperforming YOLOv11-s by 4.4% and 4.2%, respectively. Code is released at https://github.com/vectorbullet/IRSTD-YOLO
引用
收藏
页数:5
相关论文
共 19 条
[1]   End-to-End Object Detection with Transformers [J].
Carion, Nicolas ;
Massa, Francisco ;
Synnaeve, Gabriel ;
Usunier, Nicolas ;
Kirillov, Alexander ;
Zagoruyko, Sergey .
COMPUTER VISION - ECCV 2020, PT I, 2020, 12346 :213-229
[2]   Weakly Supervised Learning for Pixel-Level Sea Ice Concentration Extraction Using AI4Arctic Sea Ice Challenge Dataset [J].
Chen, Xinwei ;
Patel, Muhammed ;
Xu, Linlin ;
Chen, Yuhao ;
Scott, K. Andrea ;
Clausi, David A. .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21 :1-5
[3]  
Cui YN, 2024, AAAI CONF ARTIF INTE, P1426
[4]   Attentional Local Contrast Networks for Infrared Small Target Detection [J].
Dai, Yimian ;
Wu, Yiquan ;
Zhou, Fei ;
Barnard, Kobus .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (11) :9813-9824
[5]   Asymmetric Contextual Modulation for Infrared Small Target Detection [J].
Dai, Yimian ;
Wu, Yiquan ;
Zhou, Fei ;
Barnard, Kobus .
2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, :949-958
[6]  
[回丙伟 Hui Bingwei], 2020, [中国科学数据, China Scientific Data], V5, P1, DOI 10.11922/csdata.2019.0074.zh
[7]  
Jiang N, 2021, Arxiv, DOI arXiv:2101.08466
[8]  
Khanam R, 2024, Arxiv, DOI arXiv:2410.17725
[9]   Dense Nested Attention Network for Infrared Small Target Detection [J].
Li, Boyang ;
Xiao, Chao ;
Wang, Longguang ;
Wang, Yingqian ;
Lin, Zaiping ;
Li, Miao ;
An, Wei ;
Guo, Yulan .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 :1745-1758
[10]  
Meng SQ, 2025, IEEE T GEOSCI REMOTE, V63, DOI 10.1109/TGRS.2025.3541171