An infrared small target detection model via Gather-Excite attention and normalized Wasserstein distance

被引:3
|
作者
Sun, Kangjian [1 ]
Huo, Ju [1 ]
Liu, Qi [1 ]
Yang, Shunyuan [2 ]
机构
[1] Harbin Inst Technol, Sch Elect Engn & Automat, Harbin 150001, Peoples R China
[2] Harbin Inst Technol, Sch Astronaut, Harbin 150001, Peoples R China
基金
中国国家自然科学基金;
关键词
infrared small target detection; YOLOv7; anchor update; Gather-Excite attention; normalized Wasserstein distance; LIGHTWEIGHT; IMAGES; FUSION; YOLO;
D O I
10.3934/mbe.2023842
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Infrared small target detection (ISTD) is the main research content for defense confrontation, long-range precision strikes and battlefield intelligence reconnaissance. Targets from the aerial view have the characteristics of small size and dim signal. These characteristics affect the performance of traditional detection models. At present, the target detection model based on deep learning has made huge advances. The You Only Look Once (YOLO) series is a classic branch. In this paper, a model with better adaptation capabilities, namely ISTD-YOLOv7, is proposed for infrared small target detection. First, the anchors of YOLOv7 are updated to provide prior. Second, Gather-Excite (GE) attention is embedded in YOLOv7 to exploit feature context and spatial location information. Finally, Normalized Wasserstein Distance (NWD) replaces IoU in the loss function to alleviate the sensitivity of YOLOv7 for location deviations of small targets. Experiments on a standard dataset show that the proposed model has stronger detection performance than YOLOv3, YOLOv5s, SSD, CenterNet, FCOS, YOLOXs, DETR and the baseline model, with a mean Average Precision (mAP) of 98.43%. Moreover, ablation studies indicate the effectiveness of the improved components.
引用
收藏
页码:19040 / 19064
页数:25
相关论文
共 50 条
  • [31] RKformer: Runge-Kutta Transformer with Random-Connection Attention for Infrared Small Target Detection
    Zhang, Mingjin
    Bai, Haichen
    Zhang, Jing
    Zhang, Rui
    Wang, Chaoyue
    Guo, Jie
    Gao, Xinbo
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 1730 - 1738
  • [32] AFE-Net: Attention-Guided Feature Enhancement Network for Infrared Small Target Detection
    Wang, Keyan
    Wu, Xueyan
    Zhou, Peicheng
    Chen, Zuntian
    Zhang, Rui
    Yang, Liyun
    Li, Yunsong
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 4208 - 4221
  • [33] EAAU-Net: Enhanced Asymmetric Attention U-Net for Infrared Small Target Detection
    Tong, Xiaozhong
    Sun, Bei
    Wei, Junyu
    Zuo, Zhen
    Su, Shaojing
    REMOTE SENSING, 2021, 13 (16)
  • [34] Infrared Dim and Small Target Detection via Multiple Subspace Learning and Spatial-Temporal Patch-Tensor Model
    Sun, Yang
    Yang, Jungang
    An, Wei
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (05): : 3737 - 3752
  • [35] Robust Infrared Small Target Detection Using a Novel Four-Leaf Model
    Zhou, Dali
    Wang, Xiaodong
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 1462 - 1469
  • [36] Infrared Small Target Detection Based on a Group Image-Patch Tensor Model
    Yang, Lanlan
    Yan, Peng
    Li, Meihui
    Zhang, Jianlin
    Xu, Zhiyong
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [37] Infrared small target detection via contrast-enhanced dual-branch network
    Xiao, Bolin
    Zhou, Wenjun
    Wang, Tianfei
    Zhang, Quan
    Peng, Bo
    DIGITAL SIGNAL PROCESSING, 2025, 159
  • [38] Self-attention mapping U-net with infrared weak supervision for enhanced small target detection
    Lang, Tingting
    Lin, Zhihan
    Zhang, Lingjian
    Qiu, Yanqing
    Yang, Shengying
    Liu, Yong
    JOURNAL OF SUPERCOMPUTING, 2025, 81 (05)
  • [39] Infrared bi-polar small target detection via novel ring morphological transformation
    Wang, Chenglong
    Yang, Linjie
    Wang, Luping
    INFRARED PHYSICS & TECHNOLOGY, 2022, 127
  • [40] MIRSAM: multimodal vision-language segment anything model for infrared small target detection
    Mingjin Zhang
    Qian Xu
    Yuchun Wang
    Xi Li
    Haojuan Yuan
    Visual Intelligence, 2025, 3 (1):