YOLOv8 for Infrared Small Target Detection

被引:0
作者
Zhuo, Yilan [1 ]
Li, Wei [1 ,2 ]
Huo, Ju [1 ]
Chao, Tao [1 ,2 ]
机构
[1] Natl Key Lab Complex Syst Control & Intelligent A, Beijing, Peoples R China
[2] Harbin Inst Technol, Harbin 150080, Peoples R China
来源
ADVANCES IN GUIDANCE, NAVIGATION AND CONTROL, VOL 18 | 2025年 / 1354卷
关键词
infrared image; target Detection; attention mechanism; YOLOv8;
D O I
10.1007/978-981-96-2268-9_37
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Infrared small target detection is a critical task in national defense, homeland security, aerospace, industrial monitoring, and environmental conservation, while the detection accuracy is generally insufficient due to the small size of the targets and the unclear features. Additionally, conventional detection algorithms often face challenges in terms of computational resources, making it difficult to achieve higher frame rates for infrared small target detection. To overcome these challenges, a new algorithm based on the latest YOLOv8 is introduced. In the feature extraction stage, the lightweight repeated C2f module from YOLOv8 is utilized, along with the incorporation of an attention mechanism. This approach reduces computational overhead while enhancing feature extraction capabilities. The detection results on the DIRST dataset demonstrate that the proposed method achieves an 8.4% increase in frame rate compared to the baseline network. Furthermore, it also exhibits improved performance in the confusion matrix analysis.
引用
收藏
页码:388 / 398
页数:11
相关论文
共 15 条
[1]  
Bochkovskiy A, 2020, ARXIV, DOI 10.48550/ARXIV.2004.10934
[2]   Fusing Self-Attention and CoordConv to Improve the YOLOv5s Algorithm for Infrared Weak Target Detection [J].
Fan, Xiangsuo ;
Ding, Wentao ;
Qin, Wenlin ;
Xiao, Dachuan ;
Min, Lei ;
Yuan, Haohao .
SENSORS, 2023, 23 (15)
[3]   Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (09) :1904-1916
[4]   You Should Look at All Objects [J].
Jin, Zhenchao ;
Yu, Dongdong ;
Song, Luchuan ;
Yuan, Zehuan ;
Yu, Lequan .
COMPUTER VISION, ECCV 2022, PT IX, 2022, 13669 :332-349
[5]  
Kumar N., 2023, arXiv
[6]   Research on Infrared Dim and Small Target Detection Algorithm Based on Low-Rank Tensor Recovery [J].
Liu, Chuntong ;
Wang, Hao .
JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2023, 34 (04) :861-872
[7]   SSD: Single Shot MultiBox Detector [J].
Liu, Wei ;
Anguelov, Dragomir ;
Erhan, Dumitru ;
Szegedy, Christian ;
Reed, Scott ;
Fu, Cheng-Yang ;
Berg, Alexander C. .
COMPUTER VISION - ECCV 2016, PT I, 2016, 9905 :21-37
[8]  
Ma J., 2023, Remote Sens, V15
[9]   ShuffleNet V2: Practical Guidelines for Efficient CNN Architecture Design [J].
Ma, Ningning ;
Zhang, Xiangyu ;
Zheng, Hai-Tao ;
Sun, Jian .
COMPUTER VISION - ECCV 2018, PT XIV, 2018, 11218 :122-138
[10]   YOLO9000: Better, Faster, Stronger [J].
Redmon, Joseph ;
Farhadi, Ali .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6517-6525