Infrared small target detection with super-resolution and YOLO

被引:7
作者
Hao, Xinyue [1 ,2 ]
Luo, Shaojuan [3 ]
Chen, Meiyun [1 ]
He, Chunhua [2 ]
Wang, Tao [1 ,2 ]
Wu, Heng [1 ,2 ]
机构
[1] Guangdong Univ Technol, Sch Automat, Guangdong Prov Key Lab Cyber Phys Syst, Guangzhou 510006, Peoples R China
[2] Guangdong Univ Technol, Sch Comp, Guangzhou 510006, Peoples R China
[3] Guangdong Univ Technol, Sch Chem Engn & Light Ind, Guangzhou 510006, Peoples R China
基金
中国国家自然科学基金;
关键词
Infrared remote sensing; Small target detection; Deep learning; YOLO;
D O I
10.1016/j.optlastec.2024.111221
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
Infrared remote sensing imaging plays a crucial role in military observation, nighttime security surveillance, forest fire monitoring, and so on. In these applications, detecting dim small targets has always been a challenging problem, especially in complex backgrounds and low-contrast conditions. Existing model-driven methods usually lack robustness in handling noise and small-size targets. Deep learning-based approaches are heavily dependent on data and have limitations in feature processing and fusion, leading to missed detections and false alarms. To address these issues, we propose a small target detection method for infrared images with image super-resolution technology and deep learning. Firstly, we apply super-resolution image preprocessing and multiple data augmentation to the input infrared images. Secondly, we develop a deep-learning network based on YOLO called YOLO-SR, which incorporates a bottleneck transformer block after the spatial pyramid pooling module in the backbone layer to capture long-range dependencies in the infrared images. We design a C3-Neck module in the neck layer to better extract and fuse spatial and channel information. Experimental results show that the proposed method achieves mAP@0.5 scores of 95.2% on the public datasets and effectively addresses the issues of missed detections and false alarms compared to current state-of-the-art data-driven detection methods.
引用
收藏
页数:10
相关论文
共 47 条
[1]  
[Anonymous], 2018, FREE FLIR Thermal Dataset for Algorithm Training
[2]  
[Anonymous], 2022, About us
[3]  
Cai W., 2022, IEEE Trans. Geosci, Remote Sens
[4]  
Carion Nicolas, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12346), P213, DOI 10.1007/978-3-030-58452-8_13
[5]   Target detection in infrared and SAR terrain images using a non-Gaussian stochastic model [J].
Chapple, PB ;
Bertilone, DC ;
Caprari, RS ;
Angeli, S ;
Newsam, GN .
TARGETS AND BACKGROUNDS: CHARACTERIZATION AND REPRESENTATION V, 1999, 3699 :122-132
[6]  
Chen J., 2021, arXiv
[7]   A lightweight iterative error reconstruction network for infrared image super-resolution in smart grid [J].
Chen, Lihui ;
Tang, Rui ;
Anisetti, Marco ;
Yang, Xiaomin .
SUSTAINABLE CITIES AND SOCIETY, 2021, 66
[8]   Attentional Local Contrast Networks for Infrared Small Target Detection [J].
Dai, Yimian ;
Wu, Yiquan ;
Zhou, Fei ;
Barnard, Kobus .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (11) :9813-9824
[9]   Asymmetric Contextual Modulation for Infrared Small Target Detection [J].
Dai, Yimian ;
Wu, Yiquan ;
Zhou, Fei ;
Barnard, Kobus .
2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, :949-958
[10]  
Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929