Mitigate Target-Level Insensitivity of Infrared Small Target Detection via Posterior Distribution Modeling

被引:0
作者
Li, Haoqing [1 ]
Yang, Jinfu [2 ,3 ]
Xu, Yifei [1 ]
Wang, Runshi [1 ]
机构
[1] Beijing Univ Technol, Fac Informat, Beijing 100124, Peoples R China
[2] Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[3] Beijing Univ Technol, Beijing Key Lab Computat Intelligence & Intelligen, Beijing 100124, Peoples R China
基金
中国国家自然科学基金;
关键词
Noise; Task analysis; Object detection; Deep learning; Wavelet domain; Training; diffusion model; generative model; infrared small target detection (IRSTD); GENERALIZED CROSS-VALIDATION; IMAGE; DIM;
D O I
10.1109/JSTARS.2024.3429491
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Infrared small target detection (IRSTD) aims to segment small targets from infrared clutter background. Existing methods mainly focus on discriminative approaches, i.e., a pixel-level front-background binary segmentation. Since infrared small targets are small and low signal-to-clutter ratio, empirical risk has few disturbances when a certain false alarm and missed detection exist, which seriously affect the further improvement of such methods. Motivated by the dense prediction generative methods, in this article, we compensate pixel-level discriminant with mask posterior distribution modeling. Specifically, we propose a diffusion model framework for IRSTD. This generative framework maximizes the posterior distribution of the small target mask to surmount the performance bottleneck associated with minimizing discriminative empirical risk. This transition from the discriminative paradigm to generative one enables us to bypass the target-level insensitivity. Furthermore, we design a low-frequency isolation in wavelet domain to suppress the interference of intrinsic infrared noise on the diffusion noise estimation. The low-frequency component of the infrared image in the wavelet domain is processed by a neural network, and the high-frequency component is utilized to restore the targets information, to estimate the residuals of the enhanced features. Experiments show that the proposed method achieves competitive performance gains over state-of-the-art methods on NUAA-SIRST, NUDT-SIRST, and IRSTD-1 k datasets.
引用
收藏
页码:13188 / 13201
页数:14
相关论文
共 63 条
[1]  
Amit T, 2021, ARXIV
[2]  
Baranchuk D., 2022, INT C LEARN REPR
[3]   Generative Semantic Segmentation [J].
Chen, Jiaqi ;
Lu, Jiachen ;
Zhu, Xiatian ;
Zhang, Li .
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, :7111-7120
[4]   Attentional Local Contrast Networks for Infrared Small Target Detection [J].
Dai, Yimian ;
Wu, Yiquan ;
Zhou, Fei ;
Barnard, Kobus .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (11) :9813-9824
[5]   Asymmetric Contextual Modulation for Infrared Small Target Detection [J].
Dai, Yimian ;
Wu, Yiquan ;
Zhou, Fei ;
Barnard, Kobus .
2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, :949-958
[6]   Infrared Fluorescence-Based Cancer Screening Capsule for the Small Intestine [J].
Demosthenous, Panayiota ;
Pitris, Constantinos ;
Georgiou, Julius .
IEEE TRANSACTIONS ON BIOMEDICAL CIRCUITS AND SYSTEMS, 2016, 10 (02) :467-476
[7]   Max-Mean and Max-Median filters for detection of small-targets [J].
Deshpande, SD ;
Er, MH ;
Ronda, V ;
Chan, P .
SIGNAL AND DATA PROCESSING OF SMALL TARGETS 1999, 1999, 3809 :74-83
[8]  
Ermon S., 2021, P INT C LEARN REPR
[9]   ERNIE-ViLG 2.0: Improving Text-to-Image Diffusion Model with Knowledge-Enhanced Mixture-of-Denoising-Experts [J].
Feng, Zhida ;
Zhang, Zhenyu ;
Yu, Xintong ;
Fang, Yewei ;
Li, Lanxin ;
Chen, Xuyi ;
Lu, Yuxiang ;
Liu, Jiaxiang ;
Yin, Weichong ;
Feng, Shikun ;
Sun, Yu ;
Chen, Li ;
Tian, Hao ;
Wu, Hua ;
Wang, Haifeng .
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, :10135-10145
[10]   Infrared Patch-Image Model for Small Target Detection in a Single Image [J].
Gao, Chenqiang ;
Meng, Deyu ;
Yang, Yi ;
Wang, Yongtao ;
Zhou, Xiaofang ;
Hauptmann, Alexander G. .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2013, 22 (12) :4996-5009