Diff-Mosaic: Augmenting Realistic Representations in Infrared Small Target Detection via Diffusion Prior

被引:2
|
作者
Shi, Yukai [1 ]
Lin, Yupei [1 ]
Wei, Pengxu [2 ]
Xian, Xiaoyu [3 ]
Chen, Tianshui [1 ]
Lin, Liang [2 ]
机构
[1] Guangdong Univ Technol, Sch Informat Engn, Guangzhou 510006, Peoples R China
[2] Sun Yat Sen Univ, Sch Comp Sci, Guangzhou 510006, Peoples R China
[3] CRRC Acad Co Ltd, Beijing 100036, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷
关键词
Data augmentation; Object detection; Feature extraction; Task analysis; Diversity reception; Data models; Image synthesis; diffusion model; infrared small target detection; Mosaic augmentation; LOCAL CONTRAST METHOD;
D O I
10.1109/TGRS.2024.3408045
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Recently, researchers have proposed various deep learning methods to accurately detect infrared targets with the characteristics of indistinct shape and texture. Due to the limited variety of infrared datasets, training deep learning models with good generalization poses a challenge. To augment the infrared dataset, researchers employ data augmentation techniques, which often involve generating new images by combining images from different datasets. However, these methods are lacking in two respects. In terms of realism, the images generated by mixup-based methods lack realism and are difficult to effectively simulate complex real-world scenarios. In terms of diversity, compared with real-world scenes, borrowing knowledge from another dataset inherently has a limited diversity. Currently, the diffusion model stands out as an innovative generative approach. Large-scale trained diffusion models have a strong generative prior that enables real-world modeling of images to generate diverse and realistic images. In this article, we propose Diff-Mosaic, a data augmentation method based on the diffusion model. This model effectively alleviates the challenge of diversity and realism of data augmentation methods via diffusion prior. Specifically, our method consists of two stages. First, we introduce an enhancement network called Pixel-Prior, which generates highly coordinated and realistic Mosaic images by harmonizing pixels. In the second stage, we propose an image enhancement strategy named Diff-Prior. This strategy utilizes diffusion priors to model images in the real-world scene, further enhancing the diversity and realism of the images. Extensive experiments have demonstrated that our approach significantly improves the performance of the detection network. The code is available at https://github.com/YupeiLin2388/Diff-Mosaic.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Infrared Moving Small-Target Detection via Spatiotemporal Consistency of Trajectory Points
    Zhao, Fan
    Wang, Tingting
    Shao, Sidi
    Mang, Erhu
    Lin, Guangfeng
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2020, 17 (01) : 122 - 126
  • [32] Infrared Small UAV Target Detection Based on Residual Image Prediction via Global and Local Dilated Residual Networks
    Fang, Houzhang
    Xia, Mingjiang
    Zhou, Gang
    Chang, Yi
    Yan, Luxin
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [33] Infrared small target detection via contrast-enhanced dual-branch network
    Xiao, Bolin
    Zhou, Wenjun
    Wang, Tianfei
    Zhang, Quan
    Peng, Bo
    DIGITAL SIGNAL PROCESSING, 2025, 159
  • [34] Sparse Prior Is Not All You Need: When Differential Directionality Meets Saliency Coherence for Infrared Small Target Detection
    Zhou, Fei
    Fu, Maixia
    Qian, Yulei
    Yang, Jian
    Dai, Yimian
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73
  • [35] Infrared small target detection via line-based reconstruction and entropy-induced suppression
    Shang, Ke
    Sun, Xiao
    Tian, Jinwen
    Li, Yansheng
    Ma, Jiayi
    INFRARED PHYSICS & TECHNOLOGY, 2016, 76 : 75 - 81
  • [36] Robust Infrared Small Target Detection via Multidirectional Derivative-Based Weighted Contrast Measure
    Lu, Ruitao
    Yang, Xiaogang
    Li, Weipeng
    Fan, Jiwei
    Li, Dalei
    Jing, Xin
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [37] Designing and learning a lightweight network for infrared small target detection via dilated pyramid and semantic distillation
    Chen, Gao
    Wang, Weihua
    Li, Xingjian
    INFRARED PHYSICS & TECHNOLOGY, 2023, 131
  • [38] An infrared small target detection model via Gather-Excite attention and normalized Wasserstein distance
    Sun, Kangjian
    Huo, Ju
    Liu, Qi
    Yang, Shunyuan
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (11) : 19040 - 19064
  • [39] Adaptive detection method of infrared small target based on target-background separation via robust principal component analysis
    Wang, Chuanyun
    Qin, Shiyin
    INFRARED PHYSICS & TECHNOLOGY, 2015, 69 : 123 - 135
  • [40] Infrared low-altitude and slow-speed small target detection via fusion of target sparsity and motion saliency
    Wu, Lang
    Ma, Yong
    Huang, Jun
    Qiu, Zhaobing
    Fan, Fan
    INFRARED PHYSICS & TECHNOLOGY, 2024, 142