Leveraging Visual Language Model and Generative Diffusion Model for Zero-Shot SAR Target Recognition

被引:2
作者
Wang, Junyu [1 ]
Sun, Hao [1 ]
Tang, Tao [1 ]
Sun, Yuli [1 ]
He, Qishan [1 ]
Lei, Lin [1 ]
Ji, Kefeng [1 ]
机构
[1] Natl Univ Def Technol, Coll Elect Sci & Technol, Changsha 410073, Peoples R China
基金
中国国家自然科学基金;
关键词
SAR simulation; target recognition; visual language model; generative diffusion model; domain adaption;
D O I
10.3390/rs16162927
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Simulated data play an important role in SAR target recognition, particularly under zero-shot learning (ZSL) conditions caused by the lack of training samples. The traditional SAR simulation method is based on manually constructing target 3D models for electromagnetic simulation, which is costly and limited by the target's prior knowledge base. Also, the unavoidable discrepancy between simulated SAR and measured SAR makes the traditional simulation method more limited for target recognition. This paper proposes an innovative SAR simulation method based on a visual language model and generative diffusion model by extracting target semantic information from optical remote sensing images and transforming it into a 3D model for SAR simulation to address the challenge of SAR target recognition under ZSL conditions. Additionally, to reduce the domain shift between the simulated domain and the measured domain, we propose a domain adaptation method based on dynamic weight domain loss and classification loss. The effectiveness of semantic information-based 3D models has been validated on the MSTAR dataset and the feasibility of the proposed framework has been validated on the self-built civilian vehicle dataset. The experimental results demonstrate that the first proposed SAR simulation method based on a visual language model and generative diffusion model can effectively improve target recognition performance under ZSL conditions.
引用
收藏
页数:22
相关论文
共 60 条
  • [11] Moving and stationary target acquisition and recognition (MSTAR) model-based automatic target recognition: Search technology for a robust ATR
    Diemunsch, J
    Wissinger, J
    [J]. ALGORITHMS FOR SYNTHETIC APERTURE RADAR IMAGERY V, 1998, 3370 : 481 - 492
  • [12] Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929
  • [13] Ganin Y, 2015, PR MACH LEARN RES, V37, P1180
  • [14] Hammer H., 2009, P IM SIGN PROC REM S, VVolume 7477, P406
  • [15] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
  • [16] Fast 3D-ISAR Image Simulation of Targets at Arbitrary Aspect Angles Through Nonuniform Fast Fourier Transform (NUFFT)
    He, Xin Yi
    Zhou, Xiao Yang
    Cui, Tie Jun
    [J]. IEEE TRANSACTIONS ON ANTENNAS AND PROPAGATION, 2012, 60 (05) : 2597 - 2602
  • [17] Hong YC, 2024, Arxiv, DOI [arXiv:2311.04400, DOI 10.48550/ARXIV.2311.04400]
  • [18] Hu EJ, 2021, Arxiv, DOI arXiv:2106.09685
  • [19] Hu Y, 2023, Arxiv, DOI arXiv:2307.15266
  • [20] Physics inspired hybrid attention for SAR target recognition
    Huang, Zhongling
    Wu, Chong
    Yao, Xiwen
    Zhao, Zhicheng
    Huang, Xiankai
    Han, Junwei
    [J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2024, 207 : 164 - 174