Research on decision-making modeling of cognitive jamming for multi-functional radar based on Markov

被引:0
|
作者
Zhu B. [1 ,2 ]
Zhu W. [2 ]
Li W. [3 ]
Yang Y. [3 ]
Gao T. [3 ]
机构
[1] Department of Electronic and Optical Engineering, Space Engineering University, Beijing
[2] State Key Lab. of Complex Electromagnetic Environment Effects on Electronics and Information System, Luoyang
[3] Graduate School, Space Engineering University, Beijing
关键词
Markov decision process; Q-learning; radar confrontation; radar state; reinforcement learning;
D O I
10.12305/j.issn.1001-506X.2022.08.13
中图分类号
学科分类号
摘要
Multi-functional radar is an indispensable and important equipment in modern electromagnetic battlefield. The interference of multi-functional radar is always a difficult problem. In this paper, based on the study of the characteristics of multi-functional radar signal and the radar countermeasure process, the method of joint representation of radar state is proposed, and the interference problem of multi-functional radar is modeled as a Markov decision process with benefits. The cognitive interference decision system is designed. The interference strategy is solved by the cognitive interference decision algorithm based on Q-learning. Through the simulation experiment, it is proved that the cognitive interference decision algorithm based on Q-learning can learn the optimal interference strategy in the absence of prior experience, have the characteristic of "cognition", and have strong adaptability in the unstable environment, which effectively supports the interference decision model mentioned above. © 2022 Chinese Institute of Electronics. All rights reserved.
引用
收藏
页码:2488 / 2497
页数:9
相关论文
共 32 条
  • [1] WANG S F, LI Y, XU M, Et al., Principle and technology of cognitive electronic warfare, pp. 10-11, (2018)
  • [2] GONG L L, LUO J Q, WU S L., A radar emitter identification method based on pulse match template sequence, Proc. of the IEEE 2nd International Conference on Signal Processing Systems, 3, pp. 153-156, (2010)
  • [3] ZHOU M C., Research on radar jamming decision technology based on game theory, pp. 29-32, (2014)
  • [4] BACHMANN D J, EVANS R J, MORAN B, Game theoretic analysis of adaptive radar jamming, IEEE Trans.on Aerospace and Electronic Systems, 47, 2, pp. 1081-1100, (2011)
  • [5] HE B, SU H T., Joint power allocation and beamforming between a multistatic radar and jammer based on game theory, Proc. of the IEEE 11th International Conference on Communication Software and Networks, pp. 337-341, (2019)
  • [6] SUN H W, TONG N N, SUN F J, Jamming design selection based on D-S theory, Journal of Projectiles, Rockets, Missiles and Guidance, pp. 218-220, (2003)
  • [7] TANG Z, GAO X G, Research on the self-defence electronic jamming decision-making based on the discrete dynamic Bayesian network, Journal of Systems Engineering and Electronics, 19, 4, pp. 702-708, (2008)
  • [8] SILVER D, SCHRITTWIESER J L, SIMONYAN K, Et al., Mastering the game of go without human knowledge, Nature, 550, 7676, pp. 354-359, (2017)
  • [9] YOO J, JANG D, KIM H J, Et al., Hybrid reinforcement learning control for a micro quadrotor flight, IEEE Control Systems Letters, 5, 2, pp. 505-510, (2020)
  • [10] GUO X X, YAN W S, CUI R X, Reinforcement learning-based nearly optimal control for constrained-input partially unknown systems using differentiator, IEEE Trans.on Neural Networks and Learning Systems, 31, 11, pp. 4713-4725, (2019)