Intelligent anti-jamming decision algorithm based on proximal policy optimization

被引:0
作者
Ma, Song [1 ,2 ]
Li, Li [3 ]
Li, Wei [2 ]
Huang, Wei [2 ]
Wang, Jun [2 ]
机构
[1] Southwest China Institute of Electronic Technology, Chengdu
[2] National Key Laboratory of Wireless Communications, University of Electronic Science and Technology of China, Chengdu
[3] Southwest China Research Institute of Electronic Equipment, Chengdu
来源
Tongxin Xuebao/Journal on Communications | 2024年 / 45卷 / 08期
基金
中国国家自然科学基金;
关键词
anti-jamming decision; deep reinforcement learning; intelligent anti-jamming; proximal policy optimization;
D O I
10.11959/j.issn.1000-436x.2024137
中图分类号
学科分类号
摘要
The existing intelligent anti-jamming methods based on deep reinforcement learning are applied to space-ground TT&C and communication links, in which the deep neural network used for decision-making has a complex structure, and the resources of satellites and other vehicles are limited, making it difficult to independently complete the timely training of complex neural network under the constraints of limited complexity, and the decision-making of anti-jamming cannot converge. Aiming at the above problems, an intelligent anti-jamming decision algorithm based on proximal policy optimization was proposed, which deployed the decision-making neural network and the training neural network in the vehicles and the ground station, respectively. The ground station conducted the optimal offline training based on the empirical information feedback from the vehicles, and assisted the decision-making neural network in parameter updating, thereby achieving the effective selection of anti-jamming strategies while satisfying the resource constraints of the vehicles. The simulation results demonstrate that the convergence speed of the proposed algorithm is increased by 37%, and the system capacity after convergence is increased by 25%, compared with the decision algorithms of policy gradient and deep Q-learning. © 2024 Editorial Board of Journal on Communications. All rights reserved.
引用
收藏
页码:249 / 257
页数:8
相关论文
共 24 条
[1]  
HUANG T, LIU J, WANG S, Et al., Survey of the future network technology and trend, Journal on Communications, 42, 1, pp. 130-150, (2021)
[2]  
LIU T H, WANG H Q., Thought on application of space-ground integrated information network in domestic civil aviation, Telecommunication Engineering, 58, 6, pp. 738-744, (2018)
[3]  
NIEPHAUS C, KRETSCHMER M, GHINEA G., QoS provisioning in converged satellite and terrestrial networks: a survey of the state-of-the-art, IEEE Communications Surveys & Tutorials, 18, 4, pp. 2415-2441, (2016)
[4]  
ZHANG H J, CHEN A Q, LI Y B, Et al., Key technologies of 6G mobile network, Journal on Communications, 43, 7, pp. 189-202, (2022)
[5]  
GUIDOTTI A, VANELLI-CORALLI A, CONTI M, Et al., Architectures and key technical challenges for 5G systems incorporating satellites, IEEE Transactions on Vehicular Technology, 68, 3, pp. 2624-2639, (2019)
[6]  
ZHANG L C, XU Y B, LI F H, Et al., Dynamic security-empowering architecture for space-ground integration information network, Journal on Communications, 42, 9, pp. 87-95, (2021)
[7]  
ZHU Y G, SUN Y F, YAO F Q, Et al., Channel-space endogenous anti-jamming method based on multi-reconfigurable intelligent surface, Journal on Communications, 44, 10, pp. 13-22, (2023)
[8]  
BRYAN C, MARK G, JESSE S., Winning in the gray zone: using electromagnetic warfare to regain escalation dominance, (2017)
[9]  
YAO H P, WANG L Y, WANG X D, Et al., The space-terrestrial integrated network: an overview, IEEE Communications Magazine, 56, 9, pp. 178-185, (2018)
[10]  
FENG Z B, XU Y H, DU Z Y, Et al., Active defense technology against intelligent jammer, Journal on Communications, 43, 10, pp. 42-54, (2022)