UAV Networks Against Multiple Maneuvering Smart Jamming With Knowledge-Based Reinforcement Learning

被引:26
|
作者
Li, Zhiwei [1 ]
Lu, Yu [2 ]
Li, Xi [2 ]
Wang, Zengguang [3 ]
Qiao, Wenxin [2 ]
Liu, Yicen [2 ]
机构
[1] Army Engn Univ, UAV Engn Dept, Shijiazhuang Campus, Shijiazhuang 050003, Hebei, Peoples R China
[2] Aircraft Maintenance Ctr, Shijiazhuang Campus, Yongji 044500, Peoples R China
[3] Natl Def Univ, Shijiazhuang 050000, Hebei, Peoples R China
来源
IEEE INTERNET OF THINGS JOURNAL | 2021年 / 8卷 / 15期
关键词
Jamming; Interference; Reinforcement learning; Games; Receivers; Signal to noise ratio; Convergence; Anti-jamming; domain knowledge; reinforcement learning (RL); unmanned aerial vehicle (UAV) networks; STACKELBERG GAME; TRANSMISSION;
D O I
10.1109/JIOT.2021.3062659
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The unmanned aerial vehicles (UAVs) networks are very vulnerable to smart jammers that can choose their jamming strategy based on the ongoing channel state accordingly. Although reinforcement learning (RL) algorithms can give UAV networks the ability to make intelligent decisions, the high-dimensional state space makes it difficult for algorithms to converge quickly. This article proposes a knowledge-based RL method, which uses domain knowledge to compress the state space that the agent needs to explore and then improve the algorithm convergence speed. Specifically, we use the inertial law of the aircraft and the law of signal attenuation in free space to guide the highly efficient exploration of the UAVs in the state space. We incorporate the performance indicators of the receiver and the subjective value of the task into the design of the reward function, and build a virtual environment for pretraining to accelerate the convergence of anti-jamming decisions. In addition, the algorithm proposed is completely based on observable data, which is more realistic than those studies that assume the position or the channel strategy of the jammer. The simulation shows that the proposed algorithm can outperform the benchmarks of model-free RL algorithm in terms of converge speed and averaged reward.
引用
收藏
页码:12289 / 12310
页数:22
相关论文
共 50 条
  • [21] Opponent-Awareness-Based Anti-Intelligent Jamming Channel Access Scheme: A Deep Reinforcement Learning Perspective
    Yuan, Hongcheng
    Chen, Jin
    Li, Wen
    Li, Guoxin
    Han, Hao
    Chen, Taoyi
    Gu, Fanglin
    Xu, Yuhua
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (07) : 11202 - 11216
  • [22] Learning-Based Resource Management Optimization for UAV-Assisted MEC Against Jamming
    Liu, Shuai
    Yang, Helin
    Xiao, Liang
    Zheng, Mengting
    Lu, Huabing
    Xiong, Zehui
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2024, 72 (08) : 4873 - 4886
  • [23] Reinforcement learning based a non-zero-sum game for secure transmission against smart jamming
    Zhao, Chenyu
    Wang, Qing
    Liu, Xiaofeng
    Li, Chun
    Shi, Lidong
    DIGITAL SIGNAL PROCESSING, 2021, 112
  • [24] Deep Reinforcement Learning-Based Anti-Jamming Algorithm Using Dual Action Network
    Li, Xiangchen
    Chen, Jienan
    Ling, Xiang
    Wu, Tingyong
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2023, 22 (07) : 4625 - 4637
  • [25] Playing a Strategy Game with Knowledge-Based Reinforcement Learning
    Voss V.
    Nechepurenko L.
    Schaefer R.
    Bauer S.
    SN Computer Science, 2020, 1 (2)
  • [26] Adaptive Reinforcement Learning Framework for NOMA-UAV Networks
    Mahmud, Syed Khurram
    Liu, Yuanwei
    Chen, Yue
    Chai, Kok Keong
    IEEE COMMUNICATIONS LETTERS, 2021, 25 (09) : 2943 - 2947
  • [27] Jamming Games in Underwater Sensor Networks with Reinforcement Learning
    Xiao, Liang
    Li, Qiangda
    Chen, Tianhua
    Cheng, En
    Dai, Huaiyu
    2015 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2015,
  • [28] A smart reactive jamming approach to counter reinforcement learning-based antijamming strategies in GEO SATCOM scenario
    Arif, Shahzad
    Hashmi, Ali Javed
    Khan, Waseem
    Kausar, Rizwana
    INTERNATIONAL JOURNAL OF SATELLITE COMMUNICATIONS AND NETWORKING, 2022, 40 (02) : 96 - 119
  • [29] Power control with reinforcement learning in cooperative cognitive radio networks against jamming
    Xiao, Liang
    Li, Yan
    Liu, Jinliang
    Zhao, Yifeng
    JOURNAL OF SUPERCOMPUTING, 2015, 71 (09): : 3237 - 3257
  • [30] Power control with reinforcement learning in cooperative cognitive radio networks against jamming
    Liang Xiao
    Yan Li
    Jinliang Liu
    Yifeng Zhao
    The Journal of Supercomputing, 2015, 71 : 3237 - 3257