UAV Networks Against Multiple Maneuvering Smart Jamming With Knowledge-Based Reinforcement Learning

被引：26

作者：

Li, Zhiwei ^{[1
]}

Lu, Yu ^{[2
]}

Li, Xi ^{[2
]}

Wang, Zengguang ^{[3
]}

Qiao, Wenxin ^{[2
]}

Liu, Yicen ^{[2
]}

机构：

[1] Army Engn Univ, UAV Engn Dept, Shijiazhuang Campus, Shijiazhuang 050003, Hebei, Peoples R China

[2] Aircraft Maintenance Ctr, Shijiazhuang Campus, Yongji 044500, Peoples R China

[3] Natl Def Univ, Shijiazhuang 050000, Hebei, Peoples R China

来源：

IEEE INTERNET OF THINGS JOURNAL | 2021年 / 8卷 / 15期

关键词：

Jamming; Interference; Reinforcement learning; Games; Receivers; Signal to noise ratio; Convergence; Anti-jamming; domain knowledge; reinforcement learning (RL); unmanned aerial vehicle (UAV) networks; STACKELBERG GAME; TRANSMISSION;

D O I：

10.1109/JIOT.2021.3062659

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The unmanned aerial vehicles (UAVs) networks are very vulnerable to smart jammers that can choose their jamming strategy based on the ongoing channel state accordingly. Although reinforcement learning (RL) algorithms can give UAV networks the ability to make intelligent decisions, the high-dimensional state space makes it difficult for algorithms to converge quickly. This article proposes a knowledge-based RL method, which uses domain knowledge to compress the state space that the agent needs to explore and then improve the algorithm convergence speed. Specifically, we use the inertial law of the aircraft and the law of signal attenuation in free space to guide the highly efficient exploration of the UAVs in the state space. We incorporate the performance indicators of the receiver and the subjective value of the task into the design of the reward function, and build a virtual environment for pretraining to accelerate the convergence of anti-jamming decisions. In addition, the algorithm proposed is completely based on observable data, which is more realistic than those studies that assume the position or the channel strategy of the jammer. The simulation shows that the proposed algorithm can outperform the benchmarks of model-free RL algorithm in terms of converge speed and averaged reward.

引用

页码：12289 / 12310

页数：22

共 50 条

[21] Opponent-Awareness-Based Anti-Intelligent Jamming Channel Access Scheme: A Deep Reinforcement Learning Perspective
Yuan, Hongcheng
Chen, Jin
Li, Wen
Li, Guoxin
Han, Hao
Chen, Taoyi
Gu, Fanglin
Xu, Yuhua
IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (07) : 11202 - 11216
[22] Learning-Based Resource Management Optimization for UAV-Assisted MEC Against Jamming
Liu, Shuai
Yang, Helin
Xiao, Liang
Zheng, Mengting
Lu, Huabing
Xiong, Zehui
IEEE TRANSACTIONS ON COMMUNICATIONS, 2024, 72 (08) : 4873 - 4886
[23] Reinforcement learning based a non-zero-sum game for secure transmission against smart jamming
Zhao, Chenyu
Wang, Qing
Liu, Xiaofeng
Li, Chun
Shi, Lidong
DIGITAL SIGNAL PROCESSING, 2021, 112
[24] Deep Reinforcement Learning-Based Anti-Jamming Algorithm Using Dual Action Network
Li, Xiangchen
Chen, Jienan
Ling, Xiang
Wu, Tingyong
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2023, 22 (07) : 4625 - 4637
[25] Playing a Strategy Game with Knowledge-Based Reinforcement Learning
Voss V.
Nechepurenko L.
Schaefer R.
Bauer S.
SN Computer Science, 2020, 1 (2)
[26] Adaptive Reinforcement Learning Framework for NOMA-UAV Networks
Mahmud, Syed Khurram
Liu, Yuanwei
Chen, Yue
Chai, Kok Keong
IEEE COMMUNICATIONS LETTERS, 2021, 25 (09) : 2943 - 2947
[27] Jamming Games in Underwater Sensor Networks with Reinforcement Learning
Xiao, Liang
Li, Qiangda
Chen, Tianhua
Cheng, En
Dai, Huaiyu
2015 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2015,
[28] A smart reactive jamming approach to counter reinforcement learning-based antijamming strategies in GEO SATCOM scenario
Arif, Shahzad
Hashmi, Ali Javed
Khan, Waseem
Kausar, Rizwana
INTERNATIONAL JOURNAL OF SATELLITE COMMUNICATIONS AND NETWORKING, 2022, 40 (02) : 96 - 119
[29] Power control with reinforcement learning in cooperative cognitive radio networks against jamming
Xiao, Liang
Li, Yan
Liu, Jinliang
Zhao, Yifeng
JOURNAL OF SUPERCOMPUTING, 2015, 71 (09): : 3237 - 3257
[30] Power control with reinforcement learning in cooperative cognitive radio networks against jamming
Liang Xiao
Yan Li
Jinliang Liu
Yifeng Zhao
The Journal of Supercomputing, 2015, 71 : 3237 - 3257

← 1 2 3 4 5 →