Continual Deep Reinforcement Learning to Prevent Catastrophic Forgetting in Jamming Mitigation

Cited by: 1
Authors
Davaslioglu, Kemal [1 ]
Kompella, Sastry [1 ]
Erpek, Tugba [1 ]
Sagduyu, Yalin E. [1 ]
Affiliations
[1] Nexcepta, Gaithersburg, MD 20878 USA
Source
MILCOM 2024 - 2024 IEEE Military Communications Conference (MILCOM) | 2024
Keywords
Anti-jamming; reinforcement learning; deep learning; catastrophic forgetting; continual learning
DOI
10.1109/MILCOM61039.2024.10773861
Chinese Library Classification
TP [Automation technology; computer technology]
Discipline code
0812
Abstract
Deep Reinforcement Learning (DRL) has been highly effective in learning from and adapting to RF environments, and thus in detecting and mitigating jamming effects to facilitate reliable wireless communications. However, traditional DRL methods are susceptible to catastrophic forgetting (namely, forgetting old tasks when learning new ones), especially in dynamic wireless environments where jammer patterns change over time. This paper considers an anti-jamming system and addresses the challenge of catastrophic forgetting in DRL applied to jammer detection and mitigation. First, we demonstrate the impact of catastrophic forgetting when DRL is applied to jammer detection and mitigation tasks: the network forgets previously learned jammer patterns while adapting to new ones. This catastrophic interference undermines the effectiveness of the system, particularly in non-stationary environments. We present a method that enables the network to retain knowledge of old jammer patterns while learning to handle new ones. Our approach substantially reduces catastrophic forgetting, allowing the anti-jamming system to learn new tasks without compromising its ability to perform previously learned tasks. Furthermore, we introduce a systematic methodology for sequentially learning tasks in the anti-jamming framework. By leveraging continual DRL techniques based on PackNet, we achieve superior anti-jamming performance compared to standard DRL methods. Our proposed approach not only addresses catastrophic forgetting but also enhances the adaptability and robustness of the system in dynamic jamming environments. We demonstrate the efficacy of our method in preserving knowledge of past jammer patterns, learning new tasks efficiently, and outperforming traditional DRL approaches.
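The PackNet technique the abstract builds on works by iterative pruning and weight freezing: after training on one jammer pattern (task), the largest-magnitude free weights are frozen for that task, and only the remaining free weights are updated for the next one. The following is a minimal, hypothetical sketch of that bookkeeping for a single weight matrix; the class name, `keep_frac` parameter, and random initialization are illustrative assumptions, not details from the paper.

```python
import numpy as np

class PackNetLayer:
    """Sketch of PackNet-style parameter packing for one weight matrix
    (illustrative only). After each task, the top free weights by
    magnitude are frozen for that task; the rest are pruned and left
    free to be relearned on the next task."""

    def __init__(self, shape, seed=0):
        rng = np.random.default_rng(seed)
        self.w = rng.standard_normal(shape)
        # owner[i, j] = 0 means free; k > 0 means frozen for task k.
        self.owner = np.zeros(shape, dtype=int)

    def trainable_mask(self):
        # Only weights not yet claimed by a previous task may be updated.
        return self.owner == 0

    def commit_task(self, task_id, keep_frac=0.5):
        # Freeze the top keep_frac of the free weights (by magnitude).
        free = self.trainable_mask()
        mags = np.abs(self.w[free])
        k = max(1, int(keep_frac * mags.size))
        thresh = np.sort(mags)[-k]  # k-th largest free magnitude
        claim = free & (np.abs(self.w) >= thresh)
        self.owner[claim] = task_id
        # Prune the remaining free weights; task (task_id + 1) retrains them.
        self.w[free & ~claim] = 0.0

    def weights_for_task(self, task_id):
        # Inference for task k uses weights owned by tasks 1..k only;
        # weights frozen for later tasks are masked out.
        mask = (self.owner >= 1) & (self.owner <= task_id)
        return np.where(mask, self.w, 0.0)
```

In a DRL anti-jamming setting, each jammer pattern would be one task: the agent trains its policy network on pattern 1, calls `commit_task(1)`, then trains only the still-free weights on pattern 2, so the task-1 policy remains exactly recoverable via `weights_for_task(1)`.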
Pages: 740-745 (6 pages)