An Efficient Parallel Reinforcement Learning Approach to Cross-Layer Defense Mechanism in Industrial Control Systems

被引：21

作者：

Zhong, Kai ^{[1
]}

Yang, Zhibang ^{[2
]}

Xiao, Guoqing ^{[1
]}

Li, Xingpei ^{[1
]}

Yang, Wangdong ^{[1
]}

Li, Kenli ^{[1
]}

机构：

[1] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha 410082, Hunan, Peoples R China

[2] Changsha Univ, Hunan Prov Key Lab Ind Internet Technol & Secur, Changsha 410022, Hunan, Peoples R China

来源：

IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS | 2022年 / 33卷 / 11期

关键词：

Games; Q-learning; Security; Integrated circuit modeling; Process control; Physical layer; Stochastic processes; Industrial control system (ICS); interaction; multiple attributes; parallel q-learning; stochastic game; GAME; NETWORKS; SECURITY;

D O I：

10.1109/TPDS.2021.3135412

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

The ongoing digitalization enables stable control processes and smooth operations of Industrial Control Systems (ICSs). A direct consequence of the highly interconnected architecture of ICSs is the introduced cyber vulnerability and increasing cyber security threats to ICSs. Numerous researches pay attention to the security problem of ICSs. However, most current studies face two challenges. First, the interaction problem between the cyber layer and the physical layer of ICSs may result in incorrect attack response strategies. Second, ICSs are real-time systems, but existing defense decision algorithms based on game theory or reinforcement learning techniques have high computational complexity, which prevents them from making decisions quickly. In this paper, we design a new multi-attribute based method for quantifying rewards and propose a multi-attribute based Q-learning algorithm to resolve the interaction problem. In addition, to overcome the limitation of slow convergence, we develop an effective parallel Q-learning (PQL) algorithm to quickly find the optimal strategy. The experimental results show the effectiveness of the PQL algorithm. Compared with the Q-learning algorithm (QL) and the deep Q-network (DQN) algorithm, our proposed solution can reduce the average completion time by 12.5 to 37 percent.

引用

页码：2979 / 2990

页数：12

共 43 条

[41] Elastic O-RAN Slicing for Industrial Monitoring and Control: A Distributed Matching Game and Deep Reinforcement Learning Approach
Abedin, Sarder Fakhrul
Mahmood, Aamir
Tran, Nguyen H.
Han, Zhu
Gidlund, Mikael
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (10) : 10808 - 10822
[42] Energy Efficient 3-D UAV Control for Persistent Communication Service and Fairness: A Deep Reinforcement Learning Approach
Qi, Hang
Hu, Zhiqun
Huang, Hao
Wen, Xiangming
Lu, Zhaoming
IEEE ACCESS, 2020, 8 : 53172 - 53184
[43] Novel Reinforcement Learning based Power Control and Subchannel Selection Mechanism for Grant-Free NOMA URLLC-Enabled Systems
Tran, Duc-Dung
Ha, Vu Nguyen
Chatzinotas, Symeon
2022 IEEE 95TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2022-SPRING), 2022,

← 1 2 3 4 5 →