An Efficient Parallel Reinforcement Learning Approach to Cross-Layer Defense Mechanism in Industrial Control Systems

被引:21
|
作者
Zhong, Kai [1 ]
Yang, Zhibang [2 ]
Xiao, Guoqing [1 ]
Li, Xingpei [1 ]
Yang, Wangdong [1 ]
Li, Kenli [1 ]
机构
[1] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha 410082, Hunan, Peoples R China
[2] Changsha Univ, Hunan Prov Key Lab Ind Internet Technol & Secur, Changsha 410022, Hunan, Peoples R China
关键词
Games; Q-learning; Security; Integrated circuit modeling; Process control; Physical layer; Stochastic processes; Industrial control system (ICS); interaction; multiple attributes; parallel q-learning; stochastic game; GAME; NETWORKS; SECURITY;
D O I
10.1109/TPDS.2021.3135412
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The ongoing digitalization enables stable control processes and smooth operations of Industrial Control Systems (ICSs). A direct consequence of the highly interconnected architecture of ICSs is the introduced cyber vulnerability and increasing cyber security threats to ICSs. Numerous researches pay attention to the security problem of ICSs. However, most current studies face two challenges. First, the interaction problem between the cyber layer and the physical layer of ICSs may result in incorrect attack response strategies. Second, ICSs are real-time systems, but existing defense decision algorithms based on game theory or reinforcement learning techniques have high computational complexity, which prevents them from making decisions quickly. In this paper, we design a new multi-attribute based method for quantifying rewards and propose a multi-attribute based Q-learning algorithm to resolve the interaction problem. In addition, to overcome the limitation of slow convergence, we develop an effective parallel Q-learning (PQL) algorithm to quickly find the optimal strategy. The experimental results show the effectiveness of the PQL algorithm. Compared with the Q-learning algorithm (QL) and the deep Q-network (DQN) algorithm, our proposed solution can reduce the average completion time by 12.5 to 37 percent.
引用
收藏
页码:2979 / 2990
页数:12
相关论文
共 43 条
  • [41] Elastic O-RAN Slicing for Industrial Monitoring and Control: A Distributed Matching Game and Deep Reinforcement Learning Approach
    Abedin, Sarder Fakhrul
    Mahmood, Aamir
    Tran, Nguyen H.
    Han, Zhu
    Gidlund, Mikael
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (10) : 10808 - 10822
  • [42] Energy Efficient 3-D UAV Control for Persistent Communication Service and Fairness: A Deep Reinforcement Learning Approach
    Qi, Hang
    Hu, Zhiqun
    Huang, Hao
    Wen, Xiangming
    Lu, Zhaoming
    IEEE ACCESS, 2020, 8 : 53172 - 53184
  • [43] Novel Reinforcement Learning based Power Control and Subchannel Selection Mechanism for Grant-Free NOMA URLLC-Enabled Systems
    Tran, Duc-Dung
    Ha, Vu Nguyen
    Chatzinotas, Symeon
    2022 IEEE 95TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2022-SPRING), 2022,