An Efficient Parallel Reinforcement Learning Approach to Cross-Layer Defense Mechanism in Industrial Control Systems

Cited by: 21
Authors
Zhong, Kai [1]
Yang, Zhibang [2]
Xiao, Guoqing [1]
Li, Xingpei [1]
Yang, Wangdong [1]
Li, Kenli [1]
Affiliations
[1] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha 410082, Hunan, Peoples R China
[2] Changsha Univ, Hunan Prov Key Lab Ind Internet Technol & Secur, Changsha 410022, Hunan, Peoples R China
Keywords
Games; Q-learning; Security; Integrated circuit modeling; Process control; Physical layer; Stochastic processes; Industrial control system (ICS); interaction; multiple attributes; parallel Q-learning; stochastic game; GAME; NETWORKS; SECURITY
DOI
10.1109/TPDS.2021.3135412
Chinese Library Classification
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Ongoing digitalization enables stable control processes and smooth operation of Industrial Control Systems (ICSs). A direct consequence of the highly interconnected architecture of ICSs, however, is greater cyber vulnerability and a growing number of cyber security threats. Numerous studies address the security of ICSs, but most face two challenges. First, the interaction between the cyber layer and the physical layer of ICSs may result in incorrect attack response strategies. Second, ICSs are real-time systems, but existing defense decision algorithms based on game theory or reinforcement learning have high computational complexity, which prevents them from making decisions quickly. In this paper, we design a new multi-attribute based method for quantifying rewards and propose a multi-attribute based Q-learning algorithm to resolve the interaction problem. In addition, to overcome the limitation of slow convergence, we develop an effective parallel Q-learning (PQL) algorithm to quickly find the optimal strategy. The experimental results show the effectiveness of the PQL algorithm. Compared with the Q-learning (QL) algorithm and the deep Q-network (DQN) algorithm, our proposed solution reduces the average completion time by 12.5 to 37 percent.
Pages: 2979-2990
Page count: 12