An Efficient Parallel Reinforcement Learning Approach to Cross-Layer Defense Mechanism in Industrial Control Systems

Cited by: 21

Authors:

Zhong, Kai [1]

Yang, Zhibang [2]

Xiao, Guoqing [1]

Li, Xingpei [1]

Yang, Wangdong [1]

Li, Kenli [1]

Affiliations:

[1] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha 410082, Hunan, Peoples R China

[2] Changsha Univ, Hunan Prov Key Lab Ind Internet Technol & Secur, Changsha 410022, Hunan, Peoples R China

Source:

IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS | 2022, Vol. 33, No. 11

Keywords:

Games; Q-learning; Security; Integrated circuit modeling; Process control; Physical layer; Stochastic processes; Industrial control system (ICS); interaction; multiple attributes; parallel Q-learning; stochastic game; GAME; NETWORKS; SECURITY

DOI:

10.1109/TPDS.2021.3135412

Chinese Library Classification (CLC) number:

TP301 [Theory, Methods]

Subject classification code:

081202

Abstract:

Ongoing digitalization enables stable control processes and smooth operation of Industrial Control Systems (ICSs). A direct consequence of the highly interconnected architecture of ICSs, however, is greater cyber vulnerability and a growing number of cyber security threats. Numerous studies have addressed the security of ICSs, but most face two challenges. First, the interaction between the cyber layer and the physical layer of ICSs may result in incorrect attack response strategies. Second, ICSs are real-time systems, but existing defense decision algorithms based on game theory or reinforcement learning have high computational complexity, which prevents them from making decisions quickly. In this paper, we design a new multi-attribute-based method for quantifying rewards and propose a multi-attribute-based Q-learning algorithm to resolve the interaction problem. In addition, to overcome slow convergence, we develop an effective parallel Q-learning (PQL) algorithm to quickly find the optimal strategy. The experimental results show the effectiveness of the PQL algorithm. Compared with the Q-learning (QL) algorithm and the deep Q-network (DQN) algorithm, our proposed solution reduces the average completion time by 12.5 to 37 percent.


Pages: 2979-2990

Page count: 12

