Security State Estimation for Cyber-Physical Systems against DoS Attacks via Reinforcement Learning and Game Theory

被引:11
作者
Jin, Zengwang [1 ,2 ]
Zhang, Shuting [1 ]
Hu, Yanyan [3 ]
Zhang, Yanning [2 ]
Sun, Changyin [4 ]
机构
[1] Northwestern Polytech Univ, Sch Cybersecur, Xian 710072, Peoples R China
[2] Northwestern Polytech Univ, Natl Engn Lab Integrated AeroSp Ground Ocean Big, Xian 710072, Peoples R China
[3] Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Beijing 100083, Peoples R China
[4] Southeast Univ, Sch Automat, Nanjing 210096, Peoples R China
基金
中国国家自然科学基金;
关键词
cyber-physical system; security estimation; DoS attack; reinforcement learning; Nash equilibrium; MITIGATION;
D O I
10.3390/act11070192
中图分类号
TH [机械、仪表工业];
学科分类号
0802 ;
摘要
This paper addressed the optimal policy selection problem of attacker and sensor in cyber-physical systems (CPSs) under denial of service (DoS) attacks. Since the sensor and the attacker have opposite goals, a two-player zero-sum game is introduced to describe the game between the sensor and the attacker, and the Nash equilibrium strategies are studied to obtain the optimal actions. In order to effectively evaluate and quantify the gains, a reinforcement learning algorithm is proposed to dynamically adjust the corresponding strategies. Furthermore, security state estimation is introduced to evaluate the impact of offensive and defensive strategies on CPSs. In the algorithm, the epsilon-greedy policy is improved to make optimal choices based on sufficient learning, achieving a balance of exploration and exploitation. It is worth noting that the channel reliability factor is considered in order to study CPSs with multiple reasons for packet loss. The reinforcement learning algorithm is designed in two scenarios: reliable channel (that is, the reason for packet loss is only DoS attacks) and unreliable channel (the reason for packet loss is not entirely from DoS attacks). The simulation results of the two scenarios show that the proposed reinforcement learning algorithm can quickly converge to the Nash equilibrium policies of both sides, proving the availability and effectiveness of the algorithm.
引用
收藏
页数:19
相关论文
共 41 条
[1]   Online-Learning-Based Defense Against Jamming Attacks in Multichannel Wireless CPS [J].
Alipour-Fanid, Amir ;
Dabaghchian, Monireh ;
Wang, Ning ;
Jiao, Long ;
Zeng, Kai .
IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (17) :13278-13290
[2]   Resilient Cyber-Security Approach For Aviation Cyber-Physical Systems Protection Against Sensor Spoofing Attacks [J].
Alsulami, Abdulaziz A. ;
Zein-Sabatto, Saleh .
2021 IEEE 11TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2021, :565-571
[3]  
Anderson BD., 2012, Optimal filtering
[4]   Online Detection of Stealthy False Data Injection Attacks in Power System State Estimation [J].
Ashok, Aditya ;
Govindarasu, Manimaran ;
Ajjarapu, Venkataramana .
IEEE TRANSACTIONS ON SMART GRID, 2018, 9 (03) :1636-1646
[5]   Secure Planning Against Stealthy Attacks via Model-Free Reinforcement Learning [J].
Bozkurt, Alper Kamil ;
Wang, Yu ;
Pajic, Miroslav .
2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, :10656-10662
[6]   Design of False Data Injection Attack on Distributed Process Estimation [J].
Choraria, Moulik ;
Chattopadhyay, Arpan ;
Mitra, Urbashi ;
Strom, Erik G. .
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2022, 17 :670-683
[7]   A reputation score policy and Bayesian game theory based incentivized mechanism for DDoS attacks mitigation and cyber defense [J].
Dahiya, Amrita ;
Gupta, Brij B. .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2021, 117 :193-204
[8]   Distributed Reinforcement Learning for Cyber-Physical System With Multiple Remote State Estimation Under DoS Attacker [J].
Dai, Pengcheng ;
Yu, Wenwu ;
Wang, He ;
Wen, Guanghui ;
Lv, Yuezu .
IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2020, 7 (04) :3212-3222
[9]   A systems and control perspective of CPS security [J].
Dibaji, Seyed Mehran ;
Pirani, Mohammad ;
Flamholz, David Bezalel ;
Annaswamy, Anuradha M. ;
Johansson, Karl Henrik ;
Chakrabortty, Aranya .
ANNUAL REVIEWS IN CONTROL, 2019, 47 :394-411
[10]   Secure State Estimation and Control of Cyber-Physical Systems: A Survey [J].
Ding, Derui ;
Han, Qing-Long ;
Ge, Xiaohua ;
Wang, Jun .
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2021, 51 (01) :176-190