Observer-Based Optimal Backstepping Security Control for Nonlinear Systems Using Reinforcement Learning Strategy

被引:3
作者
Wei, Qinglai [1 ,2 ,3 ]
Chen, Wendi [1 ,2 ,3 ]
Tan, Xiangmin [4 ]
Xiao, Jun [3 ]
Dong, Qi [5 ]
机构
[1] Macau Univ Sci & Technol, Inst Syst Engn, Macau, Peoples R China
[2] Chinese Acad Sci, Inst Automat, State Key Lab Multimodal Artificial Intelligence, Beijing 100190, Peoples R China
[3] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
[4] Chinese Acad Sci, Inst Engn Thermophys, Beijing 100190, Peoples R China
[5] China Acad Elect & Informat Technol, Grp Elect & Informat, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
Nonlinear systems; Security; Backstepping; Optimal control; Control systems; Vectors; Observers; Deception attacks; improved state observer; nonlinear systems; optimized backstepping (OB); reinforcement learning (RL); STATE;
D O I
10.1109/TCYB.2024.3443522
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article considers an observer-based optimal backstepping security control for nonlinear systems using reinforcement learning (RL) strategy. The main challenge faced is the design of optimal contoller under the deception attacks. Therefore, this article introduces an improved security RL algorithm based on neural network technology under the design framework of critic-actor to resist attacks and optimize the entire system. Second, compared with some existing results, how to relax the general assumption about deception attack is also a difficult research topic. In this article, an unusual observer that uses the attacked system output is designed to estimate the real unavailable states caused by deception attacks, so that the impact of deception attacks is eliminated and the output feedback control is also achieved. By selecting the virtual controllers and the real controller as corresponding optimized controllers within the framework of the RL algorithm, the control strategy can ensure that all signals in the closed-loop system are semi-globally ultimately bounded. Finally, two simulation experiments will be run to demonstrate the effectiveness of the strategy.
引用
收藏
页码:7011 / 7023
页数:13
相关论文
共 42 条
[1]   Event-Triggered Extended Dissipative FTB for T-S Fuzzy Switched Systems With Mismatched Phenomena and Deception Attacks: A Multidomain Framework [J].
Chen, Haiyang ;
Zong, Guangdeng ;
Su, Shun-Feng ;
Gao, Fangzheng .
IEEE TRANSACTIONS ON CYBERNETICS, 2024, 54 (09) :4928-4938
[2]  
Chen W., 2024, J. Autom. Intell., V3, P34
[3]   Adaptive Event-Triggered Control for Non-Strict Feedback Nonlinear CPSs With Time Delays Against Deception Attacks and Actuator Faults [J].
Chen, Wen-Di ;
Niu, Ben ;
Wang, Huan-Qing ;
Li, Hai-Tao ;
Wang, Ding .
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, 21 (03) :4106-4116
[4]   Nussbaum-Based Adaptive Fault-Tolerant Control for Nonlinear CPSs With Deception Attacks: A New Coordinate Transformation Technology [J].
Chen, Wen-Di ;
Li, Yuan-Xin ;
Liu, Lei ;
Zhao, Xu-Dong ;
Niu, Ben ;
Han, Li-Min .
IEEE TRANSACTIONS ON CYBERNETICS, 2024, 54 (02) :1212-1222
[5]  
Cheng HJ, 2023, IEEE T AFFECT COMPUT, V14, P3149, DOI [10.1109/TAFFC.2023.3265653, 10.1109/TII.2022.3232768]
[6]   A survey on security control and attack detection for industrial cyber-physical systems [J].
Ding, Derui ;
Han, Qing-Long ;
Xiang, Yang ;
Ge, Xiaohua ;
Zhang, Xian-Ming .
NEUROCOMPUTING, 2018, 275 :1674-1683
[7]   Adaptive neural network control of nonlinear systems by state and output feedback [J].
Ge, SS ;
Hang, CC ;
Zhang, T .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 1999, 29 (06) :818-828
[8]   Optimal Linear Cyber-Attack on Remote State Estimation [J].
Guo, Ziyang ;
Shi, Dawei ;
Johansson, Karl Henrik ;
Shi, Ling .
IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2017, 4 (01) :4-13
[9]   Adaptive sliding control for single-link flexible-joint robot with mismatched uncertainties [J].
Huang, AC ;
Chen, YC .
IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2004, 12 (05) :770-775
[10]   A Novel Parallel Control Method for Optimal Consensus of Nonlinear Multiagent Systems [J].
Jiao, Shanshan ;
Wei, Qinglai ;
Wang, Fei-Yue .
IEEE TRANSACTIONS ON CYBERNETICS, 2024, 54 (10) :5912-5925