A Reinforcement Learning-Based Control Approach for Unknown Nonlinear Systems with Persistent Adversarial Inputs

Cited by: 1

Authors:

Zhong, Xiangnan [1]

He, Haibo [2]

Affiliations:

[1] Florida Atlantic Univ, Dept Comp & Elect Engn & Comp Sci, Boca Raton, FL 33431 USA

[2] Univ Rhode Isl, Dept Elect Comp & Biomed Engn, Kingston, RI 02881 USA

Source:

2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2021

Funding:

National Science Foundation (USA);

Keywords:

Reinforcement learning; zero-sum games; neural networks; observer; online learning and control; TRACKING CONTROL; GAME; ADP; GO;

DOI:

10.1109/IJCNN52387.2021.9534429

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper develops an intelligent control method based on reinforcement learning techniques for unknown nonlinear continuous-time systems in an adversarial environment. The developed method automatically learns the optimal control input for the system and also predicts the worst-case input that an adversary can introduce. In addition, we assume that the agent can observe only partial information about the environment during the learning process. Therefore, a neural network-based observer is developed to adaptively reconstruct the hidden states and dynamics. Theoretical analysis is then provided to show the stability of the developed intelligent control and the accuracy of the established observer. The method has been applied to a torsional pendulum system, and the results demonstrate the effectiveness of the designed approach.

Pages: 8
