Reinforcement Learning-Based Adaptive Optimal Control for Partially Unknown Systems Using Differentiator

被引:0
作者
Guo, Xinxin [1 ]
Yan, Weisheng [1 ]
Cui, Rongxin [1 ]
机构
[1] Northwestern Polytech Univ, Sch Marine Sci & Technol, Xian 710072, Peoples R China
来源
2018 ANNUAL AMERICAN CONTROL CONFERENCE (ACC) | 2018年
基金
中国国家自然科学基金;
关键词
NONLINEAR-SYSTEMS;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
An adaptive optimal controller is designed by solving the infinite-horizon optimal control issue based on reinforcement learning (RL) technique for partially unknown systems. Since the solution to Hamilton-Jacobi-Bellman equation includes the drift dynamics, a first-order robust exact differentiator (RED) is designed to provide an approximation for the unknown drift dynamics considering the known input dynamics. To obtain the approximation of the optimal control policy and value function, an actor-critic neural network (NN) structure is built. A synchronous update algorithm based on the first-order RED and the RL technique for the two NNs. By employing Lyapunov theorem, the convergence and stability are proved for the proposed control method. Eventually, to show the performance of the proposed controller, both linear and nonlinear simulation examples are given, repectively.
引用
收藏
页码:1039 / 1044
页数:6
相关论文
共 25 条
[1]   Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach [J].
Abu-Khalaf, M ;
Lewis, FL .
AUTOMATICA, 2005, 41 (05) :779-791
[2]   Galerkin approximations of the generalized Hamilton-Jacobi-Bellman equation [J].
Beard, RW ;
Saridis, GN ;
Wen, JT .
AUTOMATICA, 1997, 33 (12) :2159-2177
[3]   Adaptive Neural Network Control of AUVs With Control Input Nonlinearities Using Reinforcement Learning [J].
Cui, Rongxin ;
Yang, Chenguang ;
Li, Yang ;
Sharma, Sanjay .
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2017, 47 (06) :1019-1029
[4]  
Degris T, 2012, P AMER CONTR CONF, P2177
[5]   Adaptive Control of a Flexible Crane System With the Boundary Output Constraint [J].
He, Wei ;
Zhang, Shuang ;
Ge, Shuzhi Sam .
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2014, 61 (08) :4126-4133
[6]   Robust adaptive boundary control of a flexible marine riser with vessel dynamics [J].
He, Wei ;
Ge, Shuzhi Sam ;
How, Bernard Voon Ee ;
Choo, Yoo Sang ;
Hong, Keum-Shik .
AUTOMATICA, 2011, 47 (04) :722-732
[7]   Efficient model-based reinforcement learning for approximate online optimal control [J].
Kamalapurkar, Rushikesh ;
Rosenfeld, Joel A. ;
Dixon, Warren E. .
AUTOMATICA, 2016, 74 :247-258
[8]   Higher-order sliding modes, differentiation and output-feedback control [J].
Levant, A .
INTERNATIONAL JOURNAL OF CONTROL, 2003, 76 (9-10) :924-941
[9]   Robust exact differentiation via sliding mode technique [J].
Levant, A .
AUTOMATICA, 1998, 34 (03) :379-384
[10]   Reinforcement Learning for Partially Observable Dynamic Processes: Adaptive Dynamic Programming Using Measured Output Data [J].
Lewis, F. L. ;
Vamvoudakis, Kyriakos G. .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2011, 41 (01) :14-25