State-Dependent Adaptive Dynamic Programing for a Class of Continuous-Time Nonlinear Systems

被引:0
作者
Batmani, Yazdan [1 ]
Davoodi, Mohammadreza [2 ]
Meskin, Nader [2 ]
机构
[1] Univ Kurdistan, Dept Elect Engn, Sanandaj, Iran
[2] Qatar Univ, Dept Elect Engn, Doha, Qatar
来源
2016 INTERNATIONAL CONFERENCE ON CONTROL, DECISION AND INFORMATION TECHNOLOGIES (CODIT) | 2016年
关键词
Adaptive dynamic programming; Nonlinear optimal control; Reinforcement learning; SDRE technique; RICCATI EQUATION; LINEAR-SYSTEMS; FEEDBACK-CONTROL; DESIGN;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The state-dependent Riccati equation (SDRE) technique can be used to solve optimal control problems for a wide class of nonlinear dynamical systems. In this method, instead of solving a complicated Hamilton-Jacobi-Bellman (HJB) equation, a state-dependent Riccati equation is solved which leads to a suboptimal control law. However, a priori model of the system must be available to apply this technique to the optimal control problem. In this paper, to solve the SDRE without using a priori model of the system, a direct adaptive suboptimal algorithm is proposed. The algorithm, named state-dependent Riccati equation adaptive dynamic programming (SDRE-ADP), is based on a reinforcement learning approach which can be implemented in an online fashion. Like the SDRE technique, the proposed SDRE-ADP can locally asymptotically stabilize the closed-loop system provided that some conditions are satisfied. Application of the proposed algorithm to an autonomous unmanned underwater vehicle (AUV) and a numerical example shows that it can be effectively applied for nonlinear systems.
引用
收藏
页码:325 / 330
页数:6
相关论文
共 16 条
[11]   Nonlinear state observation using H∞-filtering Riccati design [J].
Reif, K ;
Sonnemann, F ;
Unbehauen, R .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1999, 44 (01) :203-208
[12]   Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem [J].
Vamvoudakis, Kyriakos G. ;
Lewis, Frank L. .
AUTOMATICA, 2010, 46 (05) :878-888
[13]  
Varrier S., 2010, TECH REP
[14]   Adaptive optimal control for continuous-time linear systems based on policy iteration [J].
Vrabie, D. ;
Pastravanu, O. ;
Abu-Khalaf, M. ;
Lewis, F. L. .
AUTOMATICA, 2009, 45 (02) :477-484
[15]   Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems [J].
Vrabie, Draguna ;
Lewis, Frank .
NEURAL NETWORKS, 2009, 22 (03) :237-246
[16]   Adaptive Dynamic Programming: An Introduction [J].
Wang, Fei-Yue ;
Zhang, Huaguang ;
Liu, Derong .
IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE, 2009, 4 (02) :39-47