Approximate optimal adaptive control for weakly coupled nonlinear systems: A neuro-inspired approach

被引:4
作者
Carrillo, Luis Rodolfo Garcia [1 ]
Vamvoudakis, Kyriakos G. [2 ]
Hespanha, Joao Pedro [2 ]
机构
[1] Univ Nevada, Dept Elect & Biomed Engn, Reno, NV 89557 USA
[2] Univ Calif Santa Barbara, Ctr Control Dynam Syst & Computat CCDC, Santa Barbara, CA 93106 USA
关键词
weakly coupled systems; large-scale systems; adaptive control; approximate dynamic programming; asymptotic stability; optimal control; reinforcement learning; COMPOSITE CONTROL; LINEAR-SYSTEMS; CONVERGENCE; DESIGN;
D O I
10.1002/acs.2631
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposes a new approximate dynamic programming algorithm to solve the infinite-horizon optimal control problem for weakly coupled nonlinear systems. The algorithm is implemented as a three-critic/four-actor approximators structure, where the critic approximators are used to learn the optimal costs, while the actor approximators are used to learn the optimal control policies. Simultaneous continuous-time adaptation of both critic and actor approximators is implemented, a method commonly known as synchronous policy iteration. The adaptive control nature of the algorithm requires a persistence of excitation condition to be a priori guaranteed, but this can be relaxed by using previously stored data concurrently with current data in the update of the critic approximators. Appropriate robustifying terms are added to the controllers to eliminate the effects of the residual errors, leading to asymptotic stability of the equilibrium point of the closed-loop system. Simulation results show the effectiveness of the proposed approach for a sixth-order dynamical example. Copyright (C) 2015 John Wiley & Sons, Ltd.
引用
收藏
页码:1494 / 1522
页数:29
相关论文
共 50 条
[1]   Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach [J].
Abu-Khalaf, M ;
Lewis, FL .
AUTOMATICA, 2005, 41 (05) :779-791
[2]   OPTIMAL-CONTROL OF WEAKLY COUPLED BILINEAR-SYSTEMS [J].
AGANOVIC, Z ;
GAJIC, Z .
AUTOMATICA, 1993, 29 (06) :1591-1593
[3]  
Aganovic Z., 1995, LINEAR OPTIMAL CONTR
[4]  
Aldhaheri R, 1991, IEEE T AUTOMATIC CON, V36
[5]  
[Anonymous], 2008, Nonlinear dynamical systems and control: A Lyapunov-based approach
[6]  
[Anonymous], 1994, Introduction to the Numerical Solutions of Markov Chains
[7]  
Bardi M., 1997, Optimal control and viscosity solutions of HamiltonJacobi-Bellman equations
[8]   Successive Galerkin approximation algorithms for nonlinear optimal and robust control [J].
Beard, RW ;
McLain, TW .
INTERNATIONAL JOURNAL OF CONTROL, 1998, 71 (05) :717-743
[9]  
Bertsekas D. P., 2007, Dynamic Programming and Optimal Control
[10]   A novel actor-critic-identifier architecture for approximate optimal control of uncertain nonlinear systems [J].
Bhasin, S. ;
Kamalapurkar, R. ;
Johnson, M. ;
Vamvoudakis, K. G. ;
Lewis, F. L. ;
Dixon, W. E. .
AUTOMATICA, 2013, 49 (01) :82-92