Adaptive nearly optimal control for a class of continuous-time nonaffine nonlinear systems with inequality constraints

被引:29
作者
Fan, Quan-Yong [1 ]
Yang, Guang-Hong [1 ,2 ]
机构
[1] Northeastern Univ, Coll Informat Sci & Engn, Shenyang 110819, Liaoning, Peoples R China
[2] Northeastern Univ, State Key Lab Synthet Automat Proc Ind, Shenyang 110819, Liaoning, Peoples R China
基金
中国国家自然科学基金;
关键词
Nonaffine nonlinear systems; Inequality constraints; Adaptive optimal control; Neural network; DYNAMIC-PROGRAMMING ALGORITHM; BARRIER LYAPUNOV FUNCTIONS; TRACKING CONTROL; STATE; UNCERTAIN; MODEL; DESIGN; INPUT; STABILIZATION; ARCHITECTURE;
D O I
10.1016/j.isatra.2016.10.019
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The state inequality constraints have been hardly considered in the literature on solving the nonlinear optimal control problem based the adaptive dynamic programming (ADP) method. In this paper, an actor-critic (AC) algorithm is developed to solve the optimal control problem with a discounted cost function for a class of state-constrained nonaffine nonlinear systems. To overcome the difficulties resulting from the inequality constraints and the nonaffine nonlinearities of the controlled systems, a novel transformation technique with redesigned slack functions and a pre-compensator method are introduced to convert the constrained optimal control problem into an unconstrained one for affine nonlinear systems. Then, based on the policy iteration (PI) algorithm, an online AC scheme is proposed to learn the nearly optimal control policy for the obtained affine nonlinear dynamics. Using the information of the nonlinear model, novel adaptive update laws are designed to guarantee the convergence of the neural network (NN) weights and the stability of the affine nonlinear dynamics without the requirement for the probing signal. Finally, the effectiveness of the proposed method is validated by simulation studies. (C) 2016 ISA. Published by Elsevier Ltd. All rights reserved.
引用
收藏
页码:122 / 133
页数:12
相关论文
共 46 条
[1]   Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach [J].
Abu-Khalaf, M ;
Lewis, FL .
AUTOMATICA, 2005, 41 (05) :779-791
[2]  
[Anonymous], IEEE T CYBERN
[3]  
Bellman R. E., 1957, Dynamic programming. Princeton landmarks in mathematics
[4]   A novel actor-critic-identifier architecture for approximate optimal control of uncertain nonlinear systems [J].
Bhasin, S. ;
Kamalapurkar, R. ;
Johnson, M. ;
Vamvoudakis, K. G. ;
Lewis, F. L. ;
Dixon, W. E. .
AUTOMATICA, 2013, 49 (01) :82-92
[5]   Adaptive dynamic programming and optimal control of nonlinear nonaffine systems [J].
Bian, Tao ;
Jiang, Yu ;
Jiang, Zhong-Ping .
AUTOMATICA, 2014, 50 (10) :2624-2632
[6]   A unified framework for hybrid control: Model and optimal control theory [J].
Branicky, MS ;
Borkar, VS ;
Mitter, SK .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1998, 43 (01) :31-45
[7]  
Cox C, 1998, IEEE SYS MAN CYBERN, P1652, DOI 10.1109/ICSMC.1998.728126
[8]   Adaptive Actor-Critic Design-Based Integral Sliding-Mode Control for Partially Unknown Nonlinear Systems With Input Disturbances [J].
Fan, Quan-Yong ;
Yang, Guang-Hong .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2016, 27 (01) :165-177
[9]  
Finlayson B. A., 2013, METHOD WEIGHTED RESI
[10]   Output-feedback adaptive optimal control of interconnected systems based on robust adaptive dynamic programming [J].
Gao, Weinan ;
Jiang, Yu ;
Jiang, Zhong-Ping ;
Chai, Tianyou .
AUTOMATICA, 2016, 72 :37-45