Online optimal control of nonlinear discrete-time systems using approximate dynamic programming

被引:48
作者
Dierks T. [1 ]
Jagannathan S. [2 ]
机构
[1] DRS Sustainment Systems, Inc., St. Louis, MO 63121
[2] Department of Electrical and Computer Engineering, Missouri University of Science and Technology, Rolla
来源
Journal of Control Theory and Applications | 2011年 / 9卷 / 3期
基金
美国国家科学基金会;
关键词
Discrete-time systems; Hamilton Jacobi-Bellman; Online approximators; Online nonlinear optimal control;
D O I
10.1007/s11768-011-0178-0
中图分类号
学科分类号
摘要
In this paper, the optimal control of a class of general affine nonlinear discrete-time (DT) systems is undertaken by solving the Hamilton Jacobi-Bellman (HJB) equation online and forward in time. The proposed approach, referred normally as adaptive or approximate dynamic programming (ADP), uses online approximators (OLAs) to solve the infinite horizon optimal regulation and tracking control problems for affine nonlinear DT systems in the presence of unknown internal dynamics. Both the regulation and tracking controllers are designed using OLAs to obtain the optimal feedback control signal and its associated cost function. Additionally, the tracking controller design entails a feedforward portion that is derived and approximated using an additional OLA for steady state conditions. Novel update laws for tuning the unknown parameters of the OLAs online are derived. Lyapunov techniques are used to show that all signals are uniformly ultimately bounded and that the approximated control signals approach the optimal control inputs with small bounded error. In the absence of OLA reconstruction errors, an optimal control is demonstrated. Simulation results verify that all OLA parameter estimates remain bounded, and the proposed OLA-based optimal control scheme tunes itself to reduce the cost HJB equation. © 2011 South China University of Technology, Academy of Mathematics and Systems Science, Chinese Academy of Sciences and Springer-Verlag Berlin Heidelberg.
引用
收藏
页码:361 / 369
页数:8
相关论文
共 15 条
[1]  
Jagannathan S., Neural Network Control of Nonlinear Discrete-Time Systems, (2006)
[2]  
Lewis F.L., Syrmos V.L., Optimal Control, (1995)
[3]  
Shamma J., Cloutier J., Existence of SDRE stabilizing feedback, IEEE Transactions on Automatic Control, 48, 3, pp. 513-517, (2003)
[4]  
Vlassenbroeck J., van Dooren R., A Chebyshev technique for solving nonlinear optimal control problems, IEEE Transactions on Automatic Control, 33, 4, pp. 333-340, (1988)
[5]  
Chen Z., Jagannathan S., Generalized Hamilton-Jacobi-Bellman formulation based neural network control of affine nonlinear discretetime systems, IEEE Transactions on Neural Networks, 19, 1, pp. 90-106, (2008)
[6]  
Al-Tamimi A., Lewis F.L., Abu-Khalaf M., Discrete-time nonlinear HJB solution using approximate dynamic programming: convergence proof, IEEE Transactions on Systems, Man, and Cybernetics - Part B, 38, 4, pp. 943-949, (2008)
[7]  
Vrabie D., Pastravanu O., Abu-Khalaf M., Et al., Adaptive optimal control for continuous-time linear systems based on policy iteration, Automatica, 45, 2, pp. 477-484, (2009)
[8]  
Toussaint G., Basar T., Bullo F., H-infinity-optimal tracking control techniques for nonlinear underactuated systems, Proceedings of IEEE Conference on Decision and Control, pp. 2078-2083, (2000)
[9]  
Gu D., Hu H., Receding horizon tracking control of wheeled mobile robots, IEEE Transactions on Control Systems Technology, 14, 4, pp. 743-749, (2006)
[10]  
Zhang H., Wei Q., Luo Y., A novel infinite-time optimal tracking control scheme for a class of discrete-time nonlinear systems via the greedy HDP iteration algorithm, IEEE Transactions on Systems, Man, and Cybernetics - Part B, 38, 4, pp. 937-942, (2008)