Numerical Solutions to the Bellman Equation of Optimal Control

被引:37
作者
Aguilar, Cesar O. [1 ]
Krener, Arthur J. [2 ]
机构
[1] Calif State Coll Bakersfield, Dept Math, Bakersfield, CA 93309 USA
[2] Naval Postgrad Sch, Dept Appl Math, Monterey, CA USA
关键词
Discrete-time control systems; Nonlinear optimal regulation; Dynamic programming; Hamilton-Jacobi-Bellman equation; Numerical methods; DATA NONLINEAR-SYSTEMS; STABILIZATION;
D O I
10.1007/s10957-013-0403-8
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
In this paper, we present a numerical algorithm to compute high-order approximate solutions to Bellman's dynamic programming equation that arises in the optimal stabilization of discrete-time nonlinear control systems. The method uses a patchy technique to build local Taylor polynomial approximations defined on small domains, which are then patched together to create a piecewise smooth approximation. The numerical domain is dynamically computed as the level sets of the value function are propagated in reverse time under the closed-loop dynamics. The patch domains are constructed such that their radial boundaries are contained in the level sets of the value function and their lateral boundaries are constructed as invariant sets of the closed-loop dynamics. To minimize the computational effort, an adaptive subdivision algorithm is used to determine the number of patches on each level set depending on the relative error in the dynamic programming equation. Numerical tests in 2D and 3D are given to illustrate the accuracy of the method.
引用
收藏
页码:527 / 552
页数:26
相关论文
共 31 条
[1]  
Albrekht E.G., 1961, Journal of Applied Mathematics and Mechanics, V25, P1254, DOI [DOI 10.1016/0021-8928, 10.1016/0021-8928(61)90005-3, DOI 10.1016/0021-8928(61)90005-3]
[2]  
Ancona F., 1999, ESAIM. Control, Optimisation and Calculus of Variations, V4, P445, DOI 10.1051/cocv:1999117
[3]  
[Anonymous], 1999, Level Set Methods and Fast Marching Methods: Evolving Interfaces in Computational Geometry, Fluid Mechanics, Computer Vision, and Materials Science
[4]   ICOSAHEDRAL DISCRETIZATION OF THE 2-SPHERE [J].
BAUMGARDNER, JR ;
FREDERICKSON, PO .
SIAM JOURNAL ON NUMERICAL ANALYSIS, 1985, 22 (06) :1107-1115
[5]   Galerkin approximations of the generalized Hamilton-Jacobi-Bellman equation [J].
Beard, RW ;
Saridis, GN ;
Wen, JT .
AUTOMATICA, 1997, 33 (12) :2159-2177
[6]   Feedback control methodologies for nonlinear systems [J].
Beeler, SC ;
Tran, HT ;
Banks, HT .
JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2000, 107 (01) :1-33
[7]  
Bellman R., 1971, INTRO MATH THEORY CO
[8]   A PATCHY DYNAMIC PROGRAMMING SCHEME FOR A CLASS OF HAMILTON-JACOBI-BELLMAN EQUATIONS [J].
Cacace, Simone ;
Cristiani, Emiliano ;
Falcone, Maurizio ;
Picarelli, Athena .
SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2012, 34 (05) :A2625-A2649
[9]   VISCOSITY SOLUTIONS OF HAMILTON-JACOBI EQUATIONS [J].
CRANDALL, MG ;
LIONS, PL .
TRANSACTIONS OF THE AMERICAN MATHEMATICAL SOCIETY, 1983, 277 (01) :1-42
[10]   APPROXIMATE SOLUTIONS OF THE BELLMAN EQUATION OF DETERMINISTIC CONTROL-THEORY [J].
DOLCETTA, IC ;
ISHII, H .
APPLIED MATHEMATICS AND OPTIMIZATION, 1984, 11 (02) :161-181