CONTINUOUS-TIME ROBUST DYNAMIC PROGRAMMING

被引:36
作者
Bian, Tao [1 ]
Jiang, Zhong-Ping [2 ]
机构
[1] Bank Amer Merrill Lynch, One Bryant Pk, New York, NY 10036 USA
[2] NYU, Tandon Sch Engn, Dept Elect & Comp Engn, 6 Metrotech Ctr, Brooklyn, NY 11201 USA
基金
美国国家科学基金会;
关键词
dynamic programming; stochastic optimal control; adaptive optimal control; robust control; STOCHASTIC-APPROXIMATION; STABILIZATION; STABILITY; SYSTEMS; INPUT;
D O I
10.1137/18M1214147
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a new theory, known as robust dynamic programming, for a class of continuous-time dynamical systems. Different from traditional dynamic programming (DP) methods, this new theory serves as a fundamental tool to analyze the robustness of DP algorithms, and, in particular, to develop novel adaptive optimal control and reinforcement learning methods. In order to demonstrate the potential of this new framework, two illustrative applications in the fields of stochastic and decentralized optimal control are presented. Two numerical examples arising from both finance and engineering industries are also given, along with several possible extensions of the proposed framework.
引用
收藏
页码:4150 / 4174
页数:25
相关论文
共 61 条
[31]  
Khalil H.K., 2002, Non Linear System
[32]   ON AN ITERATIVE TECHNIQUE FOR RICCATI EQUATION COMPUTATIONS [J].
KLEINMAN, DL .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1968, AC13 (01) :114-+
[33]  
Krstic M., 1995, Nonlinear and adaptive control design
[34]  
Kucera V., 1973, Kybernetika, V9, P42
[35]  
Kushner H., 2003, Stochastic Approximation and Recursive Algorithms and Applications
[36]   MAXIMALLY ACHIEVABLE ACCURACY OF LINEAR OPTIMAL REGULATORS AND LINEAR OPTIMAL FILTERS [J].
KWAKERNA.H ;
SIVAN, R .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1972, AC17 (01) :79-&
[37]  
Lewis F. L., 2013, Reinforcement Learning and Approximate Dynamic Programming for Feedback Control
[38]  
Liberzon Daniel, 2012, Calculus of Variations and Optimal Control Theory
[39]  
Lim S. H., 2013, Proc. Adv. Neural Inf. Process. Syst., V26, P701
[40]   Robust control of Markov decision processes with uncertain transition matrices [J].
Nilim, A ;
El Ghaoui, L .
OPERATIONS RESEARCH, 2005, 53 (05) :780-798