Stochastic nonlinear minimax dynamic games with noisy measurements

被引:21
作者
Charalambous, CD [1 ]
机构
[1] Univ Ottawa, Sch Informat Technol & Engn, Ottawa, ON K1S 6N5, Canada
[2] McGill Univ, Dept Elect & Comp Engn, Ctr Intelligent Machines, Montreal, PQ H3A 2A7, Canada
关键词
certainty equivalence; dissipation; information state; separation; stochastic minimax games;
D O I
10.1109/TAC.2002.808475
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This note is concerned with nonlinear stochastic minimax dynamic games which are subject to noisy measurements. The minimizing players are control inputs while the maximizing players are square-integrable stochastic processes. The minimax dynamic game is formulated using an information state, which depends on the paths of the observed processes. The information state satisfies a partial differential equation of the Hamilton-Jacobi-Bellman (HJB) type. The HJB equation is employed to characterize the dissipation properties of the system, to derive a separation theorem between the design of the estimator and the controller, and to introduce a certainty-equivalence principle along the lines of Whittle. Finally, the separation theorem and the certainty-equivalence principle are applied to solve, the linear-quadratic-Gaussian minimax game. The results of this note generalize the L-2-gain of deterministic systems to stochastic analogs; they are related to the controller design of stochastic systems which employ risk-sensitive performance criteria, and to the controller design of deterministic systems which employ minimax performance criteria.
引用
收藏
页码:261 / 266
页数:6
相关论文
共 17 条
[1]  
Balakrishnan A V., 1976, Applied functional analysis
[2]   A FINITE-DIMENSIONAL RISK-SENSITIVE CONTROL PROBLEM [J].
BENSOUSSAN, A ;
ELLIOTT, RJ .
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 1995, 33 (06) :1834-1846
[3]   OPTIMAL-CONTROL OF PARTIALLY OBSERVABLE STOCHASTIC-SYSTEMS WITH AN EXPONENTIAL-OF-INTEGRAL PERFORMANCE INDEX [J].
BENSOUSSAN, A ;
VANSCHUPPEN, JH .
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 1985, 23 (04) :599-613
[4]  
CHARALAMBOUS C, 1996, STOCH STOCH REP, V57
[5]   The role of information state and adjoint in relating nonlinear output feedback risk-sensitive control and dynamic games [J].
Charalambous, CD .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1997, 42 (08) :1163-1170
[6]   Partially observable nonlinear risk-sensitive control problems: Dynamic programming and verification theorems [J].
Charalambous, CD .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1997, 42 (08) :1130-1138
[7]   Certain nonlinear partially observable stochastic optimal control problems with explicit control laws equivalent to LEQG/LQG problems [J].
Charalambous, CD ;
Elliott, RJ .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1997, 42 (04) :482-497
[8]   A max-plus-based algorithm for a Hamilton-Jacobi-Bellman equation of nonlinear filtering [J].
Fleming, WH ;
McEneaney, WM .
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2000, 38 (03) :683-710
[9]  
FLEMING WH, 1992, STOCHASTIC THEORY AD, P185
[10]  
HELTON JW, 1999, EXTENDING H INFINITY