Stochastic nonlinear minimax dynamic games with noisy measurements

被引：21

作者：

Charalambous, CD ^{[1
]}

机构：

[1] Univ Ottawa, Sch Informat Technol & Engn, Ottawa, ON K1S 6N5, Canada

[2] McGill Univ, Dept Elect & Comp Engn, Ctr Intelligent Machines, Montreal, PQ H3A 2A7, Canada

来源：

IEEE TRANSACTIONS ON AUTOMATIC CONTROL | 2003年 / 48卷 / 02期

关键词：

certainty equivalence; dissipation; information state; separation; stochastic minimax games;

D O I：

10.1109/TAC.2002.808475

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This note is concerned with nonlinear stochastic minimax dynamic games which are subject to noisy measurements. The minimizing players are control inputs while the maximizing players are square-integrable stochastic processes. The minimax dynamic game is formulated using an information state, which depends on the paths of the observed processes. The information state satisfies a partial differential equation of the Hamilton-Jacobi-Bellman (HJB) type. The HJB equation is employed to characterize the dissipation properties of the system, to derive a separation theorem between the design of the estimator and the controller, and to introduce a certainty-equivalence principle along the lines of Whittle. Finally, the separation theorem and the certainty-equivalence principle are applied to solve, the linear-quadratic-Gaussian minimax game. The results of this note generalize the L-2-gain of deterministic systems to stochastic analogs; they are related to the controller design of stochastic systems which employ risk-sensitive performance criteria, and to the controller design of deterministic systems which employ minimax performance criteria.

引用

页码：261 / 266

页数：6

共 17 条

[1]

Balakrishnan A V., 1976, Applied functional analysis

[2] A FINITE-DIMENSIONAL RISK-SENSITIVE CONTROL PROBLEM [J].

BENSOUSSAN, A ;

ELLIOTT, RJ .

SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 1995, 33 (06) :1834-1846

[3] OPTIMAL-CONTROL OF PARTIALLY OBSERVABLE STOCHASTIC-SYSTEMS WITH AN EXPONENTIAL-OF-INTEGRAL PERFORMANCE INDEX [J].

BENSOUSSAN, A ;

VANSCHUPPEN, JH .

SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 1985, 23 (04) :599-613

[4]

CHARALAMBOUS C, 1996, STOCH STOCH REP, V57

[5] The role of information state and adjoint in relating nonlinear output feedback risk-sensitive control and dynamic games [J].

Charalambous, CD .

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1997, 42 (08) :1163-1170

[6] Partially observable nonlinear risk-sensitive control problems: Dynamic programming and verification theorems [J].

Charalambous, CD .

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1997, 42 (08) :1130-1138

[7] Certain nonlinear partially observable stochastic optimal control problems with explicit control laws equivalent to LEQG/LQG problems [J].

Charalambous, CD ;

Elliott, RJ .

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1997, 42 (04) :482-497

[8] A max-plus-based algorithm for a Hamilton-Jacobi-Bellman equation of nonlinear filtering [J].

Fleming, WH ;

McEneaney, WM .

SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2000, 38 (03) :683-710

[9]

FLEMING WH, 1992, STOCHASTIC THEORY AD, P185

[10]

HELTON JW, 1999, EXTENDING H INFINITY

← 1 2 →