Stochastic uncertain systems subject to relative entropy constraints: Induced norms and monotonicity properties of minimax games

被引:36
作者
Charalambous, Charalambos D. [1 ]
Rezaei, Farzad
机构
[1] Univ Cyprus, Dept Elect & Comp Engn, Nicosia, Cyprus
[2] Univ Ottawa, Sch Informat Technol & Engn, Ottawa, ON K1N 6N5, Canada
关键词
duality properties; large deviations; minimax games; relative entropy; uncertain stochastic systems;
D O I
10.1109/TAC.2007.894517
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Entropy and relative entropy are fundamental concepts on which information theory is founded on, and in general, telecommunication systems design. On the other hand, dissipation inequalities, minimax strategies, and induced norms are the basic concepts on which robustness of uncertain control and estimation of systems are founded on. In this paper, the precise relation between these notions is investigated. In particular, it will be shown that the higher the dissipation the higher the entropy of the system, which has implications in computing the induced norm associated with robustness. These connections are obtained by considering stochastic optimal uncertain control systems, in which uncertainty is described by a relative entropy constraint between the nominal and uncertain measures, while the pay-off is a linear functional of the uncertain measure. This is a minimax game, in which the controller measure seeks to minimize the pay-off, while the disturbance measure aims at maximizing the pay-off. Salient properties of the minimax solution are derived, including a characterization of the optimal sensitivity reduction, computation of the induced norm, monotonicity properties of minimax solution, and relations between dissipation and relative entropy of the system. The theory is developed in an abstract setting and then applied to nonlinear partially observable continuous-time uncertain controlled systems, in which the nominal and uncertain systems are described by conditional distributions. In addition, existence of the optimal control policy among the class of policies known as wide-sense control laws is shown, and an explicit formulae for the worst case conditional measure is derived. The results are applied to linear-quadratic-Gaussian problems.
引用
收藏
页码:647 / 663
页数:17
相关论文
共 33 条
[1]   ON THE RELATION OF ZAKAI EQUATIONS AND MORTENSEN EQUATIONS [J].
BENES, V ;
KARATZAS, I .
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 1983, 21 (03) :472-489
[2]   OPTIMAL-CONTROL OF PARTIALLY OBSERVABLE STOCHASTIC-SYSTEMS WITH AN EXPONENTIAL-OF-INTEGRAL PERFORMANCE INDEX [J].
BENSOUSSAN, A ;
VANSCHUPPEN, JH .
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 1985, 23 (04) :599-613
[3]  
Billingsley P., 1968, Convergence of probability measures
[4]  
CHARALAMBOUS C, 1996, STOCH STOCH REP, V57
[5]   Relations between information theory, robustness and statistical mechanics of stochastic uncertain systems via large deviation theory [J].
Charalambous, C. D. ;
Kyprianou, A. ;
Rezaei, F. .
MODEL REDUCTION AND COARSE-GRAINING APPROACHES FOR MULTISCALE PHENOMENA, 2006, :269-+
[6]   The role of information state and adjoint in relating nonlinear output feedback risk-sensitive control and dynamic games [J].
Charalambous, CD .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1997, 42 (08) :1163-1170
[7]   Information capacity of MIMO channels with relative entropy constraint [J].
Charalambous, Charalambos D. ;
Denic, Stojan Z. ;
Constantinou, Costas .
2006 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY, VOLS 1-6, PROCEEDINGS, 2006, :876-+
[8]  
CHARALMBOUS CD, 2004, SIAM J CONTROL OPTIM, V2, P1578
[9]   Risk-sensitive filtering and smoothing via reference probability methods [J].
Dey, S ;
Moore, JB .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1997, 42 (11) :1587-1591
[10]  
Dupuis P., 1997, WEAK CONVERGENCE APP