A variational approach to nonlinear estimation

被引:59
作者
Mitter, SK [1 ]
Newton, NJ
机构
[1] MIT, Dept Elect Engn & Comp Sci, Cambridge, MA 02139 USA
[2] MIT, Informat & Decis Syst Lab, Cambridge, MA 02139 USA
[3] Univ Essex, Dept Elect Syst Engn, Colchester CO4 3SQ, Essex, England
关键词
Bayesian inference; information theory; Legendre-type transforms; nonlinear filtering; stochastic optimal control;
D O I
10.1137/S0363012901393894
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider estimation problems, in which the estimand, X, and observation, Y, take values in measurable spaces. Regular conditional versions of the forward and inverse Bayes formula are shown to have dual variational characterizations involving the minimization of apparent information and the maximization of compatible information. These both have natural information-theoretic interpretations, according to which Bayes' formula and its inverse are optimal information processors. The variational characterization of the forward formula has the same form as that of Gibbs measures in statistical mechanics. The special case in which X and Y are diffusion processes governed by stochastic differential equations is examined in detail. The minimization of apparent information can then be formulated as a stochastic optimal control problem, with cost that is quadratic in both the control and observation fit. The dual problem can be formulated in terms of infinite-dimensional deterministic optimal control. Local versions of the variational characterizations are developed which quantify information flow in the estimators. In this context, the information conserving property of Bayesian estimators coincides with the Davis-Varaiya martingale stochastic dynamic programming principle.
引用
收藏
页码:1813 / 1833
页数:21
相关论文
共 16 条
[1]  
[Anonymous], 1975, Stochastic differential equations and applications
[2]  
Borkar V.S., 2002, DIRECTIONS MATH SYST, P41
[3]  
Clark J. M. C., 1978, NATO ADV STUDY I S E, V25, P721
[4]   DYNAMIC PROGRAMMING CONDITIONS FOR PARTIALLY OBSERVABLE STOCHASTIC SYSTEMS [J].
DAVIS, MHA ;
VARAIYA, P .
SIAM JOURNAL ON CONTROL, 1973, 11 (02) :226-261
[5]   A PATHWISE SOLUTION OF THE EQUATIONS OF NON-LINEAR FILTERING [J].
DAVIS, MHA .
THEORY OF PROBABILITY AND ITS APPLICATIONS, 1983, 27 (01) :167-175
[6]  
DUPUIS P., 2011, A Weak Convergence Approach to the Theory of Large Deviations
[7]   OPTIMAL CONTROL AND NONLINEAR FILTERING FOR NONDEGENERATE DIFFUSION PROCESSES. [J].
FLEMING, WENDELL H. ;
MITTER, SANJOY K. .
1982, V 8 (N 1) :63-77
[8]  
Georgii H.-O., 1988, Gibbs Measures and Phase Transitions
[9]  
Liptser R. S, 1977, STAT RANDOM PROCESSE, pI
[10]   Dynamical systems in the variational formulation of the Fokker-Planck equation by the Wasserstein metric [J].
Mikami, T .
APPLIED MATHEMATICS AND OPTIMIZATION, 2000, 42 (02) :203-227