A new class of upper bounds on the log partition function

被引:154
作者
Wainwright, MJ [1 ]
Jaakkola, TS
Willsky, AS
机构
[1] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA
[2] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA
[3] MIT, Dept Elect Engn & Comp Sci, Cambridge, MA 02139 USA
关键词
approximate inference; belief propagation; Bethe/Kikuchi free energy; factor graphs; graphical models; information geometry; Markov random field (MRF); partition function; sum-product algorithm; variational method;
D O I
10.1109/TIT.2005.850091
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We introduce a new class of upper bounds on the log partition function of a Markov random field (MRIT). This quantity plays an important role in various contexts, including approximating marginal distributions, parameter estimation, combinatorial enumeration, statistical decision theory and large-deviations bounds. Our derivation is based on concepts from convex duality and information geometry: in particular, it exploits mixtures of distributions in the exponential domain, and the Legendre mapping between exponential and mean parameters. In the special case of convex combinations of tree-structured distributions, we obtain a family of variational problems, similar to the Bethe variational problem, but distinguished by the following desirable properties: i) they are convex, and have a unique global optimum; and ii) the optimum gives an upper bound on the log partition function. This optimum is defined by stationary conditions very similar to those defining fixed points of the sum-product algorithm, or more generally, any local optimum of the Bethe variational problem. As with sum-product fixed points, the elements of the optimizing argument can be used as approximations to the marginals of the original model. The analysis extends naturally to convex combinations of hypertree-structured distributions, thereby establishing links to Kikuchi approximations and variants.
引用
收藏
页码:2313 / 2335
页数:23
相关论文
共 48 条
[1]   Information geometry on hierarchy of probability distributions [J].
Amari, S .
IEEE TRANSACTIONS ON INFORMATION THEORY, 2001, 47 (05) :1701-1711
[3]  
[Anonymous], CONVEX ANAL
[4]  
[Anonymous], MATH THEORY SYSTEMS
[5]  
BARNDORFFNIELSO.OE, 1978, INFORMATION EXPONENT
[6]  
Baxter R J., 1982, EXACTLY SOLVED MODEL
[7]  
Berge C., 1989, HYPERGRAPHS
[8]  
BERTSEKAS DP, 1995, NONLINEAR PROGRAMMIN
[9]  
Biggs N., 1993, ALGEBRAIC GRAPH THEO
[10]  
Bollob┬u├s B., 2013, MODERN GRAPH THEORY, V184