Functional Bregman Divergence and Bayesian Estimation of Distributions

被引:57
作者
Frigyik, Bela A. [1 ]
Srivastava, Santosh [2 ]
Gupta, Maya R. [3 ]
机构
[1] Purdue Univ, Dept Math, W Lafayette, IN 47907 USA
[2] Univ Washington, Dept Appl Math, Seattle, WA 98195 USA
[3] Univ Washington, Dept Elect Engn, Seattle, WA 98195 USA
关键词
Bayesian estimation; Bregman divergence; convexity; Frechet derivative; uniform distribution;
D O I
10.1109/TIT.2008.929943
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A class of distortions termed functional Bregman divergences is defined, which includes squared error and relative entropy. A functional Bregman divergence acts on functions or distributions, and generalizes the standard Bregman divergence for vectors and a previous pointwise Bregman divergence that was defined for functions. A recent result showed that the mean minimizes the expected Bregman divergence. The new functional definition enables the extension of this result to the continuous case to show that the mean minimizes the expected functional Bregman divergence over a set of functions or distributions. It is shown how this theorem applies to the Bayesian estimation of distributions. Estimation of the uniform distribution from independent and identically drawn samples is presented as a case study.
引用
收藏
页码:5130 / 5139
页数:10
相关论文
共 28 条
  • [1] Unifying divergence minimization and statistical inference via convex duality
    Altun, Yasemin
    Smola, Alex
    [J]. LEARNING THEORY, PROCEEDINGS, 2006, 4005 : 139 - 153
  • [2] Amari SI, 2007, Methods of information geometry, V191
  • [3] [Anonymous], 1995, ADV CALCULUS SEVERAL
  • [4] Banerjee A, 2005, J MACH LEARN RES, V6, P1705
  • [5] On the optimality of conditional expectation as a Bregman predictor
    Banerjee, A
    Guo, X
    Wang, H
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 2005, 51 (07) : 2664 - 2669
  • [6] Banerjee Arindam, 2007, P SIAM INT C DAT MIN
  • [7] Logistic regression, AdaBoost and Bregman distances
    Collins, M
    Schapire, RE
    Singer, Y
    [J]. MACHINE LEARNING, 2002, 48 (1-3) : 253 - 285
  • [8] Cover TM, 2006, Elements of Information Theory
  • [9] GENERALIZED PROJECTIONS FOR NONNEGATIVE FUNCTIONS
    CSISZAR, I
    [J]. ACTA MATHEMATICA HUNGARICA, 1995, 68 (1-2) : 161 - 186
  • [10] Friedman J, 2001, The elements of statistical learning, V1, DOI DOI 10.1007/978-0-387-21606-5