A Bernstein-Von Mises Theorem for discrete probability distributions

Cited by: 23
Authors
Boucheron, S. [1, 2]
Gassiat, E. [3]
Affiliations
[1] CNRS, LPMA, F-75700 Paris, France
[2] Univ Paris Diderot, Paris, France
[3] Univ Paris 11, Paris, France
Keywords
Bernstein-Von Mises Theorem; Entropy estimation; non-parametric Bayesian statistics; Discrete models; Concentration inequalities;
DOI
10.1214/08-EJS262
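Chinese Library Classification number
O21 [Probability theory and mathematical statistics]; C8 [Statistics];
Discipline classification code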
020208 ; 070103 ; 0714 ;
Abstract
We investigate the asymptotic normality of the posterior distribution in the discrete setting, when model dimension increases with sample size. We consider a probability mass function $\theta_0$ on $\mathbb{N} \setminus \{0\}$ and a sequence of truncation levels $(k_n)_n$ satisfying $k_n^3 \le n \inf_{i \le k_n} \theta_0(i)$. Let $\hat{\theta}_n$ denote the maximum likelihood estimate of $(\theta_0(i))_{i \le k_n}$ and let $\Delta_n(\theta_0)$ denote the $k_n$-dimensional vector whose $i$-th coordinate is $\sqrt{n}\,(\hat{\theta}_n(i) - \theta_0(i))$ for $1 \le i \le k_n$. We check that under mild conditions on $\theta_0$ and on the sequence of prior probabilities on the $k_n$-dimensional simplices, after centering and rescaling, the variation distance between the posterior distribution recentered around $\hat{\theta}_n$ and rescaled by $\sqrt{n}$ and the $k_n$-dimensional Gaussian distribution $\mathcal{N}(\Delta_n(\theta_0), I^{-1}(\theta_0))$ converges in probability to 0. This theorem can be used to prove the asymptotic normality of Bayesian estimators of Shannon and Renyi entropies. The proofs are based on concentration inequalities for centered and non-centered Chi-square (Pearson) statistics. The latter make it possible to establish posterior concentration rates with respect to Fisher distance rather than with respect to the Hellinger distance, as is commonplace in non-parametric Bayesian statistics.
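The flavour of the result can be illustrated numerically in a much simpler setting than the one treated in the paper. The sketch below is not the authors' construction. It assumes a fixed dimension (four categories, not a growing $k_n$), a symmetric Dirichlet prior with parameter 1, and it compares only the first two posterior moments with those of a Gaussian centered at the maximum likelihood estimate, not the variation distance itself. The function names, `theta0`, and the sample size are all hypothetical choices made for illustration.

```python
import random


def simulate_counts(theta, n, seed=0):
    """Draw n i.i.d. observations from the pmf `theta` and return category counts."""
    rng = random.Random(seed)
    counts = [0] * len(theta)
    for i in rng.choices(range(len(theta)), weights=theta, k=n):
        counts[i] += 1
    return counts


def dirichlet_posterior_summary(counts, prior_alpha=1.0):
    """Exact posterior mean and variance of each coordinate.

    Assumption: symmetric Dirichlet(prior_alpha) prior, so the posterior
    is Dirichlet(counts + prior_alpha).
    """
    a = [c + prior_alpha for c in counts]
    total = sum(a)
    mean = [ai / total for ai in a]
    var = [ai * (total - ai) / (total ** 2 * (total + 1.0)) for ai in a]
    return mean, var


def gaussian_approximation(counts):
    """MLE and the per-coordinate variance p(1 - p) / n of the Gaussian limit."""
    n = sum(counts)
    mle = [c / n for c in counts]
    var = [p * (1.0 - p) / n for p in mle]
    return mle, var


theta0 = [0.4, 0.3, 0.2, 0.1]  # hypothetical true pmf
counts = simulate_counts(theta0, n=20000)

post_mean, post_var = dirichlet_posterior_summary(counts)
mle, approx_var = gaussian_approximation(counts)

# Gap between posterior mean and MLE, in units of the Gaussian standard deviation.
max_scaled_shift = max(
    abs(m - p) / v ** 0.5 for m, p, v in zip(post_mean, mle, approx_var)
)
# Ratio of exact posterior variance to the Gaussian variance, per coordinate.
variance_ratios = [pv / av for pv, av in zip(post_var, approx_var)]

print("max scaled shift:", max_scaled_shift)
print("variance ratios:", variance_ratios)
```

With a large sample, the scaled shift should be small and the variance ratios close to 1, which is what a Bernstein-Von Mises statement predicts for the marginals in this fixed-dimension case.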
Pages: 114-148
Number of pages: 35