Self-consistent method for density estimation

被引:33
作者
Bernacchia, Alberto [1 ]
Pigolotti, Simone [2 ]
机构
[1] Yale Univ, Dept Neurobiol, New Haven, CT 06510 USA
[2] Niels Bohr Inst, DK-2100 Copenhagen, Denmark
关键词
Binning; Kernel density estimation; Non-parametric statistics; PROBABILITY; DISTRIBUTIONS; FIELD; BIAS;
D O I
10.1111/j.1467-9868.2011.00772.x
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The estimation of a density profile from experimental data points is a challenging problem, which is usually tackled by plotting a histogram. Prior assumptions on the nature of the density, from its smoothness to the specification of its form, allow the design of more accurate estimation procedures, such as maximum likelihood. Our aim is to construct a procedure that makes no explicit assumptions, but still providing an accurate estimate of the density. We introduce the self-consistent estimate: the power spectrum of a candidate density is given, and an estimation procedure is constructed on the assumption, to be released a posteriori, that the candidate is correct. The self-consistent estimate is defined as a prior candidate density that precisely reproduces itself. Our main result is to derive the exact expression of the self-consistent estimate for any given data set, and to study its properties. Applications of the method require neither priors on the form of the density nor the subjective choice of parameters. A cutoff frequency, akin to a bin size or a kernel bandwidth, emerges naturally from the derivation. We apply the self-consistent estimate to artificial data generated from various distributions and show that it reaches the theoretical limit for the scaling of the square error with the size of the data set.
引用
收藏
页码:407 / 422
页数:16
相关论文
共 35 条
[11]   A NOTE ON THE USEFULNESS OF SUPERKERNELS IN DENSITY-ESTIMATION [J].
DEVROYE, L .
ANNALS OF STATISTICS, 1992, 20 (04) :2037-2056
[12]   Adaptive estimation of and oracle inequalities for probability densities and characteristic functions [J].
Efromovich, Sam .
ANNALS OF STATISTICS, 2008, 36 (03) :1127-1155
[13]   Correction of density estimators that are not densities [J].
Glad, IK ;
Hjort, NL ;
Ushakov, NG .
SCANDINAVIAN JOURNAL OF STATISTICS, 2003, 30 (02) :415-427
[14]  
GLAD IK, 2007, PREPR STAT, V2, P1
[15]   Problems with fitting to the power-law distribution [J].
Goldstein, ML ;
Morris, SA ;
Yen, GG .
EUROPEAN PHYSICAL JOURNAL B, 2004, 41 (02) :255-258
[16]   NONPARAMETRIC ROUGHNESS PENALTIES FOR PROBABILITY DENSITIES [J].
GOOD, IJ ;
GASKINS, RA .
BIOMETRIKA, 1971, 58 (02) :255-+
[17]   ON THE AMOUNT OF NOISE INHERENT IN BANDWIDTH SELECTION FOR A KERNEL DENSITY ESTIMATOR [J].
HALL, P ;
MARRON, JS .
ANNALS OF STATISTICS, 1987, 15 (01) :163-181
[18]   Analysis of data from continuous probability distributions [J].
Holy, TE .
PHYSICAL REVIEW LETTERS, 1997, 79 (19) :3545-3548
[19]  
Hossjer O, 1996, PROBAB THEORY REL, V105, P159
[20]   Statistical issues in the analysis of neuronal data [J].
Kass, RE ;
Ventura, V ;
Brown, EN .
JOURNAL OF NEUROPHYSIOLOGY, 2005, 94 (01) :8-25