Density estimation with distribution element trees

被引:13
作者
Meyer, Daniel W. [1 ]
机构
[1] Swiss Fed Inst Technol, Inst Fluid Dynam, Zurich, Switzerland
关键词
Nonparametric density estimation; Adaptive histogram; Kernel density estimation; Adaptive binning; Polynomial histogram; Curse of dimensionality; High dimensional; Big data; Polya tree; Density estimation tree; MULTIVARIATE DENSITY; QUADRATURE; SELECTION;
D O I
10.1007/s11222-017-9751-9
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The estimation of probability densities based on available data is a central task in many statistical applications. Especially in the case of large ensembles with many samples or high-dimensional sample spaces, computationally efficient methods are needed. We propose a new method that is based on a decomposition of the unknown distribution in terms of so-called distribution elements (DEs). These elements enable an adaptive and hierarchical discretization of the sample space with small or large elements in regions with smoothly or highly variable densities, respectively. The novel refinement strategy that we propose is based on statistical goodness-of-fit and pairwise (as an approximation to mutual) independence tests that evaluate the local approximation of the distribution in terms of DEs. The capabilities of our new method are inspected based on several examples of different dimensionality and successfully compared with other state-of-the-art density estimators.
引用
收藏
页码:609 / 632
页数:24
相关论文
共 46 条
[1]   Local bandwidth selectors for deconvolution kernel density estimation [J].
Achilleos, Achilleas ;
Delaigle, Aurore .
STATISTICS AND COMPUTING, 2012, 22 (02) :563-577
[2]   The autodependogram: a graphical device to investigate serial dependences [J].
Bagnato, Luca ;
Punzo, Antonio ;
Nicolis, Orietta .
JOURNAL OF TIME SERIES ANALYSIS, 2012, 33 (02) :233-254
[3]   SmcHD1, containing a structural-maintenance-of-chromosomes hinge domain, has a critical role in X inactivation [J].
Blewitt, Marnie E. ;
Gendrel, Anne-Valerie ;
Pang, Zhenyi ;
Sparrow, Duncan B. ;
Whitelaw, Nadia ;
Craig, Jeffrey M. ;
Apedaile, Anwyn ;
Hilton, Douglas J. ;
Dunwoodie, Sally L. ;
Brockdorff, Neil ;
Kay, Graham F. ;
Whitelaw, Emma .
NATURE GENETICS, 2008, 40 (05) :663-669
[4]  
Botev Z, 2007, SPECTRAL IMPLEMENTAT
[5]  
Botev Z, 2016, IMPLEMENTATION ADAPT
[6]   KERNEL DENSITY ESTIMATION VIA DIFFUSION [J].
Botev, Z. I. ;
Grotowski, J. F. ;
Kroese, D. P. .
ANNALS OF STATISTICS, 2010, 38 (05) :2916-2957
[7]   A COMPARATIVE-STUDY OF SEVERAL SMOOTHING METHODS IN DENSITY-ESTIMATION [J].
CAO, R ;
CUEVAS, A ;
MANTEIGA, WG .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 1994, 17 (02) :153-176
[8]   THE CHI-2 TEST OF GOODNESS OF FIT [J].
COCHRAN, WG .
ANNALS OF MATHEMATICAL STATISTICS, 1952, 23 (03) :315-345
[9]  
Curtin RR, 2013, J MACH LEARN RES, V14, P801
[10]   BAYESIAN ANALYSIS OF SOME NONPARAMETRIC PROBLEMS [J].
FERGUSON, TS .
ANNALS OF STATISTICS, 1973, 1 (02) :209-230