A Bayesian approach to bandwidth selection for multivariate kernel density estimation

被引:122
作者
Zhang, Xibin [1 ]
King, Maxwell L. [1 ]
Hyndman, Rob J. [1 ]
机构
[1] Monash Univ, Dept Econometr & Business Stat, Clayton, Vic 3800, Australia
基金
澳大利亚研究理事会;
关键词
cross-validation; Kullback-Leibler information; mean integrated squared errors; sampling algorithms; Monte Carlo kernel likelihood;
D O I
10.1016/j.csda.2005.06.019
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Kernel density estimation for multivariate data is an important technique that has a wide range of applications. However, it has received significantly less attention than its univariate counter-part. The lower level of interest in multivariate kernel density estimation is mainly due to the increased difficulty in deriving an optimal data-driven bandwidth as the dimension of the data increases. We provide Markov chain Monte Carlo (MCMC) algorithms for estimating optimal bandwidth matrices for multivariate kernel density estimation. Our approach is based on treating the elements of the bandwidth matrix as parameters whose posterior density can be obtained through the likelihood cross-validation criterion. Numerical studies for bivariate data show that the MCMC algorithm generally performs better than the plug-in algorithm under the Kullback-Leibler information criterion, and is as good as the plug-in algorithm under the mean integrated squared error (MISE) criterion. Numerical studies for five-dimensional data show that our algorithm is superior to the normal reference rule. Our MCMC algorithm is the first data-driven bandwidth selector for multivariate kernel density estimation that is applicable to data of any dimension. (C) 2005 Elsevier B.V. All rights reserved.
引用
收藏
页码:3009 / 3031
页数:23
相关论文
共 36 条
[1]   ON BANDWIDTH VARIATION IN KERNEL ESTIMATES - A SQUARE ROOT LAW [J].
ABRAMSON, IS .
ANNALS OF STATISTICS, 1982, 10 (04) :1217-1223
[2]   Nonparametric estimation of state-price densities implicit in financial asset prices [J].
Ait-Sahalia, Y ;
Lo, AW .
JOURNAL OF FINANCE, 1998, 53 (02) :499-547
[3]   Testing continuous-time models of the spot interest rate [J].
Ait-Sahalia, Y .
REVIEW OF FINANCIAL STUDIES, 1996, 9 (02) :385-426
[4]  
Amenta N., 1998, Computer Graphics. Proceedings. SIGGRAPH 98 Conference Proceedings, P415, DOI 10.1145/280814.280947
[5]  
[Anonymous], 1992, MULTIVARIATE DENSITY
[6]   Statistical applications of the multivariate skew normal distribution [J].
Azzalini, A ;
Capitanio, A .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1999, 61 :579-602
[7]   The multivariate skew-normal distribution [J].
Azzalini, A ;
DallaValle, A .
BIOMETRIKA, 1996, 83 (04) :715-726
[8]   Distributions generated by perturbation of symmetry with emphasis on a multivariate skew t-distribution [J].
Azzalini, A ;
Capitanio, A .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2003, 65 :367-389
[9]  
Bauwens L., 1998, ECONOMET J, V1, pC23
[10]  
Bowman AW, 1997, Applied Smoothing Techniques for Data Analysis: the Kernel Approach with S-Plus Illustrations