BANDWIDTH SELECTION FOR KERNEL DENSITY-ESTIMATION

被引:116
作者
CHIU, ST
机构
关键词
KERNEL DENSITY ESTIMATION; BANDWIDTH SELECTION; CROSS-VALIDATION; CHARACTERISTIC FUNCTION; PLUG-IN METHOD;
D O I
10.1214/aos/1176348376
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The problem of automatic bandwidth selection for a kernel density estimator is considered. It is well recognized that the bandwidth estimate selected by the least squares cross-validation is subject to large sample variation. This difficulty limits the application of the cross-validation estimate. Based on characteristic functions, an important expression for the cross-validation bandwidth estimate is obtained. The expression clearly points out the source of variation. To stabilize the variation, a simple bandwidth selection procedure is proposed. It is shown that the stabilized bandwidth selector gives a strongly consistent estimate of the optimal bandwidth. Under commonly used smoothness conditions, the stabilized bandwidth estimate has a faster convergence rate than the convergence rate of the cross-validation estimate. For sufficiently smooth density functions, it is shown that the stabilized bandwidth estimate is asymptotically normal with a relative convergence rate n-1/2 instead of the rate n-1/10 of the cross-validation estimate. A plug-in estimate and an adjusted plug-in estimate are also proposed, and their asymptotic distributions are obtained. It is noted that the plug-in estimate is asymptotically efficient. The adjusted plug-in bandwidth estimate and the stabilized bandwidth estimate are shown to be asymptotically equivalent. The simulation results verify that the proposed procedures perform much better than the cross-validation for finite samples.
引用
收藏
页码:1883 / 1905
页数:23
相关论文
共 50 条
[21]   Bandwidth selection for kernel density estimation: a review of fully automatic selectors [J].
Nils-Bastian Heidenreich ;
Anja Schindler ;
Stefan Sperlich .
AStA Advances in Statistical Analysis, 2013, 97 :403-433
[22]   Data based bandwidth selection in kernel density estimation with parametric start via kernel contrasts [J].
Ahmad, IA ;
Ran, IS .
JOURNAL OF NONPARAMETRIC STATISTICS, 2004, 16 (06) :841-877
[23]   How Bandwidth Selection Algorithms Impact Exploratory Data Analysis Using Kernel Density Estimation [J].
Harpole, Jared K. ;
Woods, Carol M. ;
Rodebaugh, Thomas L. ;
Levinson, Cheri A. ;
Lenze, Eric J. .
PSYCHOLOGICAL METHODS, 2014, 19 (03) :428-443
[24]   Modified Fast Algorithm for the Bandwidth Selection of the Kernel Density Estimation [J].
A. V. Lapko ;
V. A. Lapko .
Optoelectronics, Instrumentation and Data Processing, 2020, 56 :566-572
[25]   Information bound for bandwidth selection in kernel estimation of density derivatives [J].
Wu, TJ ;
Lin, Y .
STATISTICA SINICA, 2000, 10 (02) :457-473
[26]   Optimal bandwidth selection for multivariate kernel deconvolution density estimation [J].
Youndje, Elie ;
Wells, Martin T. .
TEST, 2008, 17 (01) :138-162
[27]   Optimal bandwidth selection for multivariate kernel deconvolution density estimation [J].
Élie Youndjé ;
Martin T. Wells .
TEST, 2008, 17 :138-162
[28]   Modified Fast Algorithm for the Bandwidth Selection of the Kernel Density Estimation [J].
Lapko, A., V ;
Lapko, V. A. .
OPTOELECTRONICS INSTRUMENTATION AND DATA PROCESSING, 2020, 56 (06) :566-572
[29]   Bayes bandwidth selection in kernel density estimation with censored data [J].
Kulasekera, K. B. ;
Padgett, W. J. .
JOURNAL OF NONPARAMETRIC STATISTICS, 2006, 18 (02) :129-143
[30]   A Bayesian approach to bandwidth selection for multivariate kernel density estimation [J].
Zhang, Xibin ;
King, Maxwell L. ;
Hyndman, Rob J. .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2006, 50 (11) :3009-3031