Bandwidth selection for the smoothing of distribution functions

被引:151
作者
Bowman, A [1 ]
Hall, P
Prvan, T
机构
[1] Univ Glasgow, Dept Stat, Glasgow G12 8QQ, Lanark, Scotland
[2] Australian Natl Univ, Ctr Math & Applicat, Canberra, ACT 0200, Australia
[3] Univ Canberra, Sch Math & Stat, Canberra, ACT 2601, Australia
基金
澳大利亚研究理事会;
关键词
crossvalidation; empirical distribution function; kernel; smoothing;
D O I
10.1093/biomet/85.4.799
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Several approaches can be made to the choice of bandwidth in the kernel smoothing of distribution functions. Recent proposals by Sarda (1993) and by Altman & Leger (1995) are analogues of the 'leave-one-out' and 'plug-in' methods which have been widely used in density estimation. In contrast, a method of crossvalidation appropriate to the smoothing of distribution functions is proposed. Selection of the bandwidth parameter is based on unbiased estimation of a mean integrated squared error curve whose minimising value defines an optimal smoothing parameter. This procedure is shown to lead to asymptotically optimal bandwidth choice, not just in the usual first-order sense but also in the second-order sense in which kernel methods improve on the standard empirical distribution function. Some general theory on the performance of optimal, data-based methods of bandwidth choice is also provided, leading to results which do not have analogues in the context of density estimation. The numerical performances of all the methods discussed in the paper are compared. A bandwidth based on a simple reference distribution is also included. Simulations suggest that the crossvalidatory proposal works well, although the simple reference bandwidth is also quite effective.
引用
收藏
页码:799 / 808
页数:10
相关论文
共 11 条