NONPARAMETRIC ESTIMATION OF MULTIVARIATE DENSITY WITH DIRECT AND AUXILIARY DATA AND APPLICATION

被引:0
作者
Bandyopadhyay, Subhadip [1 ]
Bose, Arup [2 ]
Sengupta, Debasis [3 ]
机构
[1] Infosys Technol Ltd, SET Labs, CKDIS, Hyderabad 500032, Andhra Pradesh, India
[2] Indian Stat Inst, Stat Math Unit, Kolkata 700108, India
[3] Indian Stat Inst, Appl Stat Unit, Kolkata 700108, India
关键词
Incomplete data; kernel density estimation; classification; CLASSIFICATION;
D O I
10.1007/s13226-010-0003-3
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
We consider the problem of multivariate density estimation, using samples from the distribution of interest as well as auxiliary samples from a related distribution. We assume that the data from the target distribution and the related distribution may occur individually as well as In pairs. Using nonparametric maximum likelihood estimator of the joint distribution, we derive a kernel density estimator of the marginal density. We show theoretically, in a simple special case, that the implied estimator of the marginal density has smaller integrated mean squared error than that of a similar estimator obtained by ignoring dependence of the paired observations. We establish consistency of the marginal density estimator under suitable conditions. We demonstrate small sample superiority of the proposed estimator over the estimator that ignores dependence of the samples, through a simulation study with dependent and non-normal populations. The application of the density estimator in nonparametric classification is also discussed. It is shown that the misclassification probability of the resulting classifier is asymptotically equivalent to that of the Bayes classifier. We also include a data analytic illustration.
引用
收藏
页码:251 / 274
页数:24
相关论文
共 18 条