On kernel nonparametric regression designed for complex survey data

被引:14
作者
Harms, Torsten [1 ]
Duchesne, Pierre [2 ]
机构
[1] Free Univ Berlin, D-14195 Berlin, Germany
[2] Univ Montreal, Dept Math & Stat, Montreal, PQ H3C 3J7, Canada
关键词
Bandwidth; Design-based inference; Local linear regression; Local polynomial regression; Model-based inference; Nonparametric regression; Sampling weights; Survey sampling; DENSITY-ESTIMATION; ESTIMATORS; VARIANCE; SPLINES;
D O I
10.1007/s00184-009-0244-5
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In this article, we consider nonparametric regression analysis between two variables when data are sampled through a complex survey. While nonparametric regression analysis has been widely used with data that may be assumed to be generated from independently and identically distributed (iid) random variables, the methods and asymptotic analyses established for iid data need to be extended in the framework of complex survey designs. Local polynomial regression estimators are studied, which include as particular cases design-based versions of the Nadaraya-Watson estimator and of the local linear regression estimator. In this paper, special emphasis is given to the local linear regression estimator. Our estimators incorporate both the sampling weights and the kernel weights. We derive the asymptotic mean squared error (MSE) of the kernel estimators using a combined inference framework, and as a corollary consistency of the estimators is deduced. Selection of a bandwidth is necessary for the resulting estimators; an optimal bandwidth can be determined, according to the MSE criterion in the combined mode of inference. Simulation experiments are conducted to illustrate the proposed methodology and an application with the Canadian survey of labour and income dynamics is presented.
引用
收藏
页码:111 / 138
页数:28
相关论文
共 22 条
[11]  
Hardle W., 1991, SMOOTHING TECHNIQUES
[12]   SUPER-POPULATION VIEWPOINT FOR FINITE POPULATION SAMPLING [J].
HARTLEY, HO ;
SIELKEN, RL .
BIOMETRICS, 1975, 31 (02) :411-422
[13]   Estimating distribution functions from survey data using nonparametric regression [J].
Johnson A.A. ;
Jay Breidt F. ;
Opsomer J.D. .
Journal of Statistical Theory and Practice, 2008, 2 (3) :419-431
[14]  
Korn E.L., 1997, J OFF STAT, V13, P385
[15]   Scatterplots with survey data [J].
Korn, EL ;
Graubard, BI .
AMERICAN STATISTICIAN, 1998, 52 (01) :58-69
[16]  
Nadaraya E.A., 1964, Theory Prob. Appl, V9, P141, DOI DOI 10.1137/1109020
[17]   Selecting the amount of smoothing in nonparametric regression estimation for complex surveys [J].
Opsomer, JD ;
Miller, CP .
JOURNAL OF NONPARAMETRIC STATISTICS, 2005, 17 (05) :593-611
[18]   ESTIMATION OF A PROBABILITY DENSITY-FUNCTION AND MODE [J].
PARZEN, E .
ANNALS OF MATHEMATICAL STATISTICS, 1962, 33 (03) :1065-&
[19]   MULTIVARIATE LOCALLY WEIGHTED LEAST-SQUARES REGRESSION [J].
RUPPERT, D ;
WAND, MP .
ANNALS OF STATISTICS, 1994, 22 (03) :1346-1370
[20]  
Sarndal C.-E., 1992, Model-assisted survey sampling