On kernel nonparametric regression designed for complex survey data

被引:14
作者
Harms, Torsten [1 ]
Duchesne, Pierre [2 ]
机构
[1] Free Univ Berlin, D-14195 Berlin, Germany
[2] Univ Montreal, Dept Math & Stat, Montreal, PQ H3C 3J7, Canada
关键词
Bandwidth; Design-based inference; Local linear regression; Local polynomial regression; Model-based inference; Nonparametric regression; Sampling weights; Survey sampling; DENSITY-ESTIMATION; ESTIMATORS; VARIANCE; SPLINES;
D O I
10.1007/s00184-009-0244-5
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In this article, we consider nonparametric regression analysis between two variables when data are sampled through a complex survey. While nonparametric regression analysis has been widely used with data that may be assumed to be generated from independently and identically distributed (iid) random variables, the methods and asymptotic analyses established for iid data need to be extended in the framework of complex survey designs. Local polynomial regression estimators are studied, which include as particular cases design-based versions of the Nadaraya-Watson estimator and of the local linear regression estimator. In this paper, special emphasis is given to the local linear regression estimator. Our estimators incorporate both the sampling weights and the kernel weights. We derive the asymptotic mean squared error (MSE) of the kernel estimators using a combined inference framework, and as a corollary consistency of the estimators is deduced. Selection of a bandwidth is necessary for the resulting estimators; an optimal bandwidth can be determined, according to the MSE criterion in the combined mode of inference. Simulation experiments are conducted to illustrate the proposed methodology and an application with the Canadian survey of labour and income dynamics is presented.
引用
收藏
页码:111 / 138
页数:28
相关论文
共 22 条
[1]  
[Anonymous], 1994, Kernel smoothing
[2]  
[Anonymous], 1989, Analysis of Complex Surveys
[3]  
Bellhouse DR, 1999, STAT SINICA, V9, P407
[4]   Model-assisted estimation for complex surveys using penalised splines [J].
Breidt, FJ ;
Claeskens, G ;
Opsomer, JD .
BIOMETRIKA, 2005, 92 (04) :831-846
[5]  
Breidt FJ, 2000, ANN STAT, V28, P1026
[6]   Asymptotic properties of kernel density estimation with complex survey data [J].
Buskirk, TD ;
Lohr, SL .
JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2005, 128 (01) :165-190
[7]  
BUSKIRK TD, 1998, 1998 P SURV RES METH, P799
[8]   CALIBRATION ESTIMATORS IN SURVEY SAMPLING [J].
DEVILLE, JC ;
SARNDAL, CE .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1992, 87 (418) :376-382
[9]   Variance reduction in surveys with auxiliary information: a nonparametric approach involving regression splines [J].
Goga, C .
CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2005, 33 (02) :163-180
[10]   Inference for superpopulation parameters using sample surveys [J].
Graubard, BI ;
Korn, EL .
STATISTICAL SCIENCE, 2002, 17 (01) :73-96