Variance estimation in two-stage cluster sampling under imputation for missing data

被引:6
作者
Haziza D. [1 ]
Rao J.N.K. [2 ]
机构
[1] Département de mathématiques et de statistique, Université de Montréal, Montreal
[2] School of Mathematics and Statistics, Carleton University, Ottawa
基金
加拿大自然科学与工程研究理事会;
关键词
Nonresponse; Random hot-deck imputation; Reverse framework; Two-phase framework; Two-stage cluster sampling; Variance estimation;
D O I
10.1080/15598608.2010.10412021
中图分类号
学科分类号
摘要
Variance estimation in the presence of imputed data has been widely studied in the literature. It is well known that treating the imputed values as if they were true values could lead to serious underestimation of the true variance, especially if the response rates are low. In this paper, we consider the problem of variance estimation using a model, in the context of two-stage cluster sampling designs which are widely used in social and household surveys. In cluster sampling designs, units in the same neighborhood tend to have similar characteristics (e.g., income, education level, etc). It is thus important to take account of the intra-cluster correlation in formulating the model and then derive variance estimators under the appropriate model. In this paper, we consider weighted random hot-deck imputation and derive consistent variance estimators under two distinct frameworks: (i) the two-phase framework and (ii) the reverse framework. In the case of the two-phase framework, we use a variance estimation method proposed by Särndal (1992), whereas we use a method developed by Fay (1991) and Shao and Steel (1999) in the case of the reverse framework. Finally, we perform a simulation study to evaluate the performance of the proposed variance estimators in terms of relative bias. We conclude that the variance estimators obtained by Shao-Steel’s method are more robust to model misspecification than those derived using Särndal’s method. © 2010 Taylor & Francis Group, LLC. All rights reserved.
引用
收藏
页码:827 / 844
页数:17
相关论文
共 17 条
[1]  
Beaumont J.F., Bocci C., Variance estimation when donor imputation is used to fill in missing data, The Canadian Journal of Statistics, 37, pp. 400-416, (2009)
[2]  
Brick J.M., Kalton G., Kim J.K., Variance estimation with hot deck imputation using a model, Survey Methodology, 30, pp. 57-66, (2004)
[3]  
Demnati A., Rao J.N.K., Linearization variance estimators for survey data, Survey Methodology, 30, pp. 17-27, (2004)
[4]  
Deville J.C., Sarndal C.E., Variance estimation for the regression imputed Horvitz-Thompson estimator, Journal of Official Statistics, 23, pp. 33-40, (1994)
[5]  
Fay R.E., A Design-Based Perspective on Missing Data Variance. Proceedings of the 1991, pp. 429-440, (1991)
[6]  
Haziza D., Imputation and inference in the presence of missing data, Handbook of Statistics, 29, pp. 215-246, (2009)
[7]  
Kim J.K., Rao J.N.K., Unified approach to linearization variance estimation from survey data after imputation for item nonresponse, Biometrika, 96, pp. 917-932, (2009)
[8]  
Rao J.N.K., Variance Estimation under Imputation for Missing Data, (1990)
[9]  
Rao J.N.K., On variance estimation with imputed survey data, Journal of American Statistical Association, 91, pp. 499-506, (1996)
[10]  
Rao J.N.K., Shao J., On variance estimation under imputation for missing data, Biometrika, 79, pp. 811-822, (1992)