MULTIPLE IMPUTATION WITH SURVEY WEIGHTS: A MULTILEVEL APPROACH

被引:36
作者
Quartagno, M. [1 ,2 ]
Carpenter, J. R. [1 ,2 ]
Goldstein, H. [1 ,3 ]
机构
[1] London Sch Hyg & Trop Med, Dept Med Stat, Keppel St, London, England
[2] UCL, MRC Clin Trials Unit, 90 High Holborn, London WC1V 6LJ, England
[3] Univ Bristol, Ctr Multilevel Modelling, 35 Berkeley Sq, Bristol, Avon, England
基金
英国医学研究理事会;
关键词
Missing data; Multilevel; Multiple imputation; Survey; Weights; VARIANCE-ESTIMATION; MODELS;
D O I
10.1093/jssam/smz036
中图分类号
O1 [数学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 0701 ; 070101 ;
摘要
Multiple imputation is now well established as a practical and flexible method for analyzing partially observed data, particularly under the missing at random assumption. However, when the substantive model is a weighted analysis, there is concern about the empirical performance of Rubin's rules and also about how to appropriately incorporate possible interaction between the weights and the distribution of the study variables. One approach that has been suggested is to include the weights in the imputation model, potentially also allowing for interactions with the other variables. We show that the theoretical criterion justifying this approach can be approximately satisfied if we stratify the weights to define level-two units in our data set and include random intercepts in the imputation model. Further, if we let the covariance matrix of the variables have a random distribution across the level-two units, we also allow imputation to reflect any interaction between weight strata and the distribution of the variables. We evaluate our proposal in a number of simulation scenarios, showing it has promising performance both in terms of coverage levels of the model parameters and bias of the associated Rubin's variance estimates. We illustrate its application to a weighted analysis of factors predicting reception-year readiness in children in the UK Millennium Cohort Study.
引用
收藏
页码:965 / 989
页数:25
相关论文
共 30 条
[1]  
[Anonymous], 2009, Multiple Imputation for Nonresponse in Surveys
[2]  
[Anonymous], 2013, Multiple imputation and its application, DOI DOI 10.1002/9781119942283
[3]   Multiple imputation of covariates by fully conditional specification: Accommodating the substantive model [J].
Bartlett, Jonathan W. ;
Seaman, Shaun R. ;
White, Ian R. ;
Carpenter, James R. .
STATISTICAL METHODS IN MEDICAL RESEARCH, 2015, 24 (04) :462-487
[4]   ON VARIANCE ESTIMATION UNDER AUXILIARY VALUE IMPUTATION IN SAMPLE SURVEYS [J].
Beaumont, Jean-Francois ;
Haziza, David ;
Bocci, Cynthia .
STATISTICA SINICA, 2011, 21 (02) :515-537
[5]   Multiple membership multiple classification (MMMC) models [J].
Browne, William J. ;
Goldstein, Harvey ;
Rasbash, Jon .
STATISTICAL MODELLING, 2001, 1 (02) :103-124
[6]   CALIBRATION ESTIMATORS IN SURVEY SAMPLING [J].
DEVILLE, JC ;
SARNDAL, CE .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1992, 87 (418) :376-382
[7]  
Elliott MR, 2007, SURV METHODOL, V33, P23
[8]  
Fay R.E., 1993, AM STAT ASS P SURVEY, P41
[9]  
Fay R.E., 1992, Proceedings of the Survey Research Methods Section, American Statistical Association, P227
[10]   Fitting multilevel multivariate models with missing data in responses and covariates that may include interactions and non-linear terms [J].
Goldstein, Harvey ;
Carpenter, James R. ;
Browne, William J. .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 2014, 177 (02) :553-564