General-purpose imputation of planned missing data in social surveys: Different strategies and their effect on correlations

被引:5
作者
Axenfeld, Julian B. [1 ]
Bruch, Christian [2 ,3 ]
Wolf, Christof [2 ,3 ]
机构
[1] Mannheim Ctr European Social Res, A5,6, D-68159 Mannheim, Germany
[2] GESIS Leibniz Inst Social Sci, B6,4-5, D-68159 Mannheim, Germany
[3] Univ Mannheim, Mannheim Ctr European Social Res, A5,6, D-68159 Mannheim, Germany
关键词
Bias; Monte Carlo simulation; multiple imputation; imputation methods; split questionnaire design; FULLY CONDITIONAL SPECIFICATION; SPLIT QUESTIONNAIRE DESIGN; MULTIPLE IMPUTATION; SQUARES; REGRESSION; LENGTH; MICE;
D O I
10.1214/22-SS137
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Planned missing survey data, for example stemming from split questionnaire designs are becoming increasingly common in survey research, making imputation indispensable to obtain reasonably analyzable data. However, these data can be difficult to impute due to low correlations, many predictors, and limited sample sizes to support imputation models. This paper presents findings from a Monte Carlo simulation, in which we investigate the accuracy of correlations after multiple imputation using different imputation methods and predictor set specifications based on data from the German Internet Panel (GIP). The results show that strategies that simplify the imputation exercise (such as predictive mean matching with dimensionality reduction or restricted predictor sets, linear regression models, or the multivariate normal model without transformation) perform well, while especially generalized linear models for categorical data, classification trees, and imputation models with many predictor variables lead to strong biases.
引用
收藏
页码:182 / 209
页数:28
相关论文
共 71 条
[1]   Split Questionnaire Design for Massive Surveys [J].
Adiguzel, Feray ;
Wedel, Michel .
JOURNAL OF MARKETING RESEARCH, 2008, 45 (05) :608-617
[2]   An Empirical Comparison of Multiple Imputation Methods for Categorical Data [J].
Akande, Olanrewaju ;
Li, Fan ;
Reiter, Jerome .
AMERICAN STATISTICIAN, 2017, 71 (02) :162-170
[3]  
[Anonymous], 1961, Adaptive Control Processes: a Guided Tour, DOI DOI 10.1515/9781400874668
[4]  
[Anonymous], 1999, NORM Users Guide (Version 2)
[5]  
[Anonymous], 1998, Classification and regression trees
[6]  
[Anonymous], 2005, P SAS US GROUP INT S
[7]  
Axenfeld, 2022, CODE DATA AVAILABILI
[8]   Split Questionnaire Designs for Online Surveys: The Impact of Module Construction on Imputation Quality [J].
Axenfeld, Julian B. ;
Blom, Annelies G. ;
Bruch, Christian ;
Wolf, Christof .
JOURNAL OF SURVEY STATISTICS AND METHODOLOGY, 2022, 10 (05) :1236-1262
[9]  
Bahrami S., 2014, Improving survey methods: lessons from recent research, P368
[10]   Multiple imputation of covariates by fully conditional specification: Accommodating the substantive model [J].
Bartlett, Jonathan W. ;
Seaman, Shaun R. ;
White, Ian R. ;
Carpenter, James R. .
STATISTICAL METHODS IN MEDICAL RESEARCH, 2015, 24 (04) :462-487