We Have to Be Discrete About This: A Non-Parametric Imputation Technique for Missing Categorical Data

被引:30
作者
Cranmer, Skyler J. [1 ]
Gill, Jeff [2 ]
机构
[1] Univ N Carolina, Dept Polit Sci, Chapel Hill, NC 27515 USA
[2] Washington Univ, Dept Polit Sci, St Louis, MO 63130 USA
关键词
MULTIPLE IMPUTATION; ECONOMIC-DEVELOPMENT; DEMOCRACIES; INFERENCE;
D O I
10.1017/S0007123412000312
中图分类号
D0 [政治学、政治理论];
学科分类号
0302 ; 030201 ;
摘要
Missing values are a frequent problem in empirical political science research. Surprisingly, the match between the measurement of the missing values and the correcting algorithms applied is seldom studied. While multiple imputation is a vast improvement over the deletion of cases with missing values, it is often unsuitable for imputing highly non-granular discrete data. We develop a simple technique for imputing missing values in such situations, which is a variant of hot deck imputation, drawing from the conditional distribution of the variable with missing values to preserve the discrete measure of the variable. This method is tested against existing techniques using Monte Carlo analysis and then applied to real data on democratization and modernization theory. Software for our imputation technique is provided in a free, easy-to-use package for the R statistical environment.
引用
收藏
页码:425 / 449
页数:25
相关论文
共 54 条