How Many Imputations Do You Need? A Two-stage Calculation Using a Quadratic Rule

被引:345
作者
von Hippel, Paul T. [1 ]
机构
[1] Univ Texas Austin, Publ Policy Data Sci & Sociol, Austin, TX 78712 USA
关键词
missing data; missing values; incomplete data; multiple imputation; imputation;
D O I
10.1177/0049124117747303
中图分类号
O1 [数学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 0701 ; 070101 ;
摘要
When using multiple imputation, users often want to know how many imputations they need. An old answer is that 2-10 imputations usually suffice, but this recommendation only addresses the efficiency of point estimates. You may need more imputations if, in addition to efficient point estimates, you also want standard error (SE) estimates that would not change (much) if you imputed the data again. For replicableSEestimates, the required number of imputations increases quadratically with the fraction of missing information (not linearly, as previous studies have suggested). I recommend a two-stage procedure in which you conduct a pilot analysis using a small-to-moderate number of imputations, then use the results to calculate the number of imputations that are needed for a final analysis whoseSEestimates will have the desired level of replicability. I implement the two-stage procedure using a new SAS macro called%mi_combineand a new Stata command called how_many_imputations.
引用
收藏
页码:699 / 718
页数:20
相关论文
共 15 条
[1]   Missing data techniques for structural equation modeling [J].
Allison, PD .
JOURNAL OF ABNORMAL PSYCHOLOGY, 2003, 112 (04) :545-557
[2]  
[Anonymous], 2009, Multiple Imputation for Nonresponse in Surveys
[3]  
[Anonymous], 1997, Analysis of incomplete multivariate data
[4]   Small-sample degrees of freedom with multiple imputation [J].
Barnard, J ;
Rubin, DB .
BIOMETRIKA, 1999, 86 (04) :948-955
[5]   What Improves with Increased Missing Data Imputations? [J].
Bodner, Todd E. .
STRUCTURAL EQUATION MODELING-A MULTIDISCIPLINARY JOURNAL, 2008, 15 (04) :651-675
[6]  
Dean A.M., 2000, Design and Analysis of Experiments
[7]   Replication standards quantitive social science - Why not sociology [J].
Freese, Jeremy .
SOCIOLOGICAL METHODS & RESEARCH, 2007, 36 (02) :153-172
[8]   How many imputations are really needed? - Some practical clarifications of multiple imputation theory [J].
Graham, John W. ;
Olchowski, Allison E. ;
Gilreath, Tamika D. .
PREVENTION SCIENCE, 2007, 8 (03) :206-213
[9]  
Harel O., 2007, STAT METHODOL, V4, P75, DOI [10.1016/j.stamet.2006.03.002, DOI 10.1016/J.STAMET.2006.03.002]
[10]  
Levy P.S., 2009, Sampling of populations: Methods and applications, V4th