Handling missing data: analysis of a challenging data set using multiple imputation

被引:74
作者
Pampaka, Maria [1 ]
Hutcheson, Graeme [1 ]
Williams, Julian [1 ]
机构
[1] Univ Manchester, Manchester Inst Educ, Room B4-10 Ellen Wilkinson Bldg,Oxford Rd, Manchester M13 9PL, Lancs, England
基金
英国经济与社会研究理事会;
关键词
missing data; surveys; multiple imputation; regression; modelling;
D O I
10.1080/1743727X.2014.979146
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
Missing data is endemic in much educational research. However, practices such as step-wise regression common in the educational research literature have been shown to be dangerous when significant data are missing, and multiple imputation (MI) is generally recommended by statisticians. In this paper, we provide a review of these advances and their implications for educational research. We illustrate the issues with an educational, longitudinal survey in which missing data was significant, but for which we were able to collect much of these missing data through subsequent data collection. We thus compare methods, that is, step-wise regression (basically ignoring the missing data) and MI models, with the model from the actual enhanced sample. The value of MI is discussed and the risks involved in ignoring missing data are considered. Implications for research practice are discussed.
引用
收藏
页码:19 / 37
页数:19
相关论文
共 33 条
[1]   Multiple imputation for missing data - A cautionary tale [J].
Allison, PD .
SOCIOLOGICAL METHODS & RESEARCH, 2000, 28 (03) :301-309
[2]   Imputation methods for handling item-nonresponse in practice: methodological issues and recent debates [J].
Durrant, Gabriele B. .
INTERNATIONAL JOURNAL OF SOCIAL RESEARCH METHODOLOGY, 2009, 12 (04) :293-304
[3]  
Fox J., 1987, SOCIOL METHODOL, V17, P347, DOI [DOI 10.2307/271037, 10.2307/271037]
[4]  
Fox J, 2009, J STAT SOFTW, V32, P1
[5]  
Honaker J, 2011, J STAT SOFTW, V45, P1
[6]   What to Do about Missing Values in Time-Series Cross-Section Data [J].
Honaker, James ;
King, Gary .
AMERICAN JOURNAL OF POLITICAL SCIENCE, 2010, 54 (02) :561-581
[7]   Much ado about nothing: A comparison of missing data methods and software to fit incomplete data regression models [J].
Horton, Nicholas J. ;
Kleinman, Ken P. .
AMERICAN STATISTICIAN, 2007, 61 (01) :79-90
[8]   Missing Data: data replacement and imputation [J].
Hutcheson, Graeme ;
Pampaka, Maria .
JOURNAL OF MODELLING IN MANAGEMENT, 2012, 7 (02)
[9]   Enrolment, achievement and retention on 'traditional' and 'Use of Mathematics' pre-university courses [J].
Hutcheson, Graeme ;
Pampaka, Maria ;
Williams, Julian .
RESEARCH IN MATHEMATICS EDUCATION, 2011, 13 (02) :147-168
[10]   Missing-data methods for generalized linear models: A comparative review [J].
Ibrahim, JG ;
Chen, MH ;
Lipsitz, SR ;
Herring, AH .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2005, 100 (469) :332-346