Missing Data: Five Practical Guidelines

被引：1008

作者：

Newman, Daniel A. ^{[1
,2
]}

机构：

[1] Univ Illinois, Dept Psychol, Champaign, IL USA

[2] Univ Illinois, Sch Labor & Employment Relat, Champaign, IL USA

来源：

ORGANIZATIONAL RESEARCH METHODS | 2014年 / 17卷 / 04期

关键词：

missing data; full information maximum likelihood (FIML); EM algorithm; multiple imputation; R syntax/R code; STRUCTURAL EQUATION MODELS; MAXIMUM-LIKELIHOOD; SAMPLE SELECTION; RESPONSE RATES; IMPUTATION; METAANALYSIS; ACCURACY; BIAS;

D O I：

10.1177/1094428114548590

中图分类号：

B849 [应用心理学];

学科分类号：

040203 ;

摘要：

Missing data (a) reside at three missing data levels of analysis (item-, construct-, and person-level), (b) arise fromthree missing datamechanisms(missing completely at random, missing at random, and missing not at random) that range from completely random to systematic missingness, (c) can engender two missing data problems (biased parameter estimates and inaccurate hypothesis tests/inaccurate standard errors/low power), and (d) mandate a choice from among several missing data treatments (listwise deletion, pairwise deletion, single imputation, maximum likelihood, and multiple imputation). Whereas all missing data treatments are imperfect and are rooted in particular statistical assumptions, some missing data treatments are worse than others, on average (i. e., they lead to more bias in parameter estimates and less accurate hypothesis tests). Social scientists still routinely choose the more biased and error-prone techniques (listwise and pairwise deletion), likely due to poor familiarity with and misconceptions about the less biased/less error-prone techniques (maximum likelihood and multiple imputation). The current user-friendly review provides five easy-to-understand practical guidelines, with the goal of reducing missing data bias and error in the reporting of research results. Syntax is provided for correlation, multiple regression, and structural equation modeling with missing data.

引用

页码：372 / 411

页数：40

共 55 条

[1]

Allison Paul D., 2002, MISSING DATA

[2]

[Anonymous], 2013, Bayesian data analysis, third edition

[3]

[Anonymous], 2009, Multiple Imputation for Nonresponse in Surveys

[4]

[Anonymous], 2011, Handbook of Advanced Multilevel Analysis, DOI [10.4324/9780203848852-18, DOI 10.4324/9780203848852-18]

[5]

[Anonymous], 1978, The Belmont report: Ethical principles and guidelines for the protection of human subjects of research

[6] Response Rates in Organizational Science, 1995-2008: A Meta-analytic Review and Guidelines for Survey Researchers [J].

Anseel, Frederik ;

Lievens, Filip ;

Schollaert, Eveline ;

Choragwicka, Beata .

JOURNAL OF BUSINESS AND PSYCHOLOGY, 2010, 25 (03) :335-349

[7] Development of a measure of workplace deviance [J].

Bennett, RJ ;

Robinson, SL .

JOURNAL OF APPLIED PSYCHOLOGY, 2000, 85 (03) :349-360

[8] Influence of imputation and EM methods on factor analysis when item nonresponse in questionnaire data is nonignorable [J].

Bernaards, CA ;

Sijtsma, K .

MULTIVARIATE BEHAVIORAL RESEARCH, 2000, 35 (03) :321-364

[9] Implications of empirical Bayes meta-analysis for test validation [J].

Brannick, MT .

JOURNAL OF APPLIED PSYCHOLOGY, 2001, 86 (03) :468-480

[10] A comparison of inclusive and restrictive strategies in modern missing data procedures [J].

Collins, LM ;

Schafer, JL ;

Kam, CM .

PSYCHOLOGICAL METHODS, 2001, 6 (04) :330-351

← 1 2 3 4 5 6 →