Missing Data: Five Practical Guidelines

被引:942
|
作者
Newman, Daniel A. [1 ,2 ]
机构
[1] Univ Illinois, Dept Psychol, Champaign, IL USA
[2] Univ Illinois, Sch Labor & Employment Relat, Champaign, IL USA
关键词
missing data; full information maximum likelihood (FIML); EM algorithm; multiple imputation; R syntax/R code; STRUCTURAL EQUATION MODELS; MAXIMUM-LIKELIHOOD; SAMPLE SELECTION; RESPONSE RATES; IMPUTATION; METAANALYSIS; ACCURACY; BIAS;
D O I
10.1177/1094428114548590
中图分类号
B849 [应用心理学];
学科分类号
040203 ;
摘要
Missing data (a) reside at three missing data levels of analysis (item-, construct-, and person-level), (b) arise fromthree missing datamechanisms(missing completely at random, missing at random, and missing not at random) that range from completely random to systematic missingness, (c) can engender two missing data problems (biased parameter estimates and inaccurate hypothesis tests/inaccurate standard errors/low power), and (d) mandate a choice from among several missing data treatments (listwise deletion, pairwise deletion, single imputation, maximum likelihood, and multiple imputation). Whereas all missing data treatments are imperfect and are rooted in particular statistical assumptions, some missing data treatments are worse than others, on average (i. e., they lead to more bias in parameter estimates and less accurate hypothesis tests). Social scientists still routinely choose the more biased and error-prone techniques (listwise and pairwise deletion), likely due to poor familiarity with and misconceptions about the less biased/less error-prone techniques (maximum likelihood and multiple imputation). The current user-friendly review provides five easy-to-understand practical guidelines, with the goal of reducing missing data bias and error in the reporting of research results. Syntax is provided for correlation, multiple regression, and structural equation modeling with missing data.
引用
收藏
页码:372 / 411
页数:40
相关论文
共 50 条
  • [31] Missing Outcome Data in Epidemiologic Studies
    Cole, Stephen R.
    Zivich, Paul N.
    Edwards, Jessie K.
    Ross, Rachael K.
    Shook-Sa, Bonnie E.
    Price, Joan T.
    Stringer, Jeffrey S. A.
    AMERICAN JOURNAL OF EPIDEMIOLOGY, 2023, 192 (01) : 6 - 10
  • [32] A review on missing hydrological data processing
    Gao, Yongbo
    Merz, Christoph
    Lischeid, Gunnar
    Schneider, Michael
    ENVIRONMENTAL EARTH SCIENCES, 2018, 77 (02)
  • [33] A survey on missing data in machine learning
    Emmanuel, Tlamelo
    Maupong, Thabiso
    Mpoeleng, Dimane
    Semong, Thabo
    Mphago, Banyatsang
    Tabona, Oteng
    JOURNAL OF BIG DATA, 2021, 8 (01)
  • [34] Analyzing Longitudinal Data With Missing Values
    Enders, Craig K.
    REHABILITATION PSYCHOLOGY, 2011, 56 (04) : 267 - 288
  • [35] ADDRESSING AND ADVANCING THE PROBLEM OF MISSING DATA
    Walton, Marc K.
    JOURNAL OF BIOPHARMACEUTICAL STATISTICS, 2009, 19 (06) : 945 - 956
  • [36] Propensity Score Analysis With Missing Data
    Cham, Heining
    West, Stephen G.
    PSYCHOLOGICAL METHODS, 2016, 21 (03) : 427 - 445
  • [37] Missing Data: An Update on the State of the Art
    Enders, Craig K.
    PSYCHOLOGICAL METHODS, 2023, : 322 - 339
  • [38] Cooperative Clustering Missing Data Imputation
    Wan, Daoming
    Razavi-Far, Roozbeh
    Saif, Mehrdad
    2020 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2020, : 1039 - 1045
  • [39] Missing data and the design of phylogenetic analyses
    Wiens, JJ
    JOURNAL OF BIOMEDICAL INFORMATICS, 2006, 39 (01) : 34 - 42
  • [40] Simple methods to handle missing data
    Bici, Ruzhdie
    INTERNATIONAL JOURNAL OF COMPUTATIONAL ECONOMICS AND ECONOMETRICS, 2023, 13 (02) : 216 - 242