Missing Data: Five Practical Guidelines

被引:942
|
作者
Newman, Daniel A. [1 ,2 ]
机构
[1] Univ Illinois, Dept Psychol, Champaign, IL USA
[2] Univ Illinois, Sch Labor & Employment Relat, Champaign, IL USA
关键词
missing data; full information maximum likelihood (FIML); EM algorithm; multiple imputation; R syntax/R code; STRUCTURAL EQUATION MODELS; MAXIMUM-LIKELIHOOD; SAMPLE SELECTION; RESPONSE RATES; IMPUTATION; METAANALYSIS; ACCURACY; BIAS;
D O I
10.1177/1094428114548590
中图分类号
B849 [应用心理学];
学科分类号
040203 ;
摘要
Missing data (a) reside at three missing data levels of analysis (item-, construct-, and person-level), (b) arise fromthree missing datamechanisms(missing completely at random, missing at random, and missing not at random) that range from completely random to systematic missingness, (c) can engender two missing data problems (biased parameter estimates and inaccurate hypothesis tests/inaccurate standard errors/low power), and (d) mandate a choice from among several missing data treatments (listwise deletion, pairwise deletion, single imputation, maximum likelihood, and multiple imputation). Whereas all missing data treatments are imperfect and are rooted in particular statistical assumptions, some missing data treatments are worse than others, on average (i. e., they lead to more bias in parameter estimates and less accurate hypothesis tests). Social scientists still routinely choose the more biased and error-prone techniques (listwise and pairwise deletion), likely due to poor familiarity with and misconceptions about the less biased/less error-prone techniques (maximum likelihood and multiple imputation). The current user-friendly review provides five easy-to-understand practical guidelines, with the goal of reducing missing data bias and error in the reporting of research results. Syntax is provided for correlation, multiple regression, and structural equation modeling with missing data.
引用
收藏
页码:372 / 411
页数:40
相关论文
共 50 条
  • [1] A practical guide to multiple imputation of missing data in nephrology
    Blazek, Katrina
    van Zwieten, Anita
    Saglimbene, Valeria
    Teixeira-Pinto, Armando
    KIDNEY INTERNATIONAL, 2021, 99 (01) : 68 - 74
  • [2] How to generate missing data for simulation studies
    Zhang, Xijuan
    QUANTITATIVE METHODS FOR PSYCHOLOGY, 2023, 19 (02): : 100 - 122
  • [3] Are missing data adequately handled in cluster randomised trials? A systematic review and guidelines
    Diaz-Ordaz, Karla
    Kenward, Michael G.
    Cohen, Abie
    Coleman, Claire L.
    Eldridge, Sandra
    CLINICAL TRIALS, 2014, 11 (05) : 590 - 600
  • [4] Dealing With Missing Data in Developmental Research
    Enders, Craig K.
    CHILD DEVELOPMENT PERSPECTIVES, 2013, 7 (01) : 27 - 31
  • [5] Retrieval of missing data for meta-analysis: A practical example
    Kelley, GA
    Kelley, KS
    Tran, ZV
    INTERNATIONAL JOURNAL OF TECHNOLOGY ASSESSMENT IN HEALTH CARE, 2004, 20 (03) : 296 - 299
  • [6] Practical Strategies for Extreme Missing Data Imputation in Dementia Diagnosis
    McCombe, Niamh
    Liu, Shuo
    Ding, Xuemei
    Prasad, Girijesh
    Bucholc, Magda
    Finn, David P.
    Todd, Stephen
    McClean, Paula L.
    Wong-Lin, Kongfatt
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2022, 26 (02) : 818 - 827
  • [7] Bayesian methods for dealing with missing data problems
    Ma, Zhihua
    Chen, Guanghui
    JOURNAL OF THE KOREAN STATISTICAL SOCIETY, 2018, 47 (03) : 297 - 313
  • [8] Methods for Handling Missing Secondary Respondent Data
    Young, Rebekah
    Johnson, David
    JOURNAL OF MARRIAGE AND FAMILY, 2013, 75 (01) : 221 - 234
  • [9] Missing data in bioarchaeology II: A test of ordinal and continuous data imputation
    Wissler, Amanda
    Blevins, Kelly E.
    Buikstra, Jane E.
    AMERICAN JOURNAL OF BIOLOGICAL ANTHROPOLOGY, 2022, 179 (03): : 349 - 364
  • [10] A practical comparison of single and multiple imputation methods to handle complex missing data in air quality datasets
    Gomez-Carracedo, M. P.
    Andrade, J. M.
    Lopez-Mahia, P.
    Muniategui, S.
    Prada, D.
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2014, 134 : 23 - 33