Missing data in surveys: Key concepts, approaches, and applications

Cited by: 106
Authors
Mirzaei, Ardalan [1]
Carter, Stephen R. [1]
Patanwala, Asad E. [1]
Schneider, Carl R. [1]
Affiliations
[1] Univ Sydney, Fac Med & Hlth, Sch Pharm, Sydney, NSW, Australia
Keywords
Missing data; Research design; Questionnaire design; Research methods; Surveys; MAXIMUM-LIKELIHOOD; NONRESPONSE; IMPUTATION
DOI
10.1016/j.sapharm.2021.03.009
Chinese Library Classification (CLC) number
R1 [Preventive medicine, hygiene]
Subject classification codes
1004; 120402
Abstract
A recent review of missing data in the pharmacy literature highlighted that a low proportion of studies reported how missing data was handled. In this paper we discuss the concept of missing data in survey research, how missing data is classified, common techniques to account for missingness, and how to report on missing data. The paper provides guidance on mitigating the occurrence of missing data through planning. Considerations include estimating expected missing data, intended vs unintended missing data, survey length, working with electronic surveys, choosing between standard and filtered form questions, forced responses and straight-lining, as well as responses that can generate missingness, such as "I don't know" and "Not Applicable". We introduce methods for analysing data with missing values, such as deletion, imputation and likelihood methods. The manuscript provides a framework and flow chart for choosing the appropriate analysis method based on how much missing data is observed and the type of missingness. Special circumstances involving missing data are discussed, such as studies with repeated or cohort measures, factor analysis, or data integration. Finally, a checklist of questions is provided to guide researchers in reporting missing data when conducting future research.
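As a rough illustration of two of the method families named in the abstract (deletion and imputation), the sketch below applies listwise deletion and single mean imputation to a tiny made-up dataset. The data, the field names `age` and `satisfaction`, and the helper functions are all hypothetical and are not taken from the paper; single mean imputation is used only because it is the simplest form to show, not because the paper recommends it.

```python
from statistics import mean

# Hypothetical survey responses; None marks a missing answer.
responses = [
    {"age": 34, "satisfaction": 4},
    {"age": None, "satisfaction": 5},
    {"age": 51, "satisfaction": None},
    {"age": 29, "satisfaction": 3},
]


def missing_rate(rows, key):
    """Proportion of rows in which `key` is missing."""
    return sum(r[key] is None for r in rows) / len(rows)


def listwise_delete(rows):
    """Deletion: keep only rows with no missing values (complete cases)."""
    return [r for r in rows if all(v is not None for v in r.values())]


def mean_impute(rows, key):
    """Imputation: fill missing values of `key` with the observed mean."""
    fill = mean(r[key] for r in rows if r[key] is not None)
    return [{**r, key: fill if r[key] is None else r[key]} for r in rows]


complete_cases = listwise_delete(responses)
age_imputed = mean_impute(responses, "age")

print("share of missing ages:", missing_rate(responses, "age"))
print("rows kept after listwise deletion:", len(complete_cases))
print("rows kept after imputing age:", len(age_imputed))
```

Deletion shrinks the sample to the complete cases, whereas imputation keeps every row at the cost of inserting estimated values; which trade-off is acceptable depends on how much data is missing and on the type of missingness.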
Pages: 2308-2316
Page count: 9