Missing data: A statistical framework for practice

被引:96
作者
Carpenter, James R. [1 ,2 ]
Smuk, Melanie [1 ]
机构
[1] London Sch Hyg & Trop Med, Dept Med Stat, Keppel St, London WC1E 7HT, England
[2] UCL, MRC Clin Trials Unit, London, England
基金
英国医学研究理事会;
关键词
complete records; missing data; multiple imputation; sensitivity analysis; MULTIPLE-IMPUTATION; INFERENCE; MODELS; TRIALS; LIKELIHOOD; OUTCOMES;
D O I
10.1002/bimj.202000196
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Missing data are ubiquitous in medical research, yet there is still uncertainty over when restricting to the complete records is likely to be acceptable, when more complex methods (e.g. maximum likelihood, multiple imputation and Bayesian methods) should be used, how they relate to each other and the role of sensitivity analysis. This article seeks to address both applied practitioners and researchers interested in a more formal explanation of some of the results. For practitioners, the framework, illustrative examples and code should equip them with a practical approach to address the issues raised by missing data (particularly using multiple imputation), alongside an overview of how the various approaches in the literature relate. In particular, we describe how multiple imputation can be readily used for sensitivity analyses, which are still infrequently performed. For those interested in more formal derivations, we give outline arguments for key results, use simple examples to show how methods relate, and references for full details. The ideas are illustrated with a cohort study, a multi-centre case control study and a randomised clinical trial.
引用
收藏
页码:915 / 947
页数:33
相关论文
共 92 条
  • [51] MKendrick A.G., 1925, Proc. Edinb. Math. Soc., V44, P98, DOI DOI 10.1017/S0013091500034428
  • [52] Moher D, 2010, BMJ-BRIT MED J, V340, DOI [10.1136/bmj.c869, 10.1016/j.jclinepi.2010.02.005, 10.1136/bmj.c332, 10.4103/0976-500X.72352, 10.1016/j.ijsu.2011.09.004, 10.1186/1741-7015-8-18]
  • [53] Multiple imputation for an incomplete covariate that is a ratio
    Morris, Tim P.
    White, Ian R.
    Royston, Patrick
    Seaman, Shaun R.
    Wood, Angela M.
    [J]. STATISTICS IN MEDICINE, 2014, 33 (01) : 88 - 104
  • [54] National Research Council, 2010, The Prevention and Treatment of Missing Data in Clinical Trials
  • [55] Missing values in longitudinal dietary data: A multiple imputation approach based on a fully conditional specification
    Nevalainen, Jaakko
    Kenward, Michael G.
    Virtanen, Suvi A.
    [J]. STATISTICS IN MEDICINE, 2009, 28 (29) : 3657 - 3669
  • [56] Orchard T., 1972, P 6 BERK S MATH STAT, V1, P697, DOI DOI 10.1525/9780520325883-036
  • [57] MULTIPLE IMPUTATION WITH SURVEY WEIGHTS: A MULTILEVEL APPROACH
    Quartagno, M.
    Carpenter, J. R.
    Goldstein, H.
    [J]. JOURNAL OF SURVEY STATISTICS AND METHODOLOGY, 2020, 8 (05) : 965 - 989
  • [58] Multiple imputation for IPD meta-analysis: allowing for heterogeneity and studies with missing covariates
    Quartagno, M.
    Carpenter, J. R.
    [J]. STATISTICS IN MEDICINE, 2016, 35 (17) : 2938 - 2954
  • [59] Quartagno M, 2019, R J, V11, P205
  • [60] Multiple imputation for discrete data: Evaluation of the joint latent normal model
    Quartagno, Matteo
    Carpenter, James R.
    [J]. BIOMETRICAL JOURNAL, 2019, 61 (04) : 1003 - 1019