Multiple imputation in a large-scale complex survey: a practical guide

Citations: 117
Authors
He, Y. [1]
Zaslavsky, A. M. [1]
Landrum, M. B. [1]
Harrington, D. P. [2]
Catalano, P. [2]
Affiliations
[1] Harvard Univ, Sch Med, Dept Hlth Care Policy, Boston, MA 02115 USA
[2] Dana Farber Canc Inst, Dept Biostat & Computat Biol, Boston, MA 02115 USA
Keywords
FULLY CONDITIONAL SPECIFICATION; MODEL;
DOI
10.1177/0962280208101273
Chinese Library Classification (CLC)
R19 [Health care organizations and services (health services administration)];
Abstract
The Cancer Care Outcomes Research and Surveillance (CanCORS) Consortium is a multisite, multimode, multiwave study of the quality and patterns of care delivered to population-based cohorts of newly diagnosed patients with lung and colorectal cancer. As is typical in observational studies, missing data are a serious concern for CanCORS, following complicated patterns that pose severe challenges for the consortium investigators. Despite the popularity of multiple imputation of missing data, its acceptance and application still lag in large-scale studies with complicated data sets such as CanCORS. We use sequential regression multiple imputation, implemented in publicly available software, to deal with non-response in the CanCORS surveys and construct a centralised completed database that can be easily used by investigators from multiple sites. Our work illustrates the feasibility of multiple imputation in a large-scale multiobjective survey, showing its capacity to handle complex missing data. We present the implementation process in detail as an example for practitioners and discuss some of the challenging issues that need further research.
Pages: 653-670
Page count: 18