The performance of multiple imputation for missing covariate data within the context of regression relative survival analysis

被引:29
作者
Giorgi, Roch [1 ]
Belot, Aurelien [2 ,3 ]
Gaudart, Jean [1 ]
Launoy, Guy [4 ]
机构
[1] Univ Mediterranee, Fac Med, Lab Enseignement & Rech Traitement Informat Med, EA 3283, F-13385 Marseille, France
[2] Univ Lyon 1, CNRS,Lab Biostat Sante, Serv Biostat, Hosp Civils Lyon,UMR 5558, Pierre Benite, France
[3] Inst Veille Sanitaire, Dept Malad Chron & Traumatismes, St Maurice, France
[4] FRANCIM, INSERM Canc & Populat, ER13, Caen, France
[5] Fac Med Toulouse, Head Off Reseau FRANCIM, Toulouse, France
关键词
missing data; multiple imputation; proportional hazards model; relative survival; colon cancer;
D O I
10.1002/sim.3476
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Relative survival assesses the effects of prognostic factors on disease-specific mortality when the cause of death is uncertain or unavailable. It provides an estimate of patients' survival, allowing for the effects of other independent causes of death. Regress ion-based relative survival models are commonly used in population-based studies to model the effects of some prognostic factors and to estimate net survival. Most often, studies focus on routinely collected prognostic factors for which the proportion of missing values is usually low (around 5 per cent). However, in some cases, additional factors are collected with a greater proportion of missingness. In the present article, we systematically assess the performance of multiple imputation in regression analysis of relative survival through a series of simulation experiments. According to the assumptions concerning the missingness mechanism (completely at random, at random, and not at random) and the missingness pattern (monotone, non-monotone), several strategies were considered and compared: all cases analysis, complete cases analysis, missing data indicator analysis, and multiple imputation by chained equations (MICE) analysis. We showed that MICE performs well in estimating the hazard ratios and the baseline hazard function when the missing mechanism is missing at random (MAR) conditionally on the vital status. In the situations where the missing mechanism was not MAR conditionally on vital status, complete case behaves consistently. As illustration, we used data of the French Cancer Registries on relative survival of patients with colorectal cancer. Copyright (C) 2008 John Wiley & Sons, Ltd.
引用
收藏
页码:6310 / 6331
页数:22
相关论文
共 48 条
  • [1] AITKIN I, STAT US GROUP 1 AUST
  • [2] Allison PD, 2010, HANDBOOK OF SURVEY RESEARCH, 2ND EDITION, P631
  • [3] A comparison of imputation techniques for handling missing predictor values in a risk model with a binary outcome
    Ambler, Gareth
    Omar, Rumana Z.
    Royston, Patrick
    [J]. STATISTICAL METHODS IN MEDICAL RESEARCH, 2007, 16 (03) : 277 - 298
  • [4] [Anonymous], MULTIPLE IMPUTATION
  • [5] Multiple imputation of baseline data in the cardiovascular health study
    Arnold, AM
    Kronmal, RA
    [J]. AMERICAN JOURNAL OF EPIDEMIOLOGY, 2003, 157 (01) : 74 - 84
  • [6] BERKSON J, 1950, P STAFF M MAYO CLIN, V25, P270
  • [7] Bolard P, 2002, J Cancer Epidemiol Prev, V7, P113
  • [8] Modelling time-dependent hazard ratios in relative survival: Application to colon cancer
    Bolard, P
    Quantin, C
    Esteve, J
    Faivre, J
    Abrahamowicz, M
    [J]. JOURNAL OF CLINICAL EPIDEMIOLOGY, 2001, 54 (10) : 986 - 996
  • [9] The design of simulation studies in medical statistics
    Burton, Andrea
    Altman, Douglas G.
    Royston, Patrick
    Holder, Roger L.
    [J]. STATISTICS IN MEDICINE, 2006, 25 (24) : 4279 - 4292
  • [10] A prognostic model for ovarian cancer
    Clark, TG
    Stewart, ME
    Altman, DG
    Gabra, H
    Smyth, JF
    [J]. BRITISH JOURNAL OF CANCER, 2001, 85 (07) : 944 - 952