Maximum likelihood estimation with missing outcomes: From simplicity to complexity

被引:14
作者
Baker, Stuart G. [1 ]
机构
[1] NCI, Canc Prevent Div, 9609 Med Ctr Dr 5E638, Bethesda, MD 20892 USA
基金
美国国家卫生研究院;
关键词
composite linear model; double sampling; latent class instrumental variable; missing-data mechanism; perfect fit analysis; randomized trial; GROUPED SURVIVAL-DATA; SENSITIVITY-ANALYSIS; REGRESSION-ANALYSIS; RANDOMIZED-TRIALS; IDENTIFICATION; VARIABLES; SUBJECT; MODELS;
D O I
10.1002/sim.8319
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Many clinical or prevention studies involve missing or censored outcomes. Maximum likelihood (ML) methods provide a conceptually straightforward approach to estimation when the outcome is partially missing. Methods of implementing ML methods range from the simple to the complex, depending on the type of data and the missing-data mechanism. Simple ML methods for ignorable missing-data mechanisms (when data are missing at random) include complete-case analysis, complete-case analysis with covariate adjustment, survival analysis with covariate adjustment, and analysis via propensity-to-be-missing scores. More complex ML methods for ignorable missing-data mechanisms include the analysis of longitudinal dropouts via a marginal model for continuous data or a conditional model for categorical data. A moderately complex ML method for categorical data with a saturated model and either ignorable or nonignorable missing-data mechanisms is a perfect fit analysis, an algebraic method involving closed-form estimates and variances. A complex and flexible ML method with categorical data and either ignorable or nonignorable missing-data mechanisms is the method of composite linear models, a matrix method requiring specialized software. Except for the method of composite linear models, which can involve challenging matrix specifications, the implementation of these ML methods ranges in difficulty from easy to moderate.
引用
收藏
页码:4453 / 4474
页数:22
相关论文
共 35 条
[1]  
Allison PD, 2012, SAS GLOB FOR 2012
[2]  
Angrist JD, 1996, J AM STAT ASSOC, V91, P444, DOI 10.2307/2291629
[3]  
[Anonymous], J ROY STAT SOC D STA
[4]  
Baker S.G., 1992, Journal of Computational and Graphical Statistics, V1, P63, DOI DOI 10.2307/1390600
[5]   THE PAIRED AVAILABILITY DESIGN - A PROPOSAL FOR EVALUATING EPIDURAL ANALGESIA DURING LABOR [J].
BAKER, SG ;
LINDEMAN, KS .
STATISTICS IN MEDICINE, 1994, 13 (21) :2269-2278
[6]   Simple adjustments for randomized trials with nonrandomly missing or censored outcomes arising from informative covariates [J].
Baker, SG ;
Fitzmaurice, GM ;
Freedman, LS ;
Kramer, BS .
BIOSTATISTICS, 2006, 7 (01) :29-40
[7]   Simplemaximum likelihood estimates of efficacy in randomized trials and before-and-after studies, with implications formeta-analysis [J].
Baker, SG ;
Kramer, BS .
STATISTICAL METHODS IN MEDICAL RESEARCH, 2005, 14 (04) :349-367
[8]   REGRESSION-ANALYSIS FOR CATEGORICAL VARIABLES WITH OUTCOME SUBJECT TO NONIGNORABLE NONRESPONSE [J].
BAKER, SG ;
LAIRD, NM .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1988, 83 (401) :62-69
[9]   MARGINAL REGRESSION FOR REPEATED BINARY DATA WITH OUTCOME SUBJECT TO NON-IGNORABLE NONRESPONSE [J].
BAKER, SG .
BIOMETRICS, 1995, 51 (03) :1042-1052
[10]   Analysis of survival data from a randomized trial with all-or-none compliance: Estimating the cost-effectiveness of a cancer screening program [J].
Baker, SG .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1998, 93 (443) :929-934