Accounting for Misclassified Outcomes in Binary Regression Models Using Multiple Imputation With Internal Validation Data

被引:67
作者
Edwards, Jessie K. [1 ]
Cole, Stephen R. [1 ]
Troester, Melissa A. [1 ]
Richardson, David B. [1 ]
机构
[1] Univ N Carolina, Dept Epidemiol, Gillings Sch Global Publ Hlth, Chapel Hill, NC 27599 USA
基金
美国国家卫生研究院;
关键词
bias(epidemiology); logistic regression; Monte Carlo method; sensitivity and specificity; VIRUS EYE DISEASE; LOGISTIC-REGRESSION; MAXIMUM-LIKELIHOOD; MISSING-DATA; RATIOS; PREVENTION; ACYCLOVIR; RISK;
D O I
10.1093/aje/kws340
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Outcome misclassification is widespread in epidemiology, but methods to account for it are rarely used. We describe the use of multiple imputation to reduce bias when validation data are available for a subgroup of study participants. This approach is illustrated using data from 308 participants in the multicenter Herpetic Eye Disease Study between 1992 and 1998 (48% female; 85% white; median age, 49 years). The odds ratio comparing the acyclovir group with the placebo group on the gold-standard outcome (physician-diagnosed herpes simplex virus recurrence) was 0.62 (95% confidence interval (CI): 0.35, 1.09). We masked ourselves to physician diagnosis except for a 30% validation subgroup used to compare methods. Multiple imputation (odds ratio (OR) = 0.60; 95% CI: 0.24, 1.51) was compared with naive analysis using self-reported outcomes (OR = 0.90; 95% CI: 0.47, 1.73), analysis restricted to the validation subgroup (OR = 0.57; 95% CI: 0.20, 1.59), and direct maximum likelihood (OR = 0.62; 95% CI: 0.26, 1.53). In simulations, multiple imputation and direct maximum likelihood had greater statistical power than did analysis restricted to the validation subgroup, yet all 3 provided unbiased estimates of the odds ratio. The multiple-imputation approach was extended to estimate risk ratios using log-binomial regression. Multiple imputation has advantages regarding flexibility and ease of implementation for epidemiologists familiar with missing data methods.
引用
收藏
页码:904 / 912
页数:9
相关论文
共 30 条
[1]  
ALBERT A, 1984, BIOMETRIKA, V71, P1
[2]  
Allison PD, 2001, Missing Data. Quantitative Applications in the Social Sciences, V136
[3]   USE OF THE PREVALENCE RATIO-UPSILON THE PREVALENCE ODDS RATIO AS A MEASURE OF RISK IN CROSS-SECTIONAL STUDIES [J].
AXELSON, O ;
FREDRIKSSON, M ;
EKBERG, K .
OCCUPATIONAL AND ENVIRONMENTAL MEDICINE, 1994, 51 (08) :574-574
[4]  
Beck RW, 2000, ARCH OPHTHALMOL-CHIC, V118, P1030
[5]   MISCLASSIFICATION IN 2 X 2 TABLES [J].
BROSS, I .
BIOMETRICS, 1954, 10 (04) :478-486
[6]   THE INCIDENCE OF MONOTONE LIKELIHOOD IN THE COX MODEL [J].
BRYSON, MC ;
JOHNSON, ME .
TECHNOMETRICS, 1981, 23 (04) :381-383
[7]  
Carroll J., 2006, MEASUREMENT ERROR NO, V2nd edn, DOI [10.1201/9781420010138, DOI 10.1201/9781420010138]
[8]   Estimation of Risk Ratios in Cohort Studies With Common Outcomes A Bayesian Approach [J].
Chu, Haitao ;
Cole, Stephen R. .
EPIDEMIOLOGY, 2010, 21 (06) :855-862
[9]   Analysis of longitudinal binary data from multiphase sampling [J].
Clayton, D ;
Spiegelhalter, D ;
Dunn, G ;
Pickles, A .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1998, 60 :71-87
[10]  
Cohen F, 2000, ARCH OPHTHALMOL-CHIC, V118, P1617