Conditional validation sampling for consistent risk estimation with binary outcome data subject to misclassification

被引:4
作者
Gravel, Christopher A. [1 ,2 ,3 ,4 ,5 ]
Farrell, Patrick J. [1 ,2 ]
Krewski, Daniel [1 ,2 ,3 ,5 ]
机构
[1] Carleton Univ, Sch Math & Stat, Ottawa, ON, Canada
[2] Univ Ottawa, McLaughlin Ctr Populat Hlth Risk Assessment, Ottawa, ON, Canada
[3] Univ Ottawa, Sch Epidemiol & Publ Hlth, Ottawa, ON, Canada
[4] McGill Univ, Dept Epidemiol Biostat & Occupat Hlth, Montreal, PQ, Canada
[5] Risk Sci Int, Ottawa, ON, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
contingency tables; misclassification bias; misclassified binary data; pharmacoepidemiology; validation sampling;
D O I
10.1002/pds.4701
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Purpose Misclassification of a binary outcome can introduce bias in estimation of the odds-ratio associated with an exposure of interest in pharmacoepidemiology research. It has been previously demonstrated that utilizing information from an internal randomly selected validation sample can help mitigate this bias. Methods Using a Monte Carlo simulation-based approach, we study the properties of misclassification bias-adjusted odds-ratio estimators in a contingency table setting. We consider two methods of internal validation sampling; namely, simple random sampling and sampling conditional on the original (possibly incorrect) outcome status. Additional simulation studies are conducted to investigate these sampling approaches in a multi-table setting. Results We demonstrate that conditional validation sampling, across a range of subsampling fractions, can produce better estimates than those based on an unconditional simple random sample. This approach allows for greater flexibility in the chosen categorical composition of the validation data, as well as the potential for obtaining a more efficient estimator of the odds-ratio. We further demonstrate that this relationship holds for the Mantel-Haenszel misclassification bias-adjusted odds-ratio in stratified samples. Recommendations for the choice of validation subsampling fraction are also provided. Conclusions Careful consideration when choosing the sampling scheme used to draw internal validation samples can improve the properties of the outcome misclassification bias-adjusted odds-ratio estimator in a (multiple) contingency table.
引用
收藏
页码:227 / 233
页数:7
相关论文
共 10 条
[1]  
Agresti A., 2002, Categorical data analysis
[2]   Accuracy of ICD-9-CM codes in identifying infections of pneumonia and herpes simplex virus in administrative data [J].
Drahos, Jennifer ;
Vanwormer, Jeffrey J. ;
Greenlee, Robert T. ;
Landgren, Ola ;
Koshiol, Jill .
ANNALS OF EPIDEMIOLOGY, 2013, 23 (05) :291-293
[3]  
Gravel CA, 2015, THESIS
[4]   Maximum-likelihood and closed-form estimators of epidemiologic measures under misclassification [J].
Greenland, Sander .
JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2008, 138 (02) :528-538
[5]   Good practices for quantitative bias analysis [J].
Lash, Timothy L. ;
Fox, Matthew P. ;
MacLehose, Richard F. ;
Maldonado, George ;
McCandless, Lawrence C. ;
Greenland, Sander .
INTERNATIONAL JOURNAL OF EPIDEMIOLOGY, 2014, 43 (06) :1969-1985
[6]   Use of electronic health record data to identify skin and soft tissue infections in primary care settings: a validation study [J].
Levine, Pamela J. ;
Elman, Miriam R. ;
Kullar, Ravina ;
Townes, John M. ;
Bearden, David T. ;
Vilches-Tran, Rowena ;
McClellan, Ian ;
McGregor, Jessina C. .
BMC INFECTIOUS DISEASES, 2013, 13
[7]   A note on estimating crude odds ratios in case-control studies with differentially misclassified exposure [J].
Lyles, RH .
BIOMETRICS, 2002, 58 (04) :1034-1036
[8]  
MANTEL N, 1959, JNCI-J NATL CANCER I, V22, P719
[9]   ESTIMATORS OF THE MANTEL-HAENSZEL VARIANCE CONSISTENT IN BOTH SPARSE DATA AND LARGE-STRATA LIMITING MODELS [J].
ROBINS, J ;
BRESLOW, N ;
GREENLAND, S .
BIOMETRICS, 1986, 42 (02) :311-323
[10]  
Tang Li, 2013, Epidemiol Methods, V2, P49