Selection and Misclassification Biases in Longitudinal Studies

被引:27
作者
Haine, Denis [1 ,2 ]
Dohoo, Ian [2 ,3 ]
Dufour, Simon [1 ,2 ]
机构
[1] Univ Montreal, Fac Med Vet, Montreal, PQ, Canada
[2] Canadian Bovine Mastitis & Milk Qual Res Network, St Hyacinthe, PQ, Canada
[3] Univ Prince Edward Isl, Atlantic Vet Coll, Ctr Vet Epidemiol Res, Charlottetown, PE, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
bias (epidemiology); longitudinal study; selection bias; misclassification; epidemiologic methods; NONDIFFERENTIAL MISCLASSIFICATION; DISEASE MISCLASSIFICATION; INTRAMAMMARY INFECTION; POWER CALCULATIONS; RISK-FACTORS; SAMPLE-SIZE; EXPOSURE; SENSITIVITY; PREVALENCE; REGRESSION;
D O I
10.3389/fvets.2018.00099
中图分类号
S85 [动物医学(兽医学)];
学科分类号
0906 ;
摘要
Using imperfect tests may lead to biased estimates of disease frequency and measures of association. Many studies have looked into the effect of misclassification on statistical inferences. These evaluations were either within a cross-sectional study framework, assessing biased prevalence, or for cohort study designs, evaluating biased incidence rate or risk ratio estimates based on misclassification at one of the two time-points (initial assessment or follow-up). However, both observations at risk and incident cases can be wrongly identified in longitudinal studies, leading to selection and misclassification biases, respectively. The objective of this paper was to evaluate the relative impact of selection and misclassification biases resulting from misclassification, together, on measures of incidence and risk ratio. To investigate impact on measure of disease frequency, data sets from a hypothetical cohort study with two samples collected one month apart were simulated and analyzed based on specific test and disease characteristics, with no elimination of disease during the sampling interval or clustering of observations. Direction and magnitude of bias due to selection, misclassification, and total bias was assessed for diagnostic test sensitivity and specificity ranging from 0.7 to 1.0 and 0.8 to 1.0, respectively, and for specific disease contexts, i.e., disease prevalences of 5 and 20%, and disease incidences of 0.01, 0.05, and 0.1 cases/animal-month. A hypothetical exposure with known strength of association was also generated. A total of 1,000 cohort studies of 1,000 observations each were simulated for these six disease contexts where the same diagnostic test was used to identify observations at risk at beginning of the cohort and incident cases at its end. Our results indicated that the departure of the estimates of disease incidence and risk ratio from their true value were mainly a function of test specificity, and disease prevalence and incidence. The combination of the two biases, at baseline and follow-up, revealed the importance of a good to excellent specificity relative to sensitivity for the diagnostic test. Small divergence from perfect specificity extended quickly to disease incidence over-estimation as true prevalence increased and true incidence decreased. A highly sensitive test to exclude diseased subjects at baseline was of less importance to minimize bias than using a highly specific one at baseline. Near perfect diagnostic test attributes were even more important to obtain a measure of association close to the true risk ratio, according to specific disease characteristics, especially its prevalence. Low prevalent and high incident disease lead to minimal bias if disease is diagnosed with high sensitivity and close to perfect specificity at baseline and follow-up. For more prevalent diseases we observed large risk ratio biases towards the null value, even with near perfect diagnosis.
引用
收藏
页数:8
相关论文
共 42 条
[1]  
[Anonymous], 2017, R LANG ENV STAT COMP
[2]  
[Anonymous], 2012, MODERN EPIDEMIOLOGY
[3]   TESTING INDEPENDENCE IN 2-WAY CONTINGENCY TABLES WITH DATA SUBJECT TO MISCLASSIFICATION [J].
ASSAKUL, K ;
PROCTOR, CH .
PSYCHOMETRIKA, 1967, 32 (01) :67-67
[4]   REPORTING ERRORS IN TIME-TO-PREGNANCY DATA COLLECTED WITH A SHORT QUESTIONNAIRE - IMPACT ON POWER AND ESTIMATION OF FECUNDABILITY RATIOS [J].
BAIRD, DD ;
WEINBERG, CR ;
ROWLAND, AS .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 1991, 133 (12) :1282-1290
[5]   EFFECTS OF MISCLASSIFICATION ON ESTIMATION OF RELATIVE RISK [J].
BARRON, BA .
BIOMETRICS, 1977, 33 (02) :414-418
[6]   USE OF THE POSITIVE PREDICTIVE VALUE TO CORRECT FOR DISEASE MISCLASSIFICATION IN EPIDEMIOLOGIC STUDIES [J].
BRENNER, H ;
GEFELLER, O .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 1993, 138 (11) :1007-1015
[7]   THE EFFECTS OF SENSITIVITY AND SPECIFICITY OF CASE SELECTION ON VALIDITY, SAMPLE-SIZE, PRECISION, AND POWER IN HOSPITAL-BASED CASE-CONTROL STUDIES [J].
BRENNER, H ;
SAVITZ, DA .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 1990, 132 (01) :181-192
[8]   MISCLASSIFICATION IN 2 X 2 TABLES [J].
BROSS, I .
BIOMETRICS, 1954, 10 (04) :478-486
[9]   Simulation-based power calculations for large cohort studies [J].
Brown, Patrick ;
Jiang, Hedy .
BIOMETRICAL JOURNAL, 2010, 52 (05) :604-615
[10]   Bayesian estimation of the accuracy of the calf respiratory scoring chart and ultrasonography for the diagnosis of bovine respiratory disease in pre-weaned dairy calves [J].
Buczinski, Sebastien ;
Ollivett, Terri L. ;
Dendukuri, Nandini .
PREVENTIVE VETERINARY MEDICINE, 2015, 119 (3-4) :227-231