Bias due to differential participation in case-control studies and review of available approaches for adjustment

被引:18
作者
Aigner, Annette [1 ]
Grittner, Ulrike [2 ,3 ]
Becher, Heiko [1 ]
机构
[1] Univ Med Ctr Hamburg Eppendorf, Inst Med Biometry & Epidemiol, Hamburg, Germany
[2] Charite Univ Med Berlin, Ctr Stroke Res, Berlin, Germany
[3] Charite Univ Med Berlin, Dept Biostat & Clin Epidemiol, Berlin, Germany
关键词
LOGISTIC-REGRESSION; 2-STAGE; STROKE; RATES;
D O I
10.1371/journal.pone.0191327
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Objectives Low response rates in epidemiologic research potentially lead to the recruitment of a nonrepresentative sample of controls in case-control studies. Problems in the unbiased estimation of odds ratios arise when characteristics causing the probability of participation are associated with exposure and outcome. This is a specific setting of selection bias and a realistic hazard in many case-control studies. This paper formally describes the problem and shows its potential extent, reviews existing approaches for bias adjustment applicable under certain conditions, compares and applies them. Methods We focus on two scenarios: a characteristic C causing differential participation of controls is linked to the outcome through its association with risk factor E (scenario I), and C is additionally a genuine risk factor itself (scenario II). We further assume external data sources are available which provide an unbiased estimate of C in the underlying population. Given these scenarios, we (i) review available approaches and their performance in the setting of bias due to differential participation; (ii) describe two existing approaches to correct for the bias in both scenarios in more detail; (iii) present the magnitude of the resulting bias by simulation if the selection of a non-representative sample is ignored; and (iv) demonstrate the approaches' application via data from a case-control study on stroke. Findings The bias of the effect measure for variable E in scenario I and C in scenario II can be large and should therefore be adjusted for in any analysis. It is positively associated with the difference in response rates between groups of the characteristic causing differential participation, and inversely associated with the total response rate in the controls. Adjustment in a standard logistic regression framework is possible in both scenarios if the population distribution of the characteristic causing differential participation is known or can be approximated well.
引用
收藏
页数:14
相关论文
共 26 条
[1]  
[Anonymous], 2016, R LANG ENV STAT COMP
[2]  
[Anonymous], 2011, APPL QUANTITATIVE BI
[3]   Sex Differences in Stroke Epidemiology A Systematic Review [J].
Appelros, Peter ;
Stegmayr, Birgitta ;
Terent, Andreas .
STROKE, 2009, 40 (04) :1082-1090
[4]   Socioeconomic Conditions in Childhood, Adolescence, and Adulthood and the Risk of Ischemic Stroke [J].
Becher, Heiko ;
Palm, Frederick ;
Aigner, Annette ;
Safer, Anton ;
Urbanek, Christian ;
Buggle, Florian ;
Grond-Ginsbach, Caspar ;
Grau, Armin J. .
STROKE, 2016, 47 (01) :173-179
[5]   ESTIMATION OF MULTIPLE RELATIVE RISK FUNCTIONS IN MATCHED CASE-CONTROL STUDIES [J].
BRESLOW, NE ;
DAY, NE ;
HALVORSEN, KT ;
PRENTICE, RL ;
SABAI, C .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 1978, 108 (04) :299-307
[6]  
BRESLOW NE, 1988, BIOMETRIKA, V75, P11
[7]   LOGISTIC-REGRESSION ANALYSIS AND EFFICIENT DESIGN FOR 2-STAGE STUDIES [J].
CAIN, KC ;
BRESLOW, NE .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 1988, 128 (06) :1198-1206
[8]  
Carroll RJ, 1988, Transformation and weighting in regressionNew
[9]   SOME METHODS FOR STRENGTHENING THE COMMON X2 TESTS [J].
COCHRAN, WG .
BIOMETRICS, 1954, 10 (04) :417-451
[10]   Socioeconomic status and stroke [J].
Cox, AM ;
McKevitt, C ;
Rudd, AG ;
Wolfe, CDA .
LANCET NEUROLOGY, 2006, 5 (02) :181-188