Inverse probability weighting and doubly robust methods in correcting the effects of non-response in the reimbursed medication and self-reported turnout estimates in the ATH survey

被引:62
作者
Harkanen, Tommi [1 ]
Kaikkonen, Risto [1 ]
Virtala, Esa [1 ]
Koskinen, Seppo [1 ]
机构
[1] Funct Capac & Welf Natl Inst Hlth & Welf THL, Dept Hlth, FI-00271 Helsinki, Finland
基金
芬兰科学院;
关键词
Missing data; Population survey; Register data; Model selection; Inverse probability weighting; Doubly robust methods; HEALTH SURVEYS; VOTER TURNOUT; MISSING DATA; FOLLOW-UP; REGISTER; PERFORMANCE; INFERENCE; VALIDITY; RATES;
D O I
10.1186/1471-2458-14-1150
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Background: To assess the nonresponse rates in a questionnaire survey with respect to administrative register data, and to correct the bias statistically. Methods: The Finnish Regional Health and Well-being Study (ATH) in 2010 was based on a national sample and several regional samples. Missing data analysis was based on socio-demographic register data covering the whole sample. Inverse probability weighting (IPW) and doubly robust (DR) methods were estimated using the logistic regression model, which was selected using the Bayesian information criteria. The crude, weighted and true self-reported turnout in the 2008 municipal election and prevalences of entitlements to specially reimbursed medication, and the crude and weighted body mass index (BMI) means were compared. Results: The IPW method appeared to remove a relatively large proportion of the bias compared to the crude prevalence estimates of the turnout and the entitlements to specially reimbursed medication. Several demographic factors were shown to be associated with missing data, but few interactions were found. Conclusions: Our results suggest that the IPW method can improve the accuracy of results of a population survey, and the model selection provides insight into the structure of missing data. However, health-related missing data mechanisms are beyond the scope of statistical methods, which mainly rely on socio-demographic information to correct the results.
引用
收藏
页数:10
相关论文
共 38 条
[1]  
[Anonymous], 2008, P IAOS C RESH OFF ST
[2]  
[Anonymous], 2011, R LANG ENV STAT COMP
[3]   The performance of different propensity-score methods for estimating relative risks [J].
Austin, Peter C. .
JOURNAL OF CLINICAL EPIDEMIOLOGY, 2008, 61 (06) :537-545
[4]   Doubly robust estimation in missing data and causal inference models [J].
Bang, H .
BIOMETRICS, 2005, 61 (04) :962-972
[5]   Variable selection for propensity score models [J].
Brookhart, M. Alan ;
Schneeweiss, Sebastian ;
Rothman, Kenneth J. ;
Glynn, Robert J. ;
Avorn, Jerry ;
Sturmer, Til .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 2006, 163 (12) :1149-1156
[6]  
de Heer W., 1999, J OFF STAT, V15, P129
[7]   Nonresponse in a community cohort study - Predictors and consequences for exposure-disease associations [J].
Eagan, TML ;
Eide, GE ;
Gulsvik, A ;
Bakke, PS .
JOURNAL OF CLINICAL EPIDEMIOLOGY, 2002, 55 (08) :775-781
[8]  
Finland S, 2004, USE REGISTERS ADM DA
[9]   Participation rates in epidemiologic studies [J].
Galea, Sandro ;
Tracy, Melissa .
ANNALS OF EPIDEMIOLOGY, 2007, 17 (09) :643-653
[10]   SELF-REPORTED TURNOUT AND VOTER VALIDATION [J].
GRANBERG, D ;
HOLMBERG, S .
AMERICAN JOURNAL OF POLITICAL SCIENCE, 1991, 35 (02) :448-459