Modelling bias in combining small area prevalence estimates from multiple surveys

被引:16
作者
Manzi, Giancarlo [3 ]
Spiegelhalter, David J.
Turner, Rebecca M.
Flowers, Julian [2 ]
Thompson, Simon G. [1 ]
机构
[1] MRC, Biostat Unit, Inst Publ Hlth, Cambridge CB2 0SR, England
[2] Eastern Reg Publ Hlth Observ, Cambridge, England
[3] Univ Milan, I-20122 Milan, Italy
基金
英国医学研究理事会;
关键词
Bias modelling; Hierarchical models; Meta-analysis; Mixed effects models; Multiple survey data; Small area estimation; Smoking prevalence; FRAME SURVEYS; RISK-FACTORS; INFORMATION; HEALTH;
D O I
10.1111/j.1467-985X.2010.00648.x
中图分类号
O1 [数学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 0701 ; 070101 ;
摘要
Combining information from multiple surveys can improve the quality of small area estimates. Customary approaches, such as themultiple-frame and statistical matching methods, require individual level data, whereas in practice often only multiple aggregate estimates are available. Commercial surveys usually produce such estimates without clear description of the methodology that is used. In this context, bias modelling is crucial, and we propose a series of Bayesian hierarchical models which allow for additive biases. Some of these models can also be fitted in a classical context, by using a mixed effects framework. We apply these methods to obtain estimates of smoking prevalence in local authorities across the east of England from seven surveys. All the surveys provide smoking prevalence estimates and confidence intervals at the local authority level, but they vary by time, sample size and transparency of methodology. Our models adjust for the biases in commercial surveys but incorporate information from all the sources to provide more accurate and precise estimates.
引用
收藏
页码:31 / 50
页数:20
相关论文
共 34 条
[1]  
[Anonymous], 2011, Data analysis using regression and multilevel/hierarchical models
[2]  
Bajekal M., 2004, Synthetic estimation of healthy lifestyle indicators: Stage 1 report
[3]  
Bates D., 2009, Mixed-Effects Models in S and S-PLUS
[4]   Bayesian non-parametric models for regional prevalence estimation [J].
Branscum, Adam J. ;
Hanson, Timothy E. ;
Gardner, Ian A. .
JOURNAL OF APPLIED STATISTICS, 2008, 35 (05) :567-582
[5]  
*E REG PUBL HLTH O, 2007, SMOK E ENGL
[6]  
*E REG PUBL HLTH O, 2007, SIMPL METH EST SMOK
[7]  
Eberly LE, 2000, STAT MED, V19, P2279, DOI 10.1002/1097-0258(20000915/30)19:17/18<2279::AID-SIM569>3.0.CO
[8]  
2-R
[9]   A Bayesian approach to combining information from a census, a coverage measurement survey, and demographic analysis [J].
Elliott, MR ;
Little, RJA .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2000, 95 (450) :351-362
[10]   Obtaining cancer risk factor prevalence estimates in small areas: combining data from two surveys [J].
Elliott, MR ;
Davis, WW .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2005, 54 :595-609