A bootstrap method to avoid the effect of concurvity in generalised additive models in time series studies of air pollution

被引:18
作者
Figueiras, A
Roca-Pardiñas, J
Cadarso-Suárez, C
机构
[1] Univ Santiago de Compostela, Biostat Unit, Dept Stat & Operat Res, Santiago De Compostela, Spain
[2] Univ Vigo, Dept Stat & Operat Res, Vigo, Spain
[3] Univ Santiago de Compostela, Dept Prevent Med, Santiago De Compostela, Spain
关键词
D O I
10.1136/jech.2004.026740
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Background: In recent years a great number of studies have applied generalised additive models (GAMs) to time series data to estimate the short term health effects of air pollution. Lately, however, it has been found that concurvity-the non-parametric analogue of multicollinearity-might lead to underestimation of standard errors of the effects of independent variables. Underestimation of standard errors means that for concurvity levels commonly present in the data, the risk of committing type I error rises by over threefold. Methods: This study developed a conditional bootstrap methology that consists of assuming that the outcome in any observation is conditional upon the values of the set of independent variables used. It then tested this procedure by means of a simulation study using a Poisson additive model. The response variable of this model is a function of an unobserved confounding variable (that introduces trend and seasonality), real black smoke data, and temperature. Scenarios were created with different coefficients and degrees of concurvity. Results: Conditional bootstrap provides confidence intervals with coverages close to nominal (95%), irrespective of the degree of concurvity, number of variables in the model or magnitude of the coefficient to be estimated (for example, for a concurvity of 0.85, bootstrap confidence interval coverage is 95% compared with 71% in the case of the asymptotic interval obtained directly with S-plus gam function). Conclusions: The bootstrap method avoids the problem of concurvity in time series studies of air pollution, and is easily generalised to non-linear dose-risk effects. All bootstrap calculations described in this paper can be performed using S-Plus gam.boot software.
引用
收藏
页码:881 / 884
页数:4
相关论文
共 21 条
[1]  
[Anonymous], 1993, INTRO BOOTSTRAP
[2]   Control for seasonal variation and time trend in case crossover studies of acute effects of environmental exposures [J].
Bateson, TF ;
Schwartz, J .
EPIDEMIOLOGY, 1999, 10 (05) :539-544
[3]   Underestimation of standard errors in multi-site time series studies [J].
Daniels, MJ ;
Dominici, F ;
Zeger, S .
EPIDEMIOLOGY, 2004, 15 (01) :57-62
[4]   On the use of generalized additive models in time-series studies of air pollution and health [J].
Dominici, F ;
McDermott, A ;
Zeger, SL ;
Samet, JM .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 2002, 156 (03) :193-203
[5]  
DOMINICI F, SOFTWARE COMPUTING A
[6]  
DOMINICI F, 2003, ISSUES SEMIPARAMETRI
[7]   Flexible smoothing with B-splines and penalties [J].
Eilers, PHC ;
Marx, BD .
STATISTICAL SCIENCE, 1996, 11 (02) :89-102
[8]   Analysis of case-crossover designs using longitudinal approaches -: A simulotion study [J].
Figueiras, A ;
Carracedo-Martínez, E ;
Saez, M ;
Taracido, M .
EPIDEMIOLOGY, 2005, 16 (02) :239-246
[9]   COMPARING NONPARAMETRIC VERSUS PARAMETRIC REGRESSION FITS [J].
HARDLE, W ;
MAMMEN, E .
ANNALS OF STATISTICS, 1993, 21 (04) :1926-1947
[10]  
Hastie T., 1990, Generalized additive model