Efforts to adjust for confounding by neighborhood using complex survey data

被引：12

作者：

Brumback, Babette A. ^{[1
]}

Dailey, Amy B. ^{[1
]}

He, Zhulin ^{[1
]}

Brumback, Lyndia C. ^{[2
]}

Livingston, Melvin D. ^{[1
]}

机构：

[1] Univ Florida, Dept Epidemiol & Biostat, Coll Publ Hlth & Hlth Profess, Gainesville, FL 32611 USA

[2] Univ Washington, Sch Publ Hlth, Dept Biostat, Seattle, WA 98195 USA

来源：

STATISTICS IN MEDICINE | 2010年 / 29卷 / 18期

关键词：

confounding; conditional logistic regression; generalized linear mixed models: misspecified mixing distribution; complex survey data: causal inference; LONGITUDINAL DATA; CLUSTERED DATA; MODELS; SELECTION;

D O I：

10.1002/sim.3946

中图分类号：

Q [生物科学];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

In social epidemiology, one often considers neighborhood or contextual effects on health outcomes, in addition to effects of individual exposures. This paper is concerned with the estimation of an individual exposure effect in the presence of confounding by neighborhood effects, motivated by an analysis of National health Interview Survey (NHIS) data. In the analysis, we operationalize neighborhood as the secondary sampling unit of the survey, which consists of small groups of neighboring census blocks. Thus the neighborhoods are sampled with unequal probabilities, as are individuals within neighborhoods. We develop and compare several approaches for the analysis of the effect of dichotomized individual-level education on the receipt of adequate mammography screening. In the analysis, neighborhood effects are likely to confound the individual effects, due to such factors as differential availability of health services and differential neighborhood culture. The approaches can be grouped into three broad classes: ordinary logistic regression for survey data, with either no effect or a fixed effect for each cluster; conditional logistic regression extended for survey data; and generalized linear mixed model (GLMM) regression for survey data. Standard use of GLMMs with small clusters fails to adjust for confounding by cluster (e.g. neighborhood); this motivated us to develop an adaptation. We use theory, simulation, and analyses of the NIIIS data to compare and contrast all of these methods. One conclusion is that all of the methods perform poorly when the sampling bias is strong; more research and new methods are clearly needed. Copyright (C) 2010 John Wiley & Sons, Ltd.

引用

页码：1890 / 1899

页数：10

共 21 条

[1]

Agresti A, 2013, Categorical data analysis, V3rd

[2]

[Anonymous], 1999, Analysis of Health Surveys

[3]

[Anonymous], 1989, Analysis of Complex Surveys

[4] Separation of individual-level and cluster-level covariate effects in regression analysis of correlated data [J].

Begg, MD ;

Parides, MK .

STATISTICS IN MEDICINE, 2003, 22 (16) :2591-2602

[5] An empirical comparison of several clustered data approaches under confounding due to cluster effects in the analysis of complications of coronary angioplasty [J].

Berlin, JA ;

Kimmel, SE ;

Ten Have, TR ;

Sammel, MD .

BIOMETRICS, 1999, 55 (02) :470-476

[6]

BOTMAN SL, 2000, NATL CTR HLTH STAT V, V2, P1

[7] Conditional generalized estimating equations for the analysis of clustered and longitudinal data [J].

Goetgeluk, Sylvie ;

Vansteelandt, Stijn .

BIOMETRICS, 2008, 64 (03) :772-780

[8] A review of multilevel theory for ecologic analyses [J].

Greenland, S .

STATISTICS IN MEDICINE, 2002, 21 (03) :389-395

[9] Invited commentary: Variable selection versus shrinkage in the control of multiple confounders [J].

Greenland, Sander .

AMERICAN JOURNAL OF EPIDEMIOLOGY, 2008, 167 (05) :523-529

[10]

Grilli L., 2009, Measurement error in multilevel models with sample cluster means

← 1 2 3 →