Secondary outcome analysis for data from an outcome-dependent sampling design

被引:4
作者
Pan, Yinghao [1 ]
Cai, Jianwen [1 ]
Longnecker, Matthew P. [2 ]
Zhou, Haibo [1 ]
机构
[1] Univ North Carolina Chapel Hill, Dept Biostat, Chapel Hill, NC 27599 USA
[2] NIEHS, Epidemiol Branch, Res Triangle Pk, NC 27709 USA
关键词
biased sampling; estimating equation; missing data; secondary analysis; semiparametric estimation; validation sample; EMPIRICAL LIKELIHOOD METHOD; CASE-COHORT; POLYCHLORINATED-BIPHENYLS; ESTIMATING EQUATIONS; GESTATIONAL-AGE; BIRTH-WEIGHT; REGRESSION; MODELS;
D O I
10.1002/sim.7672
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Outcome-dependent sampling (ODS) scheme is a cost-effective way to conduct a study. For a study with continuous primary outcome, an ODS scheme can be implemented where the expensive exposure is only measured on a simple random sample and supplemental samples selected from 2 tails of the primary outcome variable. With the tremendous cost invested in collecting the primary exposure information, investigators often would like to use the available data to study the relationship between a secondary outcome and the obtained exposure variable. This is referred as secondary analysis. Secondary analysis in ODS designs can be tricky, as the ODS sample is not a random sample from the general population. In this article, we use the inverse probability weighted and augmented inverse probability weighted estimating equations to analyze the secondary outcome for data obtained from the ODS design. We do not make any parametric assumptions on the primary and secondary outcome and only specify the form of the regression mean models, thus allow an arbitrary error distribution. Our approach is robust to second- and higher-order moment misspecification. It also leads to more precise estimates of the parameters by effectively using all the available participants. Through simulation studies, we show that the proposed estimator is consistent and asymptotically normal. Data from the Collaborative Perinatal Project are analyzed to illustrate our method.
引用
收藏
页码:2321 / 2337
页数:17
相关论文
共 35 条
[1]  
CORNFIELD J, 1951, J NATL CANCER I, V11, P1269
[2]   FISH CONSUMPTION AND REPRODUCTIVE OUTCOMES IN GREEN-BAY, WISCONSIN [J].
DAR, E ;
KANAREK, MS ;
ANDERSON, HA ;
SONZOGNI, WC .
ENVIRONMENTAL RESEARCH, 1992, 59 (01) :189-201
[3]   PRENATAL EXPOSURE TO POLYCHLORINATED-BIPHENYLS - EFFECTS ON BIRTH SIZE AND GESTATIONAL-AGE [J].
FEIN, GG ;
JACOBSON, JL ;
JACOBSON, SW ;
SCHWARTZ, PM ;
DOWLER, JK .
JOURNAL OF PEDIATRICS, 1984, 105 (02) :315-320
[4]   UNIQUE CONSISTENT SOLUTION TO LIKELIHOOD EQUATIONS [J].
FOUTZ, RV .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1977, 72 (357) :147-148
[5]   Birthweight in a fishing community: Significance of essential fatty acids and marine food contaminants [J].
Grandjean, P ;
Bjerve, KS ;
Weihe, P ;
Steuerwald, U .
INTERNATIONAL JOURNAL OF EPIDEMIOLOGY, 2001, 30 (06) :1272-1278
[6]   A GENERALIZATION OF SAMPLING WITHOUT REPLACEMENT FROM A FINITE UNIVERSE [J].
HORVITZ, DG ;
THOMPSON, DJ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1952, 47 (260) :663-685
[7]   Secondary analysis of case-control data [J].
Jiang, YN ;
Scott, AJ ;
Wild, CJ .
STATISTICS IN MEDICINE, 2006, 25 (08) :1323-1339
[8]  
Karmaus Wilfried, 2004, Environ Health, V3, P1, DOI 10.1186/1476-069X-3-1
[9]   Additive hazards regression for case-cohort studies [J].
Kulich, M ;
Lin, DY .
BIOMETRIKA, 2000, 87 (01) :73-87
[10]  
Lee AJ, 1997, STAT MED, V16, P1377, DOI 10.1002/(SICI)1097-0258(19970630)16:12<1377::AID-SIM557>3.0.CO