A general regression framework for a secondary outcome in case-control studies

被引:35
作者
Tchetgen, Eric J. Tchetgen [1 ,2 ]
机构
[1] Harvard Univ, Sch Publ Hlth, Dept Biostat, Boston, MA 02115 USA
[2] Harvard Univ, Sch Publ Hlth, Dept Epidemiol, Boston, MA 02115 USA
关键词
Case-control studies; Generalized linear models; Statistical genetics; Secondary outcomes; LOGISTIC-REGRESSION; COMMON VARIANTS; PHENOTYPE; HEIGHT;
D O I
10.1093/biostatistics/kxt041
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Modern case-control studies typically involve the collection of data on a large number of outcomes, often at considerable logistical and monetary expense. These data are of potentially great value to subsequent researchers, who, although not necessarily concerned with the disease that defined the case series in the original study, may want to use the available information for a regression analysis involving a secondary outcome. Because cases and controls are selected with unequal probability, regression analysis involving a secondary outcome generally must acknowledge the sampling design. In this paper, the author presents a new framework for the analysis of secondary outcomes in case-control studies. The approach is based on a careful re-parameterization of the conditional model for the secondary outcome given the case-control outcome and regression covariates, in terms of (a) the population regression of interest of the secondary outcome given covariates and (b) the population regression of the case-control outcome on covariates. The error distribution for the secondary outcome given covariates and case-control status is otherwise unrestricted. For a continuous outcome, the approach sometimes reduces to extending model (a) by including a residual of (b) as a covariate. However, the framework is general in the sense that models (a) and (b) can take any functional form, and the methodology allows for an identity, log or logit link function for model (a).
引用
收藏
页码:117 / 128
页数:12
相关论文
共 26 条
[1]  
[Anonymous], TECHNICAL REPORT
[2]   On the semi-parametric efficiency of logistic regression under case-control sampling [J].
Breslow, NE ;
Robins, JM ;
Wellner, JA .
BERNOULLI, 2000, 6 (03) :447-455
[3]   Serniparametric maximum likelihood estimation exploiting gene-environment independence in case-control studies [J].
Chatterjee, N ;
Carroll, RJ .
BIOMETRIKA, 2005, 92 (02) :399-418
[4]   Bias correction to secondary trait analysis with casecontrol design [J].
Chen, Hua Yun ;
Kittles, Rick ;
Zhang, Wei .
STATISTICS IN MEDICINE, 2013, 32 (09) :1494-1508
[5]   Unified Analysis of Secondary Traits in Case-Control Association Studies [J].
Ghosh, Arpita ;
Wright, Fred A. ;
Zou, Fei .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2013, 108 (502) :566-576
[6]   Secondary analysis of case-control data [J].
Jiang, YN ;
Scott, AJ ;
Wild, CJ .
STATISTICS IN MEDICINE, 2006, 25 (08) :1323-1339
[7]  
Lee AJ, 1997, STAT MED, V16, P1377, DOI 10.1002/(SICI)1097-0258(19970630)16:12<1377::AID-SIM557>3.0.CO
[8]  
2-K
[9]   Identification of ten loci associated with height highlights new biological pathways in human growth [J].
Lettre, Guillaume ;
Jackson, Anne U. ;
Gieger, Christian ;
Schumacher, Fredrick R. ;
Berndt, Sonja I. ;
Sanna, Serena ;
Eyheramendy, Susana ;
Voight, Benjamin F. ;
Butler, Johannah L. ;
Guiducci, Candace ;
Illig, Thomas ;
Hackett, Rachel ;
Heid, Iris M. ;
Jacobs, Kevin B. ;
Lyssenko, Valeriya ;
Uda, Manuela ;
Boehnke, Michael ;
Chanock, Stephen J. ;
Groop, Leif C. ;
Hu, Frank B. ;
Isomaa, Bo ;
Kraft, Peter ;
Peltonen, Leena ;
Salomaa, Veikko ;
Schlessinger, David ;
Hunter, David J. ;
Hayes, Richard B. ;
Abecasis, Goncalo R. ;
Wichmann, H-Erich ;
Mohlke, Karen L. ;
Hirschhorn, Joel N. .
NATURE GENETICS, 2008, 40 (05) :584-591
[10]   Using Cases to Strengthen Inference on the Association Between Single Nucleotide Polymorphisms and a Secondary Phenotype in Genome-Wide Association Studies [J].
Li, Huilin ;
Gail, Mitchell H. ;
Berndt, Sonja ;
Chatterjee, Nilanjan .
GENETIC EPIDEMIOLOGY, 2010, 34 (05) :427-433