Fitting logistic regression models with contaminated case-control data

被引:1
作者
Cheng, K. F. [1 ]
Chen, L. C.
机构
[1] Natl Cent Univ, Grad Inst Stat, Chungli, Taiwan
[2] Tamkang Univ, Dept Stat, Taipei, Taiwan
关键词
case-control data; contamination; logistic regression; maximum likelihood; misclassification;
D O I
10.1016/j.jspi.2005.07.009
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Errors in measurement frequently occur in observing responses. If case-control data are based on certain reported responses, which may not be the true responses, then we have contaminated case-control data. In this paper, we first show that the ordinary logistic regression analysis based on contaminated case-control data can lead to very serious biased conclusions. This can be concluded from the results of a theoretical argument, one example, and two simulation studies. We next derive the semiparametric maximum likelihood estimate (MLE) of the risk parameter of a logistic regression model when there is a validation subsample. The asymptotic normality of the semiparametric MLE will be shown along with consistent estimate of asymptotic variance. Our example and two simulation studies show these estimates to have reasonable performance under finite sample situations. (c) 2005 Elsevier B.V. All rights reserved.
引用
收藏
页码:4147 / 4160
页数:14
相关论文
共 11 条
[1]   REGRESSION-ANALYSIS WHEN DEPENDENT VARIABLE IS TRUNCATED NORMAL [J].
AMEMIYA, T .
ECONOMETRICA, 1973, 41 (06) :997-1016
[2]   Statistics in epidemiology: The case-control study [J].
Breslow, NE .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1996, 91 (433) :14-28
[3]   CASE-CONTROL STUDIES WITH ERRORS IN COVARIATES [J].
CARROLL, RJ ;
GAIL, MH ;
LUBIN, JH .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1993, 88 (421) :185-199
[4]  
GLOVSKY L, 1964, TRAINING SCH B, V61, P76
[5]   Design of validation studies for estimating the odds ratio of exposure-disease relationships when exposure is misclassified [J].
Holcroft, CA ;
Spiegelman, D .
BIOMETRICS, 1999, 55 (04) :1193-1201
[6]   Case-control studies with contaminated controls [J].
Lancaster, T ;
Imbens, G .
JOURNAL OF ECONOMETRICS, 1996, 71 (1-2) :145-160
[7]  
Paulino CD, 2003, BIOMETRICS, V59, P670
[8]   LOGISTIC DISEASE INCIDENCE MODELS AND CASE-CONTROL STUDIES [J].
PRENTICE, RL ;
PYKE, R .
BIOMETRIKA, 1979, 66 (03) :403-411
[9]   Case-control study of cancer among US army veterans exposed to simian virus 40-contaminated adenovirus vaccine [J].
Rollison, DEM ;
Page, WF ;
Crawford, H ;
Gridley, G ;
Wacholder, S ;
Martin, J ;
Miller, R ;
Engels, EA .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 2004, 160 (04) :317-324
[10]   Maximum likelihood for generalised case-control studies [J].
Scott, AJ ;
Wild, CJ .
JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2001, 96 (01) :3-27