Bias and efficiency loss due to misclassified responses in binary regression

被引:137
|
作者
Neuhaus, JM [1 ]
机构
[1] Univ Calif San Francisco, Dept Epidemiol & Biostat, San Francisco, CA 94143 USA
基金
美国国家卫生研究院;
关键词
asymptotic relative efficiency; attenuation; generalised linear model; misspecified link function; misspecified model;
D O I
10.1093/biomet/86.4.843
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Methods that ignore errors in binary responses yield biased estimators of the associations of covariates with response. This paper derives general expressions for the magnitude of the bias due to errors in the response and shows that, unless both the sensitivity and specificity are very high, ignoring errors in the responses will yield highly biased covariate effect estimators. When the true, error-free response follows a generalised linear model and misclassification probabilities are known and independent of covariate values, responses observed with error also follow such a model with a modified link function. We describe a simple method to obtain consistent estimators of covariate effects and associated errors in this case, and derive an expression for the asymptotic relative efficiency of covariate effect estimators from the correct likelihood for the responses with errors with respect to estimates based on the true, error-free responses. This expression shows that errors in the response can lead to substantial losses of information about covariate effects. Data from a study on infection with human papilloma virus among women and simulation studies motivate this work and illustrate the findings.
引用
收藏
页码:843 / 855
页数:13
相关论文
共 50 条
  • [1] NONPARAMETRIC REGRESSION ESTIMATES USING MISCLASSIFIED BINARY RESPONSES
    CHU, CK
    CHENG, KF
    BIOMETRIKA, 1995, 82 (02) : 315 - 325
  • [2] Inference on regression model with misclassified binary response
    Chatterjee, Arindam
    Bandyopadhyay, Tathagata
    Bhattacharya, Ayoushman
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2024, 231
  • [3] Bias and efficiency loss in regression estimates due to duplicated observations: a Monte Carlo simulation
    Sarracino, Francesco
    Mikucka, Malgorzata
    SURVEY RESEARCH METHODS, 2017, 11 (01): : 17 - 44
  • [4] Measurement error model for misclassified binary responses
    Roy, S
    Banerjee, T
    Maiti, T
    STATISTICS IN MEDICINE, 2005, 24 (02) : 269 - 283
  • [5] IDENTIFICATION OF REGRESSION MODELS WITH A MISCLASSIFIED AND ENDOGENOUS BINARY REGRESSOR
    Kasahara, Hiroyuki
    Shimotsu, Katsumi
    ECONOMETRIC THEORY, 2022, 38 (06) : 1117 - 1139
  • [6] Regression analysis for differentially misclassified correlated binary outcomes
    Tang, Li
    Lyles, Robert H.
    King, Caroline C.
    Hogan, Joseph W.
    Lo, Yungtai
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2015, 64 (03) : 433 - 449
  • [7] Binary regression with differentially misclassified response and exposure variables
    Tang, Li
    Lyles, Robert H.
    King, Caroline C.
    Celentano, David D.
    Lo, Yungtai
    STATISTICS IN MEDICINE, 2015, 34 (09) : 1605 - 1620
  • [8] Bias and efficiency loss due to categorizing an explanatory variable
    Taylor, JMG
    Yu, MG
    JOURNAL OF MULTIVARIATE ANALYSIS, 2002, 83 (01) : 248 - 263
  • [9] Marginal methods for correlated binary data with misclassified responses
    Chen, Zhijian
    Yi, Grace Y.
    Wu, Changbao
    BIOMETRIKA, 2011, 98 (03) : 647 - 662
  • [10] Reducing bias due to misclassified exposures using instrumental variables
    Manuel, Christopher
    Sinha, Samiran
    Wang, Suojin
    CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2023, 51 (02): : 503 - 530