Raking and regression calibration: Methods to address bias from correlated covariate and time-to-event error

Cited by: 12
Authors
Oh, Eric J. [1]
Shepherd, Bryan E. [2]
Lumley, Thomas [3]
Shaw, Pamela A. [1]
Affiliations
[1] Univ Penn, Dept Biostat Epidemiol & Informat, Philadelphia, PA 19104 USA
[2] Vanderbilt Univ, Dept Biostat, 221 Kirkland Hall, Nashville, TN 37235 USA
[3] Univ Auckland, Dept Stat, Auckland, New Zealand
Funding
US National Institutes of Health
Keywords
calibration; electronic health records; measurement error; misclassification; raking; survival analysis; proportional hazards models; 2-phase stratified samples; semiparametric models; weighted likelihood; estimators
DOI
10.1002/sim.8793
CLC Number
Q [Biological Sciences]
Subject Classification Code
07; 0710; 09
Abstract
Medical studies that depend on electronic health records (EHR) data are often subject to measurement error, as the data are not collected to support research questions under study. These data errors, if not accounted for in study analyses, can obscure or cause spurious associations between patient exposures and disease risk. Methodology to address covariate measurement error has been well developed; time-to-event error has also been shown to cause significant bias, yet methods to address it remain relatively underdeveloped. More generally, it is possible to observe errors in both the covariate and the time-to-event outcome that are correlated. We propose regression calibration (RC) estimators to simultaneously address correlated error in the covariates and the censored event time. Although RC can perform well in many settings with covariate measurement error, it is biased for nonlinear regression models, such as the Cox model. Thus, we additionally propose raking estimators which are consistent estimators of the parameter defined by the population estimating equation. Raking can improve upon RC in certain settings with failure-time data, requires no explicit modeling of the error structure, and can be utilized under outcome-dependent sampling designs. We discuss features of the underlying estimation problem that affect the degree of improvement the raking estimator has over the RC approach. Detailed simulation studies are presented to examine the performance of the proposed estimators under varying levels of signal, error, and censoring. The methodology is illustrated on observational EHR data on HIV outcomes from the Vanderbilt Comprehensive Care Clinic.
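As a rough illustration of the regression calibration idea mentioned in the abstract, the sketch below simulates a much simpler setting than the one the paper studies: a linear outcome model with classical error in the covariate only, and a random validation subset where the true covariate is known. It is not the authors' estimator (which targets correlated covariate and censored event-time error in the Cox model). The function names `ols` and `simulate`, the sample sizes, and the error variances are all assumptions chosen for illustration.

```python
import random


def ols(x, y):
    """Return (intercept, slope) of the least-squares line y ~ x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((a - mean_x) ** 2 for a in x)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope


def simulate(n=5000, n_val=500, beta=1.0, seed=1):
    """Compare the naive and regression-calibration slopes (hypothetical setup)."""
    rng = random.Random(seed)
    x_true = [rng.gauss(0, 1) for _ in range(n)]
    # Error-prone covariate available for everyone (classical additive error).
    x_obs = [x + rng.gauss(0, 1) for x in x_true]
    # Outcome depends on the true covariate; the outcome itself is error-free here.
    y = [beta * x + rng.gauss(0, 1) for x in x_true]

    # Naive analysis: regress the outcome on the error-prone covariate.
    _, naive_slope = ols(x_obs, y)

    # Validation subset: a simple random sample where x_true is also observed.
    val = rng.sample(range(n), n_val)

    # Calibration model: estimate E[x_true | x_obs] from the validation subset.
    a, b = ols([x_obs[i] for i in val], [x_true[i] for i in val])

    # Replace the error-prone covariate by its calibrated prediction for everyone.
    x_hat = [a + b * w for w in x_obs]
    _, rc_slope = ols(x_hat, y)
    return naive_slope, rc_slope


naive, rc = simulate()
print(f"naive slope: {naive:.3f}, regression-calibration slope: {rc:.3f}")
```

In this toy setup the naive slope is attenuated toward zero, while the calibrated slope should land much closer to the simulated value of `beta`. As the abstract notes, this correction is only approximate for nonlinear models such as the Cox model, which is part of the motivation for the raking estimators.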
Pages: 631-649
Page count: 19
Related papers
12 in total
  • [1] Improved generalized raking estimators to address dependent covariate and failure-time outcome error
    Oh, Eric J.
    Shepherd, Bryan E.
    Lumley, Thomas
    Shaw, Pamela A.
    BIOMETRICAL JOURNAL, 2021, 63 (05) : 1006 - 1027
  • [2] Considerations for analysis of time-to-event outcomes measured with error: Bias and correction with SIMEX
    Oh, Eric J.
    Shepherd, Bryan E.
    Lumley, Thomas
    Shaw, Pamela A.
    STATISTICS IN MEDICINE, 2018, 37 (08) : 1276 - 1289
  • [3] Adjusted regression estimation for time-to-event data with differential measurement error
    Yu, Menggang
    BIOMETRIKA, 2013, 100 (03) : 757 - 763
  • [4] Semiparametric Modeling of Longitudinal Measurements and Time-to-Event Data-A Two-Stage Regression Calibration Approach
    Ye, Wen
    Lin, Xihong
    Taylor, Jeremy M. G.
    BIOMETRICS, 2008, 64 (04) : 1238 - 1246
  • [5] Statistical methods for the time-to-event analysis of individual participant data from multiple epidemiological studies
    Thompson, Simon
    Kaptoge, Stephen
    White, Ian
    Wood, Angela
    Perry, Philip
    Danesh, John
    INTERNATIONAL JOURNAL OF EPIDEMIOLOGY, 2010, 39 (05) : 1345 - 1359
  • [6] Comparison of a time-varying covariate model and a joint model of time-to-event outcomes in the presence of measurement error and interval censoring: application to kidney transplantation
    Campbell, Kristen R.
    Juarez-Colunga, Elizabeth
    Grunwald, Gary K.
    Cooper, James
    Davis, Scott
    Gralla, Jane
    BMC MEDICAL RESEARCH METHODOLOGY, 2019, 19 (1)
  • [8] Minimizing confounding in comparative observational studies with time-to-event outcomes: An extensive comparison of covariate balancing methods using Monte Carlo simulation
    Cafri, Guy
    Fortin, Stephen
    Austin, Peter C.
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2024, 33 (08) : 1437 - 1460
  • [9] Estimating a time-to-event distribution from right-truncated data in an epidemic: A review of methods
    Seaman, Shaun R.
    Presanis, Anne
    Jackson, Christopher
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2022, 31 (09) : 1641 - 1655
  • [10] Review of Statistical Methods for Evaluating the Performance of Survival or Other Time-to-Event Prediction Models (from Conventional to Deep Learning Approaches)
    Park, Seo Young
    Park, Ji Eun
    Kim, Hyungjin
    Park, Seong Ho
    KOREAN JOURNAL OF RADIOLOGY, 2021, 22 (10) : 1697 - 1707