Inverse Probability of Treatment Weighting and Confounder Missingness in Electronic Health Record-based Analyses: A Comparison of Approaches Using Plasmode Simulation

被引:1
|
作者
Vader, Daniel T. [1 ]
Mamtani, Ronac [2 ]
Li, Yun [1 ]
Griffith, Sandra D. [3 ]
Calip, Gregory S. [3 ]
Hubbard, Rebecca A. [1 ]
机构
[1] Univ Penn, Dept Biostat Epidemiol & Informat, 3215 Market St, Philadelphia, PA 19104 USA
[2] Univ Penn, Div Hematol & Oncol, Philadelphia, PA 19104 USA
[3] Flatiron Hlth, New York, NY USA
基金
美国国家卫生研究院;
关键词
Comparative effectiveness research; Data quality; Electronic health records; Inverse probability of treatment weighting; Missing data; Propensity scores; Statistical bias; MARGINAL STRUCTURAL MODELS; PROPENSITY SCORE; MEASUREMENT-ERROR; REGRESSION CALIBRATION; LOGISTIC-REGRESSION; SURVIVAL; TIME; CARE;
D O I
10.1097/EDE.0000000000001618
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Background:Electronic health record (EHR) data represent a critical resource for comparative effectiveness research, allowing investigators to study intervention effects in real-world settings with large patient samples. However, high levels of missingness in confounder variables is common, challenging the perceived validity of EHR-based investigations. Methods:We investigated performance of multiple imputation and propensity score (PS) calibration when conducting inverse probability of treatment weights (IPTW)-based comparative effectiveness research using EHR data with missingness in confounder variables and outcome misclassification. Our motivating example compared effectiveness of immunotherapy versus chemotherapy treatment of advanced bladder cancer with missingness in a key prognostic variable. We captured complexity in EHR data structures using a plasmode simulation approach to spike investigator-defined effects into resamples of a cohort of 4361 patients from a nationwide deidentified EHR-derived database. We characterized statistical properties of IPTW hazard ratio estimates when using multiple imputation or PS calibration missingness approaches. Results:Multiple imputation and PS calibration performed similarly, maintaining <= 0.05 absolute bias in the marginal hazard ratio even when >= 50% of subjects had missing at random or missing not at random confounder data. Multiple imputation required greater computational resources, taking nearly 40 times as long as PS calibration to complete. Outcome misclassification minimally increased bias of both methods. Conclusion:Our results support multiple imputation and PS calibration approaches to missingness in missing completely at random or missing at random confounder variables in EHR-based IPTW comparative effectiveness analyses, even with missingness >= 50%. PS calibration represents a computationally efficient alternative to multiple imputation.
引用
收藏
页码:520 / 530
页数:11
相关论文
共 20 条
  • [1] The Impact of Electronic Health Record-Based Simulation During Intern Boot Camp: Interventional Study
    Miller, Matthew E.
    Scholl, Gretchen
    Corby, Sky
    Mohan, Vishnu
    Gold, Jeffrey A.
    JMIR MEDICAL EDUCATION, 2021, 7 (01):
  • [2] Propensity score trimming mitigates bias due to covariate measurement error in inverse probability of treatment weighted analyses: A plasmode simulation
    Conover, Mitchell M.
    Rothman, Kenneth J.
    Sturmer, Til
    Ellis, Alan R.
    Poole, Charles
    Funk, Michele Jonsson
    STATISTICS IN MEDICINE, 2021, 40 (09) : 2101 - 2112
  • [3] Electronic Health Record-Based Registries: Clinical Research Using Registries in Colon and Rectal Surgery
    Carlson, Jacob
    Laryea, Jonathan
    CLINICS IN COLON AND RECTAL SURGERY, 2019, 32 (01) : 82 - 90
  • [4] Inverse probability weighting for selection bias in a Delaware community health center electronic medical record study of community deprivation and hepatitis C prevalence
    Goldstein, Neal D.
    Kahal, Deborah
    Testa, Karla
    Burstyn, Igor
    ANNALS OF EPIDEMIOLOGY, 2021, 60 : 1 - 7
  • [5] Inverse Probability of Treatment Weighting (Propensity Score) using the Military Health System Data Repository and National Death Index
    Mitchell, Joshua D.
    Gage, Brian F.
    Fergestrom, Nicole
    Novak, Eric
    Villines, Todd C.
    JOVE-JOURNAL OF VISUALIZED EXPERIMENTS, 2020, (155):
  • [6] Return to Work After Breast Cancer Treatment: An Electronic Health Record-based Study in North Norway
    Nieder, Carsten
    Haukland, Ellinor C.
    Mannsaker, Bard
    ANTICANCER RESEARCH, 2024, 44 (07) : 3193 - 3198
  • [7] Optimizing Antihypertensive Medication Classification in Electronic Health Record-Based Data: Classification System Development and Methodological Comparison
    McDonough, Caitrin W.
    Smith, Steven M.
    Cooper-DeHoff, Rhonda M.
    Hogan, William R.
    JMIR MEDICAL INFORMATICS, 2020, 8 (02)
  • [8] Screening for undiagnosed atrial fibrillation using an electronic health record-based clinical prediction model: clinical pilot implementation initiative
    Grout, Randall W.
    Ateya, Mohammad
    Direnzo, Baely
    Hart, Sara
    King, Chase
    Rajkumar, Joshua
    Sporrer, Susan
    Torabi, Asad
    Walroth, Todd A.
    Kovacs, Richard J.
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2024, 24 (01)
  • [9] Inverse-probability weighting and multiple imputation for evaluating selection bias in the estimation of childhood obesity prevalence using data from electronic health records
    Sayon-Orea, Carmen
    Moreno-Iribas, Conchi
    Delfrade, Josu
    Sanchez-Echenique, Manuela
    Amiano, Pilar
    Ardanaz, Eva
    Gorricho, Javier
    Basterra, Garbine
    Nuin, Marian
    Guevara, Marcela
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2020, 20 (01)
  • [10] Measuring cognitive effort using tabular transformer-based language models of electronic health record-based audit log action sequences
    Kim, Seunghwan
    Warner, Benjamin C.
    Lew, Daphne
    Lou, Sunny S.
    Kannampallil, Thomas
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2024, 31 (10) : 2228 - 2235