Bias correction models for electronic health records data in the presence of non-random sampling

被引:0
|
作者
Kim, Jiyu [1 ]
Anthopolos, Rebecca [1 ]
Zhong, Judy [1 ]
机构
[1] NYU, NYU Grossman Sch Med, Dept Populat Hlth, 180 Madison Ave, New York, NY 10016 USA
基金
美国国家卫生研究院;
关键词
bias correction; EHRs; SNAR; social determinants of health; SELECTION MODELS; POPULATION; INFERENCE;
D O I
10.1093/biomtc/ujae014
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Electronic health records (EHRs) contain rich clinical information for millions of patients and are increasingly used for public health research. However, non-random inclusion of subjects in EHRs can result in selection bias, with factors such as demographics, socioeconomic status, healthcare referral patterns, and underlying health status playing a role. While this issue has been well documented, little work has been done to develop or apply bias-correction methods, often due to the fact that most of these factors are unavailable in EHRs. To address this gap, we propose a series of Heckman type bias correction methods by incorporating social determinants of health selection covariates to model the EHR non-random sampling probability. Through simulations under various settings, we demonstrate the effectiveness of our proposed method in correcting biases in both the association coefficient and the outcome mean. Our method augments the utility of EHRs for public health inferences, as we show by estimating the prevalence of cardiovascular disease and its correlation with risk factors in the New York City network of EHRs.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Non-random reflections on health services research
    Normand, C
    HEALTH ECONOMICS, 1998, 7 (03) : 280 - 280
  • [22] Evaluating Alternate Bias Correction Methods in Estimating Diabetes Prevalence with Electronic Health Records
    Conderino, Sarah
    Anthopolos, Rebeccca
    Thorpe, Lorna
    Cai, Bo
    Shao, Hui
    Ong, Toan C.
    Crume, Tessa L.
    Schwartz, Brian S.
    Kirchner, H. Lester
    Rosenman, Marc
    Zhong, Victor W.
    Reynolds, Kristi
    Park, Seho
    Utidjian, Levon H.
    Divers, Jasmin
    DIABETES, 2022, 71
  • [23] Effect of non-random sampling on the estimation of parameters in population genetics
    Tajima, F
    GENETICS RESEARCH, 1995, 66 (03) : 267 - 276
  • [24] NON-RANDOM SAMPLING OF INDIVIDUALS IN CROSS-CULTURAL RESEARCH
    BRISLIN, RW
    BAUMGARD.SR
    JOURNAL OF CROSS-CULTURAL PSYCHOLOGY, 1971, 2 (04) : 397 - 400
  • [25] Non-random sampling leads to biased estimates of transcriptome association
    Foulkes, A. S.
    Balasubramanian, R.
    Qian, J.
    Reilly, M. P.
    SCIENTIFIC REPORTS, 2020, 10 (01)
  • [26] Non-random sampling leads to biased estimates of transcriptome association
    A. S. Foulkes
    R. Balasubramanian
    J. Qian
    M. P. Reilly
    Scientific Reports, 10
  • [27] Bias Associated with Mining Electronic Health Records
    Hripcsak, George
    Knirsch, Charles
    Zhou, Li
    Wilcox, Adam
    Melton, Genevieve B.
    JOURNAL OF BIOMEDICAL DISCOVERY AND COLLABORATION, 2011, 6
  • [28] Non-random Study Attrition: Assessing Correction Techniques and the Magnitude of Bias in a Longitudinal Study of Reentry from Prison
    Meghan M. Mitchell
    Chantal Fahmy
    Kendra J. Clark
    David C. Pyrooz
    Journal of Quantitative Criminology, 2022, 38 : 755 - 790
  • [29] Stratified split sampling of electronic health records
    Tianyao Huo
    Deborah H. Glueck
    Elizabeth A. Shenkman
    Keith E. Muller
    BMC Medical Research Methodology, 23
  • [30] Non-random Study Attrition: Assessing Correction Techniques and the Magnitude of Bias in a Longitudinal Study of Reentry from Prison
    Mitchell, Meghan M.
    Fahmy, Chantal
    Clark, Kendra J.
    Pyrooz, David C.
    JOURNAL OF QUANTITATIVE CRIMINOLOGY, 2022, 38 (03) : 755 - 790