Bias correction models for electronic health records data in the presence of non-random sampling

被引:0
|
作者
Kim, Jiyu [1 ]
Anthopolos, Rebecca [1 ]
Zhong, Judy [1 ]
机构
[1] NYU, NYU Grossman Sch Med, Dept Populat Hlth, 180 Madison Ave, New York, NY 10016 USA
基金
美国国家卫生研究院;
关键词
bias correction; EHRs; SNAR; social determinants of health; SELECTION MODELS; POPULATION; INFERENCE;
D O I
10.1093/biomtc/ujae014
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Electronic health records (EHRs) contain rich clinical information for millions of patients and are increasingly used for public health research. However, non-random inclusion of subjects in EHRs can result in selection bias, with factors such as demographics, socioeconomic status, healthcare referral patterns, and underlying health status playing a role. While this issue has been well documented, little work has been done to develop or apply bias-correction methods, often due to the fact that most of these factors are unavailable in EHRs. To address this gap, we propose a series of Heckman type bias correction methods by incorporating social determinants of health selection covariates to model the EHR non-random sampling probability. Through simulations under various settings, we demonstrate the effectiveness of our proposed method in correcting biases in both the association coefficient and the outcome mean. Our method augments the utility of EHRs for public health inferences, as we show by estimating the prevalence of cardiovascular disease and its correlation with risk factors in the New York City network of EHRs.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Addressing Bias from Non-Random Missing Attributes in Health Data
    Napoli, Nicholas J.
    Kotoriy, Madeline E.
    Barnhardt, William
    Young, Jeffrey S.
    Barnes, Laura E.
    2017 IEEE EMBS INTERNATIONAL CONFERENCE ON BIOMEDICAL & HEALTH INFORMATICS (BHI), 2017, : 265 - 268
  • [2] Evaluation of Uplift Models with Non-Random Assignment Bias
    Rafla, Mina
    Voisine, Nicolas
    Cremilleux, Bruno
    ADVANCES IN INTELLIGENT DATA ANALYSIS XX, IDA 2022, 2022, 13205 : 251 - 263
  • [3] Correction of bias from non-random missing longitudinal data using auxiliary information
    Wang, Cuiling
    Hall, Charles B.
    STATISTICS IN MEDICINE, 2010, 29 (06) : 671 - 679
  • [4] Equity and bias in electronic health records data
    Boyd, Andrew D.
    Gonzalez-Guarda, Rosa
    Lawrence, Katharine
    Patil, Crystal L.
    Ezenwa, Miriam O.
    O'Brien, Emily C.
    Paek, Hyung
    Braciszewski, Jordan M.
    Adeyemi, Oluwaseun
    Cuthel, Allison M.
    Darby, Juanita E.
    Zigler, Christina K.
    Ho, P. Michael
    Faurot, Keturah R.
    Staman, Karen
    Leigh, Jonathan W.
    Dailey, Dana L.
    Cheville, Andrea
    Del Fiol, Guilherme
    Knisely, Mitchell R.
    Marsolo, Keith
    Richesson, Rachel L.
    Schlaeger, Judith M.
    CONTEMPORARY CLINICAL TRIALS, 2023, 130
  • [5] A non-random data sampling method for classification model assessment
    Sprevak, D
    Azuaje, F
    Wang, HY
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 3, 2004, : 406 - 409
  • [6] On the Nature of Informative Presence Bias in Analyses of Electronic Health Records
    McGee, Glen
    Haneuse, Sebastien
    Coull, Brent A.
    Weisskopf, Marc G.
    Rotem, Ran S.
    EPIDEMIOLOGY, 2022, 33 (01) : 105 - 113
  • [7] A Quantitative Bias Analysis Approach to Informative Presence Bias in Electronic Health Records
    Zhang, Hanxi
    Clark, Amy S.
    Hubbard, Rebecca A.
    EPIDEMIOLOGY, 2024, 35 (03) : 349 - 358
  • [8] Informative presence bias in analyses of electronic health records-derived data: a cautionary note
    Harton, Joanna
    Mitra, Nandita
    Hubbard, Rebecca A.
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2022, 29 (07) : 1191 - 1199
  • [9] PROBLEMS OF DEFINING QUOTAS IN NON-RANDOM SAMPLING
    DEROO, M
    METRA, 1973, 12 (01): : 141 - 157
  • [10] Non-random sampling for reproductive status induces bias in probabilistic maturation reaction norm midpoints
    Sahashi, Genki
    Morita, Kentaro
    FISHERIES RESEARCH, 2015, 170 : 24 - 29