Reporting of demographic data and representativeness in machine learning models using electronic health records

Cited by: 29
Authors
Bozkurt, Selen [1]
Cahan, Eli M. [1, 2]
Seneviratne, Martin G. [1]
Sun, Ran [1]
Lossio-Ventura, Juan A. [1]
Ioannidis, John P. A. [1, 3, 4, 5, 6]
Hernandez-Boussard, Tina [1, 4, 7]
Affiliations
[1] Stanford Univ, Dept Med, Stanford, CA 94306 USA
[2] NYU, Sch Med, New York, NY USA
[3] Stanford Univ, Sch Med, Dept Epidemiol & Populat Hlth, Stanford, CA 94306 USA
[4] Stanford Univ, Dept Biomed Data Sci, Stanford, CA 94306 USA
[5] Stanford Univ, Dept Stat, Stanford, CA 94306 USA
[6] Stanford Univ, Metares Innovat Ctr Stanford, Stanford, CA 94306 USA
[7] Stanford Univ, Dept Surg, Stanford, CA 94306 USA
Keywords
demographic data; machine learning; electronic health record; clinical decision support; bias; transparency; PREDICTION; RISK; BIAS
DOI
10.1093/jamia/ocaa164
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Objective: The development of machine learning (ML) algorithms to address a variety of issues faced in clinical practice has increased rapidly. However, questions have arisen regarding biases in their development that can affect their applicability in specific populations. We sought to evaluate whether studies developing ML models from electronic health record (EHR) data report sufficient demographic data on the study populations to demonstrate representativeness and reproducibility.
Materials and Methods: We searched PubMed for articles applying ML models to improve clinical decision-making using EHR data. We limited our search to papers published between 2015 and 2019.
Results: Across the 164 studies reviewed, demographic variables were inconsistently reported and/or included as model inputs. Race/ethnicity was not reported in 64% of studies; gender and age were not reported in 24% and 21% of studies, respectively. Socioeconomic status of the population was not reported in 92% of studies. Studies that mentioned these variables often did not report whether they were included as model inputs. Few models (12%) were validated using external populations. Few studies (17%) open-sourced their code. Populations in the ML studies included higher proportions of White and Black subjects, yet fewer Hispanic subjects, than the general US population.
Discussion: The demographic characteristics of study populations are poorly reported in the ML literature based on EHR data. Demographic representativeness in training data and model transparency are necessary to ensure that ML models are deployed in an equitable and reproducible manner. Wider adoption of reporting guidelines is warranted to improve representativeness and reproducibility.
Pages: 1878-1884
Page count: 7
Related papers
50 in total
  • [21] Benchmarking emergency department prediction models with machine learning and public electronic health records
    Feng Xie
    Jun Zhou
    Jin Wee Lee
    Mingrui Tan
    Siqi Li
    Logasan S/O Rajnthern
    Marcel Lucas Chee
    Bibhas Chakraborty
    An-Kwok Ian Wong
    Alon Dagan
    Marcus Eng Hock Ong
    Fei Gao
    Nan Liu
    Scientific Data, 9
  • [22] Benchmarking emergency department prediction models with machine learning and public electronic health records
    Xie, Feng
    Zhou, Jun
    Lee, Jin Wee
    Tan, Mingrui
    Li, Siqi
    Rajnthern, Logasan S. O.
    Chee, Marcel Lucas
    Chakraborty, Bibhas
    Wong, An-Kwok Ian
    Dagan, Alon
    Ong, Marcus Eng Hock
    Gao, Fei
    Liu, Nan
    SCIENTIFIC DATA, 2022, 9 (01)
  • [23] Applying Machine Learning Models to Electronic Health Records for Chronic Disease Diagnosis in Kuwait
    Alenezi, Talal M.
    Sulaiman, Taiseer H.
    Abdelaziz, Amr M.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (11) : 667 - 676
  • [24] Opportunities and challenges in developing deep learning models using electronic health records data: a systematic review
    Xiao, Cao
    Choi, Edward
    Sun, Jimeng
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2018, 25 (10) : 1419 - 1428
  • [25] Machine learning models for atrial fibrillation detection in primary care using electronic health records: systematic review
    Chalati, Mhd Diaa
    Shirvankar, Chetan
    Rahimi, Samira
    ANNALS OF FAMILY MEDICINE, 2024, 22
  • [26] Machine learning models to detect and predict patient safety events using electronic health records: A systematic review
    Deimazar, Ghasem
    Sheikhtaheri, Abbas
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2023, 180
  • [27] Machine Learning and Electronic Health Records: A Paradigm Shift
    Adkins, Daniel E.
    AMERICAN JOURNAL OF PSYCHIATRY, 2017, 174 (02) : 93 - 94
  • [28] Postoperative delirium prediction using machine learning models and preoperative electronic health record data
    Andrew Bishara
    Catherine Chiu
    Elizabeth L. Whitlock
    Vanja C. Douglas
    Sei Lee
    Atul J. Butte
    Jacqueline M. Leung
    Anne L. Donovan
    BMC Anesthesiology, 22
  • [29] Individualized melanoma risk prediction using machine learning with electronic health records
    Wan, G.
    Nguyen, N.
    Yan, B.
    Khattab, S.
    Estiri, H.
    Semenov, Y.
    JOURNAL OF INVESTIGATIVE DERMATOLOGY, 2024, 144 (08) : S35 - S35
  • [30] Postoperative delirium prediction using machine learning models and preoperative electronic health record data
    Bishara, Andrew
    Chiu, Catherine
    Whitlock, Elizabeth L.
    Douglas, Vanja C.
    Lee, Sei
    Butte, Atul J.
    Leung, Jacqueline M.
    Donovan, Anne L.
    BMC ANESTHESIOLOGY, 2022, 22 (01)