Scalable and accurate deep learning with electronic health records

被引:1335
作者
Rajkomar, Alvin [1 ,2 ]
Oren, Eyal [1 ]
Chen, Kai [1 ]
Dai, Andrew M. [1 ]
Hajaj, Nissan [1 ]
Hardt, Michaela [1 ]
Liu, Peter J. [1 ]
Liu, Xiaobing [1 ]
Marcus, Jake [1 ]
Sun, Mimi [1 ]
Sundberg, Patrik [1 ]
Yee, Hector [1 ]
Zhang, Kun [1 ]
Zhang, Yi [1 ]
Flores, Gerardo [1 ]
Duggan, Gavin E. [1 ]
Irvine, Jamie [1 ]
Quoc Le [1 ]
Litsch, Kurt [1 ]
Mossin, Alexander [1 ]
Tansuwan, Justin [1 ]
Wang, De [1 ]
Wexler, James [1 ]
Wilson, Jimbo [1 ]
Ludwig, Dana [2 ]
Volchenboum, Samuel L. [3 ]
Chou, Katherine [1 ]
Pearson, Michael [1 ]
Madabushi, Srinivasan [1 ]
Shah, Nigam H. [4 ]
Butte, Atul J. [2 ]
Howell, Michael D. [1 ]
Cui, Claire [1 ]
Corrado, Greg S. [1 ]
Dean, Jeffrey [1 ]
机构
[1] Google Inc, Mountain View, CA 94043 USA
[2] Univ Calif San Francisco, San Francisco, CA 94143 USA
[3] Univ Chicago Med, Chicago, IL USA
[4] Stanford Univ, Stanford, CA 94305 USA
来源
NPJ DIGITAL MEDICINE | 2018年 / 1卷
关键词
RISK PREDICTION MODELS; EARLY WARNING SCORE; BIG DATA; HOSPITAL READMISSION; MEDICAL-RECORDS; VALIDATION; CARE; INPATIENT; ANALYTICS; PATIENT;
D O I
10.1038/s41746-018-0029-1
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Predictive modeling with electronic health record (EHR) data is anticipated to drive personalized medicine and improve healthcare quality. Constructing predictive statistical models typically requires extraction of curated predictor variables from normalized EHR data, a labor-intensive process that discards the vast majority of information in each patient's record. We propose a representation of patients' entire raw EHR records based on the Fast Healthcare Interoperability Resources (FHIR) format. We demonstrate that deep learning methods using this representation are capable of accurately predicting multiple medical events from multiple centers without site-specific data harmonization. We validated our approach using de-identified EHR data from two US academic medical centers with 216,221 adult patients hospitalized for at least 24 h. In the sequential format we propose, this volume of EHR data unrolled into a total of 46,864,534,945 data points, including clinical notes. Deep learning models achieved high accuracy for tasks such as predicting: in-hospital mortality (area under the receiver operator curve [AUROC] across sites 0.93-0.94), 30-day unplanned readmission (AUROC 0.75-0.76), prolonged length of stay (AUROC 0.85-0.86), and all of a patient's final discharge diagnoses (frequency-weighted AUROC 0.90). These models outperformed traditional, clinically-used predictive models in all cases. We believe that this approach can be used to create accurate and scalable predictions for a variety of clinical scenarios. In a case study of a particular prediction, we demonstrate that neural networks can be used to identify relevant information from the patient's chart.
引用
收藏
页数:10
相关论文
共 50 条
[41]   An overview of electronic personal health records [J].
Alsahafi, A. Yaser A. ;
Gay, B. Valerie .
HEALTH POLICY AND TECHNOLOGY, 2018, 7 (04) :427-432
[42]   The unfulfilled promises of electronic health records [J].
Looi, Jeffrey C. L. ;
Kisely, Steve ;
Allison, Stephen ;
Bastiampillai, Tarun ;
Maguire, Paul A. .
AUSTRALIAN HEALTH REVIEW, 2023, 47 (06) :744-746
[43]   Barriers to Implement Electronic Health Records [J].
Carlos, Maldonado ;
Ore Sussy, Bayona .
2016 11TH IBERIAN CONFERENCE ON INFORMATION SYSTEMS AND TECHNOLOGIES (CISTI), 2016,
[44]   Electronic Health Records and Ambulatory Quality [J].
Berger, Zackary D. .
JOURNAL OF GENERAL INTERNAL MEDICINE, 2013, 28 (09) :1132-1132
[45]   Blockchain: A Panacea for Electronic Health Records? [J].
Kassab, Mohamad ;
DeFranco, Joanna ;
Malas, Tarek ;
Graciano Neto, Valdemar Vicente ;
Destefanis, Giuseppe .
2019 IEEE/ACM 1ST INTERNATIONAL WORKSHOP ON SOFTWARE ENGINEERING FOR HEALTHCARE (SEH 2019), 2019, :21-24
[46]   Leveraging Large-Scale Electronic Health Records and Interpretable Machine Learning for Clinical Decision Making at the Emergency Department: Protocol for System Development and Validation [J].
Liu, Nan ;
Xie, Feng ;
Siddiqui, Fahad Javaid ;
Ho, Andrew Fu Wah ;
Chakraborty, Bibhas ;
Nadarajan, Gayathri Devi ;
Tan, Kenneth Boon Kiat ;
Ong, Marcus Eng Hock .
JMIR RESEARCH PROTOCOLS, 2022, 11 (03)
[47]   Safeguarding Confidentiality in Electronic Health Records [J].
Shenoy, Akhil ;
Appel, Jacob M. .
CAMBRIDGE QUARTERLY OF HEALTHCARE ETHICS, 2017, 26 (02) :337-341
[48]   Electronic health records and biomedical research [J].
Daniel, Christel ;
Jais, Jean-Philippe ;
El Fadly, Naji ;
Landais, Paul .
PRESSE MEDICALE, 2009, 38 (10) :1468-1475
[49]   Predictability Bounds of Electronic Health Records [J].
Dahlem, Dominik ;
Maniloff, Diego ;
Ratti, Carlo .
SCIENTIFIC REPORTS, 2015, 5
[50]   Predicting the Risk of Diabetes in Big Data Electronic Health Records by using Scalable Random Forest Classification Algorithm [J].
Rallapalli, Sreekanth ;
Suryakanthi, T. .
2016 THIRD INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATION AND ENGINEERING (ICACCE 2016), 2016, :281-284