TAPER: Time-Aware Patient EHR Representation

被引:38
作者
Darabi, Sajad [1 ]
Kachuee, Mohammad [1 ]
Fazeli, Shayan [1 ]
Sarrafzadeh, Majid [1 ]
机构
[1] Univ Calif Los Angeles, Comp Sci Dept, Los Angeles, CA 90025 USA
关键词
Medical diagnostic imaging; Task analysis; Medical services; Bit error rate; Natural language processing; Informatics; Electronic health records; representation learning; time-aware; transformer network; patient repre-sentation; MODEL;
D O I
10.1109/JBHI.2020.2984931
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Effective representation learning of electronic health records is a challenging task and is becoming more important as the availability of such data is becoming pervasive. The data contained in these records are irregular and contain multiple modalities such as notes, and medical codes. They are preempted by medical conditions the patient may have, and are typically recorded by medical staff. Accompanying codes are notes containing valuable information about patients beyond the structured information contained in electronic health records. We use transformer networks and the recently proposed BERT language model to embed these data streams into a unified vector representation. The presented approach effectively encodes a patients visit data into a single a distributed representation, which can be used for downstream tasks. Our model demonstrates superior performance and generalization on mortality, readmission and length of stay tasks using the publicly available MIMIC-III ICU dataset.
引用
收藏
页码:3268 / 3275
页数:8
相关论文
共 34 条
[1]  
Alsentzer E., 2019, P 2 CLIN NAT LANG PR, DOI DOI 10.18653/V1/W19-1909
[2]  
[Anonymous], 2011, ARXIV14090473
[3]  
[Anonymous], 2008, P 25 INT C MACHINE L
[4]  
[Anonymous], 2013, NIPS
[5]  
[Anonymous], 2018, P 3 MACH LEARN HEALT
[6]   Improving palliative care with deep learning [J].
Avati, Anand ;
Jung, Kenneth ;
Harman, Stephanie ;
Downing, Lance ;
Ng, Andrew ;
Shah, Nigam H. .
BMC MEDICAL INFORMATICS AND DECISION MAKING, 2018, 18
[7]   EHR phenotyping via jointly embedding medical concepts and words into a unified vector space [J].
Bai, Tian ;
Chanda, Ashis Kumar ;
Egleston, Brian L. ;
Vucetic, Slobodan .
BMC MEDICAL INFORMATICS AND DECISION MAKING, 2018, 18
[8]   A neural probabilistic language model [J].
Bengio, Y ;
Ducharme, R ;
Vincent, P ;
Jauvin, C .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (06) :1137-1155
[9]   Representation Learning: A Review and New Perspectives [J].
Bengio, Yoshua ;
Courville, Aaron ;
Vincent, Pascal .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (08) :1798-1828
[10]  
Cai XR, 2018, PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P3984