Patient Subtyping via Time-Aware LSTM Networks

被引:423
作者
Baytas, Inci M. [1 ]
Xiao, Cao [2 ]
Zhang, Xi [3 ]
Wang, Fei [3 ]
Jain, Anil K. [1 ]
Zhou, Jiayu [1 ]
机构
[1] Michigan State Univ, Comp Sci & Engn, 428 S Shaw Ln, E Lansing, MI 48824 USA
[2] IBM TJ Watson Res Ctr, Ctr Computat Hlth, 1101 Kitchawan Rd, Yorktown Hts, NY 10598 USA
[3] Cornell Univ, Weill Cornell Med Sch, Healthcare Policy & Res, New York, NY 10065 USA
来源
KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING | 2017年
基金
美国国家科学基金会;
关键词
Patient subtyping; Recurrent Neural Network; Long-Short Term Memory;
D O I
10.1145/3097983.3097997
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the study of various diseases, heterogeneity among patients usually leads to different progression patterns and may require different types of therapeutic intervention. Therefore, it is important to study patient subtyping, which is grouping of patients into disease characterizing subtypes. Subtyping from complex patient data is challenging because of the information heterogeneity and temporal dynamics. Long-Short Term Memory (LSTM) has been successfully used in many domains for processing sequential data, and recently applied for analyzing longitudinal patient records. The LSTM units are designed to handle data with constant elapsed times between consecutive elements of a sequence. Given that time lapse between successive elements in patient records can vary from days to months, the design of traditional LSTM may lead to suboptimal performance. In this paper, we propose a novel LSTM unit called Time-Aware LSTM (T-LSTM) to handle irregular time intervals in longitudinal patient records. We learn a subspace decomposition of the cell memory which enables time decay to discount the memory content according to the elapsed time. We propose a patient subtyping model that leverages the proposed T-LSTM in an auto-encoder to learn a powerful single representation for sequential records of patients, which are then used to cluster patients into clinical subtypes. Experiments on synthetic and real world datasets show that the proposed T-LSTM architecture captures the underlying structures in the sequences with time irregularities.
引用
收藏
页码:65 / 74
页数:10
相关论文
共 32 条
[1]  
[Anonymous], 2017, SDM
[2]  
[Anonymous], ARXIV12065538V3CSLG
[3]  
[Anonymous], ARXIV160800570CSCY
[4]  
[Anonymous], 2017, HLTH IT DASHBOARD
[5]   LEARNING LONG-TERM DEPENDENCIES WITH GRADIENT DESCENT IS DIFFICULT [J].
BENGIO, Y ;
SIMARD, P ;
FRASCONI, P .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (02) :157-166
[6]   Recurrent Neural Networks for Multivariate Time Series with Missing Values [J].
Che, Zhengping ;
Purushotham, Sanjay ;
Cho, Kyunghyun ;
Sontag, David ;
Liu, Yan .
SCIENTIFIC REPORTS, 2018, 8
[7]   Deep Computational Phenotyping [J].
Che, Zhengping ;
Kale, David ;
Li, Wenzhe ;
Bahadori, Mohammad Taha ;
Liu, Yan .
KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2015, :507-516
[8]  
Cho K, 2014, ARXIV14061078, P1724, DOI [DOI 10.3115/V1/D14-1179, 10.3115/V1/D14-1179]
[9]   Multi-layer Representation Learning for Medical Concepts [J].
Choi, Edward ;
Bahadori, Mohammad Taha ;
Searles, Elizabeth ;
Coffey, Catherine ;
Thompson, Michael ;
Bost, James ;
Tejedor-Sojo, Javier ;
Sun, Jimeng .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :1495-1504
[10]  
Choi Edward, 2016, ARXIV151105942V11CSL