A Temporal Pattern Mining Approach for Classifying Electronic Health Record Data

Cited by: 93
Authors
Batal, Iyad [1]
Valizadegan, Hamed [1]
Cooper, Gregory F. [1]
Hauskrecht, Milos [1]
Affiliations
[1] Univ Pittsburgh, Pittsburgh, PA 15260 USA
Funding
National Science Foundation (USA)
Keywords
Algorithms; Experimentation; Performance; Temporal pattern mining; multivariate time series; temporal abstractions; time-interval patterns; classification; DISCOVERY; CLASSIFICATION; ALGORITHM; SERIES;
DOI
10.1145/2508037.2508044
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We study the problem of learning classification models from complex multivariate temporal data encountered in electronic health record systems. The challenge is to define a good set of features that are able to represent well the temporal aspect of the data. Our method relies on temporal abstractions and temporal pattern mining to extract the classification features. Temporal pattern mining usually returns a large number of temporal patterns, most of which may be irrelevant to the classification task. To address this problem, we present the Minimal Predictive Temporal Patterns framework to generate a small set of predictive and nonspurious patterns. We apply our approach to the real-world clinical task of predicting patients who are at risk of developing heparin-induced thrombocytopenia. The results demonstrate the benefit of our approach in efficiently learning accurate classifiers, which is a key step for developing intelligent clinical monitoring systems.
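As an illustration of the temporal abstraction step mentioned in the abstract, the following is a minimal sketch, not the authors' implementation. It assumes a simple value abstraction that maps each numeric sample to a Low/Normal/High state and merges consecutive samples with the same state into one interval. The function name `abstract_values`, the thresholds, and the platelet readings are all hypothetical and chosen only for the example.

```python
from typing import List, Tuple

Sample = Tuple[float, float]          # (time, value)
Interval = Tuple[str, float, float]   # (state, start_time, end_time)


def abstract_values(series: List[Sample], low: float, high: float) -> List[Interval]:
    """Collapse time-ordered samples into maximal same-state intervals.

    Hypothetical helper: states are "L" (value < low), "H" (value > high),
    and "N" otherwise. The thresholds are assumptions, not values from the paper.
    """
    def state(value: float) -> str:
        if value < low:
            return "L"
        if value > high:
            return "H"
        return "N"

    intervals: List[Interval] = []
    for time, value in series:
        s = state(value)
        if intervals and intervals[-1][0] == s:
            # Same state as the previous sample: extend the current interval.
            intervals[-1] = (s, intervals[-1][1], time)
        else:
            # State changed (or first sample): open a new interval.
            intervals.append((s, time, time))
    return intervals


# Made-up platelet counts sampled on days 0..4 (illustrative data only).
platelets = [(0, 250.0), (1, 240.0), (2, 140.0), (3, 120.0), (4, 160.0)]
intervals = abstract_values(platelets, low=150.0, high=400.0)
print(intervals)
```

State intervals of this kind, produced for each clinical variable, would be the input over which time-interval patterns are then mined and turned into classification features.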
Pages: 22