Procedure prediction from symbolic Electronic Health Records via time intervals analytics

被引:20
作者
Moskovitch, Robert [1 ,2 ,3 ,4 ,5 ]
Polubriaginof, Fernanda [1 ,2 ,3 ,4 ]
Weiss, Aviram [5 ]
Ryan, Patrick [1 ]
Tatonetti, Nicholas [1 ,2 ,3 ,4 ]
机构
[1] Columbia Univ, Dept Biomed Informat, New York, NY 10027 USA
[2] Columbia Univ, Dept Syst Biol, New York, NY 10027 USA
[3] Columbia Univ, Dept Med, New York, NY 10027 USA
[4] OHDSI, New York, NY USA
[5] Ben Gurion Univ Negev, Dept Software & Informat Syst Engn, Beer Sheva, Israel
关键词
Temporal data mining; Time intervals mining; Prediction; Electronic Health Records; TEMPORAL REPRESENTATION; EHR DATA; KNOWLEDGE; ABSTRACTION; ALGORITHM; PATTERNS; MEDICINE; SERIES; RULES;
D O I
10.1016/j.jbi.2017.07.018
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Prediction of medical events, such as clinical procedures, is essential for preventing disease, understanding disease mechanism, and increasing patient quality of care. Although longitudinal clinical data from Electronic Health Records provides opportunities to develop predictive models, the use of these data faces significant challenges. Primarily, while the data are longitudinal and represent thousands of conceptual events having duration, they are also sparse, complicating the application of traditional analysis approaches. Furthermore, the framework presented here takes advantage of the events duration and gaps. International standards for electronic healthcare data represent data elements, such as procedures, conditions, and drug exposures, using eras, or time intervals. Such eras contain both an event and a duration and enable the application of time intervals mining - a relatively new subfield of data mining. In this study, we present Maitreya, a framework for time intervals analytics in longitudinal clinical data. Maitreya discovers frequent time intervals related patterns (TIRPs), which we use as prognostic markers for modelling clinical events. We introduce three novel TIRP metrics that are normalized versions of the horizontal-support, that represents the number of TIRP instances per patient. We evaluate Maitreya on 28 frequent and clinically important procedures, using the three novel TIRP representation metrics in comparison to no temporal repreSentation and previous TIRP5 metrics. We also evaluate the epsilon value that makes Allen's relations more flexible with several settings of 30, 60, 90 and 180 days in comparison to the default zero. For twenty-two of these procedures, the use of temporal patterns as predictors was superior to non-temporal features, and the use of the vertically normalized horizontal support metric to represent TIRPs as features was most effective. The use of the epsilon value with thirty days was slightly better than the zero. (C) 2017 Published by Elsevier Inc.
引用
收藏
页码:70 / 82
页数:13
相关论文
共 62 条
[1]   Temporal representation and reasoning in medicine: Research directions and challenges [J].
Adlassnig, Klaus-Peter ;
Combi, Carlo ;
Das, Amar K. ;
Keravnou, Elpida T. ;
Pozzi, Giuseppe .
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2006, 38 (02) :101-113
[2]   Dynamical Phenotyping: Using Temporal Analysis of Clinically Collected Physiologic Data to Stratify Populations [J].
Albers, D. J. ;
Elhadad, Noemie ;
Tabak, E. ;
Perotte, A. ;
Hripcsak, George .
PLOS ONE, 2014, 9 (06)
[3]   MAINTAINING KNOWLEDGE ABOUT TEMPORAL INTERVALS [J].
ALLEN, JF .
COMMUNICATIONS OF THE ACM, 1983, 26 (11) :832-843
[4]  
Batal I., 2012, P KNOWL DISC DAT MIN
[5]   A Temporal Pattern Mining Approach for Classifying Electronic Health Record Data [J].
Batal, Iyad ;
Valizadegan, Hamed ;
Cooper, Gregory F. ;
Hauskrecht, Milos .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2013, 4 (04)
[6]   Predictive data mining in clinical medicine: Current issues and guidelines [J].
Bellazzi, Riccardo ;
Zupan, Blaz .
INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2008, 77 (02) :81-97
[7]   Systems biology approaches for identifying adverse drug reactions and elucidating their underlying biological mechanisms [J].
Boland, Mary Regina ;
Jacunski, Alexandra ;
Lorberbaum, Tal ;
Romano, Joseph D. ;
Moskovitch, Robert ;
Tatonetti, Nicholas P. .
WILEY INTERDISCIPLINARY REVIEWS-SYSTEMS BIOLOGY AND MEDICINE, 2016, 8 (02) :104-122
[8]   A novel algorithm for mining closed temporal patterns from interval-based data [J].
Chen, Yi-Cheng ;
Weng, Julia Tzu-Ya ;
Hui, Lin .
KNOWLEDGE AND INFORMATION SYSTEMS, 2016, 46 (01) :151-183
[9]   Mining Temporal Patterns in Time Interval-Based Data [J].
Chen, Yi-Cheng ;
Peng, Wen-Chih ;
Lee, Suh-Yin .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (12) :3318-3331
[10]  
Cheng H., 2007, INT C DAT ENG ICDE